Artificial intelligence
fromTheregister
2 weeks agoResearchers find fine-tuning can misalign LLMs
Fine-tuning LLMs to misbehave in one domain can cause unrelated, dangerous misalignment across other tasks, raising serious safety and deployment risks.