Microsoft researchers have developed On-Policy Context Distillation (OPCD), a training method that permanently embeds ...
A team of researchers has found a way to steer the output of large language models by manipulating specific concepts inside these models. The new method could lead to more reliable, more efficient, ...
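Concept-based steering of this kind is commonly realized as an additive intervention on a model's hidden activations: a direction associated with a concept is added to the hidden state at inference time. The following is a minimal illustrative sketch of that idea, not the researchers' actual method; the vectors, the "politeness" concept, and the strength value are all made up for the example.

```python
# Toy sketch of activation steering: nudge a hidden-state vector toward
# a "concept direction". Illustrative only -- not the paper's implementation.

def steer(hidden_state, concept_vector, strength):
    """Add a scaled concept direction to a hidden-state vector."""
    return [h + strength * c for h, c in zip(hidden_state, concept_vector)]

# Hypothetical hidden state and a hypothetical "politeness" direction.
hidden = [0.5, -1.0, 0.25]
politeness = [1.0, 0.0, -0.5]

steered = steer(hidden, politeness, strength=2.0)
print(steered)  # [2.5, -1.0, -0.75]
```

In practice such an intervention would be applied to a transformer's residual-stream activations at one or more layers (e.g. via a forward hook), with the concept direction learned or extracted from the model itself.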
Researchers at Nvidia have developed a new technique that flips the script on how large language models (LLMs) learn to reason. The method, called reinforcement learning pre-training (RLP), integrates ...