#reinforcement-fine-tuning

Artificial intelligence
from InfoQ
22 hours ago

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

Agent RFT applies reinforcement fine-tuning to tool-using agents to improve multi-step, tool-mediated decision-making via graded rewards and trajectory-level credit assignment.