Fique off-line com o app Player FM !
Fine-tuning and Preference Alignment in a Single Streamlined Process
Manage episode 423374192 series 2570898
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model.
Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/
Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.
Detailed show notes can be found on The Data Exchange web site.
256 episódios
Manage episode 423374192 series 2570898
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model.
Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/
Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.
Detailed show notes can be found on The Data Exchange web site.
256 episódios
كل الحلقات
×Bem vindo ao Player FM!
O Player FM procura na web por podcasts de alta qualidade para você curtir agora mesmo. É o melhor app de podcast e funciona no Android, iPhone e web. Inscreva-se para sincronizar as assinaturas entre os dispositivos.