Content provided by IVANCAST PODCAST. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by IVANCAST PODCAST or its podcast platform partner. If you believe someone is using your copyrighted work without your permission, follow the process described here: https://pt.player.fm/legal.

Peering Into the Black Box: The Rise of Representation Engineering

37:27

Join us in SHIFTERLABS’ latest experimental podcast series powered by NotebookLM, where we bridge research and conversation to illuminate groundbreaking ideas in AI. In this episode, we dive into “Representation Engineering: A Top-Down Approach to AI Transparency,” an insightful paper from the Center for AI Safety, Carnegie Mellon University, Stanford, and other leading institutions. This research redefines how we view transparency in deep learning by shifting the focus from neurons and circuits to high-level representations.

Discover how Representation Engineering (RepE) introduces new methods for reading and controlling cognitive processes in AI models, offering innovative solutions to challenges like honesty, hallucination detection, and fairness. We explore its applications across essential safety domains, including model control and ethical behavior. Tune in to learn how these advances could shape a future of AI that is more transparent, accountable, and aligned with human values.

This series is part of SHIFTERLABS’ ongoing commitment to pushing the boundaries of educational technology and fostering discussions at the intersection of research, technology, and responsible innovation.

100 episodes
