Artwork

Conteúdo fornecido por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.
Player FM - Aplicativo de podcast
Fique off-line com o app Player FM !

Claude Opus 4.5, Olmo 3, and a Paper on Diffusion + Auto Regression

47:45
 
Compartilhar
 

Manage episode 521719471 series 3703995
Conteúdo fornecido por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models on software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also delves into the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 setting benchmarks, enhance usability and reduce token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25s to midnight
  • Recent research suggests that AI models can improve their performance through emulating query based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod
  continue reading

4 episódios

Artwork
iconCompartilhar
 
Manage episode 521719471 series 3703995
Conteúdo fornecido por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models on software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also delves into the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 setting benchmarks, enhance usability and reduce token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25s to midnight
  • Recent research suggests that AI models can improve their performance through emulating query based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod
  continue reading

4 episódios

Todos os episódios

×
 
Loading …

Bem vindo ao Player FM!

O Player FM procura na web por podcasts de alta qualidade para você curtir agora mesmo. É o melhor app de podcast e funciona no Android, iPhone e web. Inscreva-se para sincronizar as assinaturas entre os dispositivos.

 

Guia rápido de referências

Ouça este programa enquanto explora
Reproduzir