Artwork

Conteúdo fornecido por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.
Player FM - Aplicativo de podcast
Fique off-line com o app Player FM !

Can AI Bring Both Speed and Accuracy: Josh Broyde of AI21 Labs

37:06
 
Compartilhar
 

Manage episode 429415589 series 3068634
Conteúdo fornecido por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

This week, we are joined by Joshua Broyde, PhD and Principal Solutions Architect at AI21 Labs. Broyde discusses AI21 Labs' work in developing foundation models and AI systems for enterprise use, with a focus on their latest model, Jamba-Instruct.

Josh explains the concept of foundation models and how they differ from traditional AI models. He highlights AI21 Labs' work with financial institutions on use cases like term sheet generation and financial document Q&A. The conversation explores the challenges and benefits of training models on company-specific data versus using retrieval augmented generation (RAG) techniques.

The interview delves into the development of Jamba Instruct, a hybrid model combining Mamba and Transformer architectures to achieve both speed and accuracy. Broyde discusses the model's performance, industry reaction, and potential applications.

Safety and security considerations for AI models are addressed, with Broyde explaining AI21 Labs' approach to implementing guardrails and secure deployment options for regulated industries. The discussion also covers the balance between model quality and cost, and the trend towards matching specific models to appropriate tasks.

Josh also shares his thoughts on future developments in the field, including the potential for agent-based approaches and increased focus on cost optimization in AI workflows.

Listen on mobile platforms: ⁠⁠⁠⁠⁠⁠⁠⁠⁠Apple Podcasts⁠⁠⁠⁠⁠⁠⁠⁠⁠ | ⁠⁠⁠⁠⁠⁠⁠⁠⁠Spotify⁠⁠⁠⁠⁠⁠⁠⁠⁠ | ⁠⁠⁠⁠⁠⁠⁠⁠YouTube⁠⁠⁠⁠⁠⁠⁠

Contact Us:

Twitter: ⁠⁠⁠⁠⁠@gebauerm⁠⁠⁠⁠⁠, or ⁠⁠⁠⁠⁠@glambert⁠⁠⁠⁠⁠

Email: geekinreviewpodcast@gmail.com

Music: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Jerry David DeCicca⁠⁠⁠⁠⁠⁠⁠⁠

Transcript on 3 Geeks

  continue reading

274 episódios

Artwork
iconCompartilhar
 
Manage episode 429415589 series 3068634
Conteúdo fornecido por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Greg Lambert & Marlene Gebauer, Greg Lambert, and Marlene Gebauer ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

This week, we are joined by Joshua Broyde, PhD and Principal Solutions Architect at AI21 Labs. Broyde discusses AI21 Labs' work in developing foundation models and AI systems for enterprise use, with a focus on their latest model, Jamba-Instruct.

Josh explains the concept of foundation models and how they differ from traditional AI models. He highlights AI21 Labs' work with financial institutions on use cases like term sheet generation and financial document Q&A. The conversation explores the challenges and benefits of training models on company-specific data versus using retrieval augmented generation (RAG) techniques.

The interview delves into the development of Jamba Instruct, a hybrid model combining Mamba and Transformer architectures to achieve both speed and accuracy. Broyde discusses the model's performance, industry reaction, and potential applications.

Safety and security considerations for AI models are addressed, with Broyde explaining AI21 Labs' approach to implementing guardrails and secure deployment options for regulated industries. The discussion also covers the balance between model quality and cost, and the trend towards matching specific models to appropriate tasks.

Josh also shares his thoughts on future developments in the field, including the potential for agent-based approaches and increased focus on cost optimization in AI workflows.

Listen on mobile platforms: ⁠⁠⁠⁠⁠⁠⁠⁠⁠Apple Podcasts⁠⁠⁠⁠⁠⁠⁠⁠⁠ | ⁠⁠⁠⁠⁠⁠⁠⁠⁠Spotify⁠⁠⁠⁠⁠⁠⁠⁠⁠ | ⁠⁠⁠⁠⁠⁠⁠⁠YouTube⁠⁠⁠⁠⁠⁠⁠

Contact Us:

Twitter: ⁠⁠⁠⁠⁠@gebauerm⁠⁠⁠⁠⁠, or ⁠⁠⁠⁠⁠@glambert⁠⁠⁠⁠⁠

Email: geekinreviewpodcast@gmail.com

Music: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Jerry David DeCicca⁠⁠⁠⁠⁠⁠⁠⁠

Transcript on 3 Geeks

  continue reading

274 episódios

Todos os episódios

×
 
Loading …

Bem vindo ao Player FM!

O Player FM procura na web por podcasts de alta qualidade para você curtir agora mesmo. É o melhor app de podcast e funciona no Android, iPhone e web. Inscreva-se para sincronizar as assinaturas entre os dispositivos.

 

Guia rápido de referências