Content provided by Duncan Epping, Frank Denneman, and Johan van Amersfoort. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Duncan Epping, Frank Denneman, and Johan van Amersfoort or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, follow the process described here: https://pt.player.fm/legal.

#072 - Chris Gully and the rise of Small Language Models

48:26
 

Summary
Chris Gully discusses his current role in the new Broadcom organization and highlights of his career. He emphasizes the importance of staying relevant in the technology industry and the value of working with cool and smart people. The conversation then shifts to the topic of small language models (SLMs) and their role in the landscape of gen AI applications. Gully explains that SLMs offer a more progressive approach to working with large language models (LLMs) and enable more efficient and scalable deployments. The discussion also touches on the components of gen AI applications, the need for right-sizing models, and the challenges of scalability and efficiency. Gully highlights the importance of data and its role in driving business outcomes through AI. The conversation concludes with a discussion on the benefits and limitations of fine-tuning LLMs and the potential future of SLMs.

The conversation explores the concept of SLMs (Small Language Models) and their role in AI development. It discusses the advantages of SLMs over LLMs (Large Language Models) regarding efficiency, optimization, and governance. The conversation also touches on the challenges of infrastructure management and resource allocation in AI deployments. It highlights the importance of right-sizing workloads, distributing workloads across data centers, and maximizing resource utilization. The conversation concludes with a discussion on the future trends in machine learning and AI, including advancements in math and the need for accessible and efficient technology.

Takeaways
  • Staying relevant in the technology industry is crucial for career success.
  • Small language models (SLMs) offer a more efficient and scalable approach to working with large language models (LLMs).
  • Data is the most important and untapped asset for organizations, and leveraging it through AI can drive business outcomes.
  • Scalability and efficiency are key challenges in deploying gen AI applications.
  • Fine-tuning LLMs can enhance their precision and reduce the need for extensive training.
  • The future of SLMs may involve dynamic training and efficient distribution to support evolving business needs.
  • SLMs offer advantages in terms of efficiency, optimization, and governance compared to LLMs.
  • Infrastructure management and resource allocation are crucial in AI deployments.
  • Right-sizing workloads and maximizing resource utilization are key considerations.
  • Future trends in machine learning and AI include advancements in math and the need for accessible and efficient technology.

Follow us on X for updates and news about upcoming episodes: https://x.com/UnexploredPod.

Last but not least, make sure to hit that subscribe button and share the episode with your friends and colleagues!
Disclaimer: The thoughts and opinions shared in this podcast are our own/guest(s), and not necessarily those of Broadcom or VMware by Broadcom.

74 episodes
