Episode 154 - Sind LLMs auf Benchmark Daten manipuliert?

Knowledge Science - Alles über KI, ML und NLP

Conteúdo fornecido por Sigurd Schacht, Carsten Lanquillon, Sigurd Schacht, and Carsten Lanquillon. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por Sigurd Schacht, Carsten Lanquillon, Sigurd Schacht, and Carsten Lanquillon ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

8M ago 36:40

MP3•Home de episódios

Send us a text

In der heutigen Sendung versuchen wir rauszufinden, ob man sich auf die öffentlichen Benchmarks zum Testen und Vergleichen von Sprachmodellen verlassen kann. Oder ob Benchmark Testdaten zum Trainieren verwendet werden. Hierbei handelt es sich um das Benchmark Leakage. Hören Sie rein.
Wir sprechen vor allem über das Paper: Benchmarking Benchmark Leakage in Large Language Models https://arxiv.org/abs/2404.18824

Support the show

208 episódios

#Technologie #Bildung #Sigurd Schacht, Carsten Lanquillon #Carsten Lanquillon #Sigurd Schacht #Wissenschaft #Künstliche Intelligenz

Episode 154 - Sind LLMs auf Benchmark Daten manipuliert?

Knowledge Science - Alles über KI, ML und NLP

14 subscribers

published 8M ago

MP3•Home de episódios

Send us a text

Support the show

208 episódios

#Technologie #Bildung #Sigurd Schacht, Carsten Lanquillon #Carsten Lanquillon #Sigurd Schacht #Wissenschaft #Künstliche Intelligenz

Todos os episódios

Bem vindo ao Player FM!

O Player FM procura na web por podcasts de alta qualidade para você curtir agora mesmo. É o melhor app de podcast e funciona no Android, iPhone e web. Inscreva-se para sincronizar as assinaturas entre os dispositivos.

Ouça a 500+ tópicos

Parecido com Knowledge Science - Alles über KI, ML und NLP

Podcasts que valem a pena ouvir

Knowledge Science - Alles über KI, ML und NLP « » Episode 154 - Sind LLMs auf Benchmark Daten manipuliert?

Episode 154 - Sind LLMs auf Benchmark Daten manipuliert?

Podcasts que valem a pena ouvir

Bem vindo ao Player FM!

Parecido com Knowledge Science - Alles über KI, ML und NLP

Guia rápido de referências

Knowledge Science - Alles über KI, ML und NLP « »
Episode 154 - Sind LLMs auf Benchmark Daten manipuliert?