[07] John Schulman - Optimizing Expectations: From Deep RL To Stochastic Computation Graphs The Thesis Review podcast

Artwork

Science Thesis Review Sean Welleck

Conteúdo fornecido por The Thesis Review and Sean Welleck. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por The Thesis Review and Sean Welleck ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

The Thesis Review « »
[07] John Schulman - Optimizing Expectations: From Deep RL to Stochastic Computation Graphs

4y ago 1:04:28

Compartilhar

MP3•Home de episódios

Conteúdo fornecido por The Thesis Review and Sean Welleck. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por The Thesis Review and Sean Welleck ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

John Schulman is a Research Scientist and co-founder of Open AI. John co-leads the reinforcement learning team, researching algorithms that safely and efficiently learn by trial and error and by imitating humans. His PhD thesis is titled "Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs", which he completed in 2016 at Berkeley. We talk about his work on stochastic computation graphs and TRPO, how it evolved to PPO and how it's used in large-scale applications like Open AI Five, as well as his recent work on generalization in RL. Episode notes: https://cs.nyu.edu/~welleck/episode7.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.buymeacoffee.com/thesisreview

… continue reading

47 episódios

#Science #Thesis Review #Sean Welleck

Artwork

[07] John Schulman - Optimizing Expectations: From Deep RL to Stochastic Computation Graphs

The Thesis Review

published 4y ago

Compartilhar

MP3•Home de episódios

Conteúdo fornecido por The Thesis Review and Sean Welleck. Todo o conteúdo do podcast, incluindo episódios, gráficos e descrições de podcast, é carregado e fornecido diretamente por The Thesis Review and Sean Welleck ou por seu parceiro de plataforma de podcast. Se você acredita que alguém está usando seu trabalho protegido por direitos autorais sem sua permissão, siga o processo descrito aqui https://pt.player.fm/legal.

John Schulman is a Research Scientist and co-founder of Open AI. John co-leads the reinforcement learning team, researching algorithms that safely and efficiently learn by trial and error and by imitating humans. His PhD thesis is titled "Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs", which he completed in 2016 at Berkeley. We talk about his work on stochastic computation graphs and TRPO, how it evolved to PPO and how it's used in large-scale applications like Open AI Five, as well as his recent work on generalization in RL. Episode notes: https://cs.nyu.edu/~welleck/episode7.html Follow the Thesis Review (@thesisreview) and Sean Welleck (@wellecks) on Twitter, and find out more info about the show at https://cs.nyu.edu/~welleck/podcast.html Support The Thesis Review at www.buymeacoffee.com/thesisreview

… continue reading

47 episódios

#Science #Thesis Review #Sean Welleck

همه قسمت ها

×

Bem vindo ao Player FM!

O Player FM procura na web por podcasts de alta qualidade para você curtir agora mesmo. É o melhor app de podcast e funciona no Android, iPhone e web. Inscreva-se para sincronizar as assinaturas entre os dispositivos.

Ouça a 500+ tópicos

Ajuda/FAQ | Atualizar | Anunciar

Arte|Negócios|Comédia|Economia|Entretenimento|Notícias|Política|Religião

Ciência|Futebol|Desporto|Narração de histórias|Tecnologia|Crimes verdadeiros

Direitos autorais 2024 | Mapa do site | Política de Privacidade | Termos de serviço | | direito autoral