Give that model a treat! : Reinforcement learning explained

Tic-Tac-Toe the Hard Way

5 years ago 26:04

Resources:

Deep Learning for JavaScript book

Playing Atari with Deep Reinforcement Learning

Two Minute Papers episode on Atari DQN

For more information about the show, check out pair.withgoogle.com/thehardway/.

You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.

10 episodes

#Tech #Podcasting Education #Rebecca Salois #People AI Research #Machine Learning #Human Centered #Reinforcement Learning #Supervised Learning #Tic-tac-toe #Games #Google

All episodes

Lessons learned 33:01

5 years ago

What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning (also known as the world) works?

Head to Head: The Even Bigger ML Smackdown! 24:26

5 years ago

Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results? See the agents play each other in Tic-Tac-Two!

Enter tic-tac-two 21:20

5 years ago

David’s variant of tic-tac-toe, which we’re calling tic-tac-two, is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn how to play, and how to represent that data.

Head to Head: the Big ML Smackdown! 25:19

5 years ago

David and Yannick’s tic-tac-toe ML agents face off against each other in tic-tac-toe! See the agents play each other!

Give that model a treat! : Reinforcement learning explained 26:04

5 years ago

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves. Resources: Deep Learning for JavaScript book; Playing Atari with Deep Reinforcement Learning; Two Minute Papers episode on Atari DQN.

Beating random: What it means to have trained a model 17:14

5 years ago

David did it! He trained a machine learning model to play tic-tac-toe! (Well, with lots of help from Yannick.) How did the whole training experience go? How do you tell how training went? How did his model do against a player that makes random tic-tac-toe moves?

From tic-tac-toe moves to ML model 21:37

5 years ago

Once we have the data we need (thousands of sample games), how do we turn it into something the ML can train itself on? That means understanding how training works, and what a model is. Resources: See a definition of one-hot encoding.

What does a tic-tac-toe board look like to machine learning? 23:26

5 years ago

How should David represent the data needed to train his machine learning system? What does a tic-tac-toe board “look” like to ML? Should he train it on games or on individual boards? How does this decision affect how and how well the machine will learn to play? Plus, an intro to reinforcement learning, the approach Yannick will be taking.

Howdy, and the myth of “pouring in data” 22:01

5 years ago

Welcome to the podcast! We’re Yannick and David, a software engineer and a non-technical writer. Over the next 9 episodes we’re going to use two different approaches to build machine learning systems that play two versions of tic-tac-toe. Building a machine learning app requires humans to make a lot of decisions. We start by agreeing that David will use a “supervised learning” approach while Yannick will go with “reinforcement learning.”

Introducing Tic-Tac-Toe the Hard Way 2:09

5 years ago