video thumbnail 21:01
Gradient descent, how neural networks learn | Chapter 2, Deep learning

2017-10-16

[public] 3.82M views, 147K likes, 619 dislikes audio only

channel thumb3Blue1Brown

Enjoy these videos? Consider sharing one or two.

Help fund future projects: https://www.patreon.com/3blue1brown

Special thanks to these supporters: http://3b1b.co/nn2-thanks

Written/interactive form of this series: https://www.3blue1brown.com/topics/neural-networks

This video was supported by Amplify Partners.

For any early-stage ML startup founders, Amplify Partners would love to hear from you via 3blue1brown@amplifypartners.com

To learn more, I highly recommend the book by Michael Nielsen

http://neuralnetworksanddeeplearning.com/

The book walks through the code behind the example in these videos, which you can find here:

https://github.com/mnielsen/neural-networks-and-deep-learning

MNIST database:

http://yann.lecun.com/exdb/mnist/

Also check out Chris Olah's blog:

http://colah.github.io/

His post on Neural networks and topology is particular beautiful, but honestly all of the stuff there is great.

And if you like that, you'll *love* the publications at distill:

https://distill.pub/

For more videos, Welch Labs also has some great series on machine learning:

/youtube/video/i8D90DkCLhI

/youtube/video/bxe2T-V8XRs

"But I've already voraciously consumed Nielsen's, Olah's and Welch's works", I hear you say. Well well, look at you then. That being the case, I might recommend that you continue on with the book "Deep Learning" by Goodfellow, Bengio, and Courville.

Thanks to Lisha Li (@lishali88) for her contributions at the end, and for letting me pick her brain so much about the material. Here are the articles she referenced at the end:

https://arxiv.org/abs/1611.03530

https://arxiv.org/abs/1706.05394

https://arxiv.org/abs/1412.0233

Music by Vincent Rubinetti:

https://vincerubinetti.bandcamp.com/album/the-music-of-3blue1brown

Thanks to these viewers for their contributions to translations

Hebrew: Omer Tuchfeld

Italian: @teobucci

-------------------

Video timeline

0:00 - Introduction

0:30 - Recap

1:49 - Using training data

3:01 - Cost functions

6:55 - Gradient descent

11:18 - More on gradient vectors

12:19 - Gradient descent recap

13:01 - Analyzing the network

16:37 - Learning more

17:38 - Lisha Li interview

19:58 - Closing thoughts

------------------

3blue1brown is a channel about animating math, in all senses of the word animate. And you know the drill with YouTube, if you want to stay posted on new videos, subscribe, and click the bell to receive notifications (if you're into that).

If you are new to this channel and want to see more, a good place to start is this playlist: http://3b1b.co/recommended

Various social media stuffs:

Website: https://www.3blue1brown.com

Twitter: https://twitter.com/3Blue1Brown

Patreon: https://patreon.com/3blue1brown

Facebook: https://www.facebook.com/3blue1brown

Reddit: https://www.reddit.com/r/3Blue1Brown


But what is a neural network? | Chapter 1, Deep learning by 3Blue1Brown
/youtube/video/aircAruvnKk
Patreon Support more of these videos
http://patreon.com/3blue1brown
Introduction
/youtube/video/IHZwWFHWa-w?t=0
Recap
/youtube/video/IHZwWFHWa-w?t=30
Using training data
/youtube/video/IHZwWFHWa-w?t=109
Cost functions
/youtube/video/IHZwWFHWa-w?t=181
Gradient descent
/youtube/video/IHZwWFHWa-w?t=415
More on gradient vectors
/youtube/video/IHZwWFHWa-w?t=678
Gradient descent recap
/youtube/video/IHZwWFHWa-w?t=739
Analyzing the network
/youtube/video/IHZwWFHWa-w?t=781
Learning more
/youtube/video/IHZwWFHWa-w?t=997
Lisha Li interview
/youtube/video/IHZwWFHWa-w?t=1058
Closing thoughts
/youtube/video/IHZwWFHWa-w?t=1198
3Blue1Brown 3Blue1Brown, by Grant Sanderson, is some combination of math and entertainment, depending on your disposition. The goal is for explanations to be driven by animations and for difficult problems to be made simple with changes in perspective. For more information, other projects, FAQs, and inquiries see the website: https://www.3blue1brown.com
/youtube/channel/UCYO_jab_esuFRV4b17AJtAw
But what is a convolution? 1,301,586 views
/youtube/video/KuXjwB4LzSA
Support on patreon patreon.com
https://www.patreon.com/3blue1brown
What is backpropagation really doing? | Chapter 3, Deep learning 3,449,628 views
/youtube/video/Ilg3gGewQ5U