Premium Only Content
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)
#pondernet #deepmind #machinelearning
Humans don't spend the same amount of mental effort on all problems equally. Instead, we respond quickly to easy tasks, and we take our time to deliberate hard tasks. DeepMind's PonderNet attempts to achieve the same by dynamically deciding how many computation steps to allocate to any single input sample. This is done via a recurrent architecture and a trainable function that computes a halting probability. The resulting model performs well in dynamic computation tasks and is surprisingly robust to different hyperparameter settings.
OUTLINE:
0:00 - Intro & Overview
2:30 - Problem Statement
8:00 - Probabilistic formulation of dynamic halting
14:40 - Training via unrolling
22:30 - Loss function and regularization of the halting distribution
27:35 - Experimental Results
37:10 - Sensitivity to hyperparameter choice
41:15 - Discussion, Conclusion, Broader Impact
Paper: https://arxiv.org/abs/2107.05407
Abstract:
In standard neural networks the amount of computation used grows with the size of the inputs, but not with the complexity of the problem being learnt. To overcome this limitation we introduce PonderNet, a new algorithm that learns to adapt the amount of computation based on the complexity of the problem at hand. PonderNet learns end-to-end the number of computational steps to achieve an effective compromise between training prediction accuracy, computational cost and generalization. On a complex synthetic problem, PonderNet dramatically improves performance over previous adaptive computation methods and additionally succeeds at extrapolation tests where traditional neural networks fail. Also, our method matched the current state of the art results on a real world question and answering dataset, but using less compute. Finally, PonderNet reached state of the art results on a complex task designed to test the reasoning capabilities of neural networks.1
Authors: Andrea Banino, Jan Balaguer, Charles Blundell
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
Minds: https://www.minds.com/ykilcher
Parler: https://parler.com/profile/YannicKilcher
LinkedIn: https://www.linkedin.com/in/yannic-ki...
BiliBili: https://space.bilibili.com/1824646584
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
-
1:24
laurastephens99
3 years agoLove learning
28 -
LIVE
Nerdrotic
2 hours agoNerdrotic Nooner 442
2,337 watching -
1:01:08
The Dan Bongino Show
4 hours agoTrump Makes His First Big Moves (Ep. 2368) - 11/11/2024
559K1.01K -
1:57:22
Steven Crowder
4 hours agoThe 4B Movement: How Trump is Saving the World from Liberal Women
441K41 -
1:24:03
The Rubin Report
2 hours ago'Real Time' Crowd Stunned as Bill Maher Gives a Brutal Message to Democrats
44.2K8 -
LIVE
Benny Johnson
2 hours agoDC Swamp Declares WAR on TRUMP in Senate Battle to REPLACE Mitch McConnell! We EXPOSED Secret Ballot
16,651 watching -
LIVE
MTNTOUGH Fitness Lab
1 hour agoWhy 99% of People Will Fail: The Hardcore Truth About Entrepreneurship | MTNT POD #90
185 watching -
LIVE
The Nima Yamini Show
2 hours agoWhat is AFD In Germany?
136 watching -
1:28:43
Caleb Hammer
14 hours agoPsycho Tried To Manipulate Me | Financial Audit
18.7K -
2:02:49
LFA TV
13 hours agoJUSTICE IS COMING! | LIVE FROM AMERICA 11.11.24 11am EST
44.1K