Premium Only Content
Dynamic Inference with Neural Interpreters (w/ author interview)
#deeplearning #neuralinterpreter #ai
This video includes an interview with the paper's authors!
What if we treated deep networks like modular programs? Neural Interpreters divide computation into small modules and route data to them via a dynamic type inference system. The resulting model combines recurrent elements, weight sharing, attention, and more to tackle both abstract reasoning, as well as computer vision tasks.
OUTLINE:
0:00 - Intro & Overview
3:00 - Model Overview
7:00 - Interpreter weights and function code
9:40 - Routing data to functions via neural type inference
14:55 - ModLin layers
18:25 - Experiments
21:35 - Interview Start
24:50 - General Model Structure
30:10 - Function code and signature
40:30 - Explaining Modulated Layers
49:50 - A closer look at weight sharing
58:30 - Experimental Results
Paper: https://arxiv.org/abs/2110.06399
Guests:
Nasim Rahaman: https://twitter.com/nasim_rahaman
Francesco Locatello: https://twitter.com/FrancescoLocat8
Waleed Gondal: https://twitter.com/Wallii_gondal
Abstract:
Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution. However, they are less capable of systematic generalization to data drawn from unseen but related distributions, a feat that is hypothesized to require compositional reasoning and reuse of knowledge. In this work, we present Neural Interpreters, an architecture that factorizes inference in a self-attention network as a system of modules, which we call \emph{functions}. Inputs to the model are routed through a sequence of functions in a way that is end-to-end learned. The proposed architecture can flexibly compose computation along width and depth, and lends itself well to capacity extension after training. To demonstrate the versatility of Neural Interpreters, we evaluate it in two distinct settings: image classification and visual abstract reasoning on Raven Progressive Matrices. In the former, we show that Neural Interpreters perform on par with the vision transformer using fewer parameters, while being transferrable to a new task in a sample efficient manner. In the latter, we find that Neural Interpreters are competitive with respect to the state-of-the-art in terms of systematic generalization
Authors: Nasim Rahaman, Muhammad Waleed Gondal, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
LinkedIn: https://www.linkedin.com/in/ykilcher
BiliBili: https://space.bilibili.com/2017636191
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
-
28:33
Stephen Gardner
9 hours ago🔥Joe Biden SHOCKS the DNC...Then burns it down on way out!
107K163 -
1:04:33
JustPearlyThings
12 hours agoThe MISLEADING Tactics of the Feminist Agenda | Pearl Daily
53.6K27 -
1:26:57
Flyover Conservatives
1 day agoLara Logan’s Explosive Take: Open Borders. The Future for J6 Hostages. The Fallout from Hunter Biden’s Pardon. | FOC Show
55.8K26 -
1:27:53
Adam Does Movies
14 hours ago $12.17 earnedIt's Kraven Time! First 8 Minutes Revealed + Moana 2's Huge Box Office! - LIVE
61.5K2 -
7:20:47
-
1:05:40
Donald Trump Jr.
16 hours agoAmerica to the FBI: We Only Take Kash, Live with Mike Davis | TRIGGERED Ep.195
173K217 -
2:23:13
WeAreChange
12 hours agoBIDEN 180: Crime Family Coverup Exposed By Joe’s Pardon Of Hunter!
80.5K41 -
47:03
Kimberly Guilfoyle
14 hours agoThe FBI is now Kash Only, Live with Larry Elder & Steve Friend | Ep. 178
96.2K24 -
1:02:30
Sarah Westall
12 hours agoWhy Realistically Gold Could Increase to $100,000+ to Pay Off the National Debt w/ Andy Schectman
50.8K7 -
54:43
LFA TV
1 day agoThe Biden Crime Family Is Above the Law | Trumpet Daily 12.2.24 7PM EST
53.7K6