Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained)
#implicitfunction #jax #autodiff
Many problems in Machine Learning involve loops of inner and outer optimization. Finding update steps for the outer loop is usually difficult, because of the need to differentiate through the inner loop's procedure over multiple steps. Such loop unrolling is very limited and constrained to very few steps. Other papers have found solutions around unrolling in very specific, individual problems. This paper proposes a unified framework for implicit differentiation of inner optimization procedures without unrolling and provides implementations that integrate seamlessly into JAX.
OUTLINE:
0:00 - Intro & Overview
2:05 - Automatic Differentiation of Inner Optimizations
4:30 - Example: Meta-Learning
7:45 - Unrolling Optimization
13:00 - Unified Framework Overview & Pseudocode
21:10 - Implicit Function Theorem
25:45 - More Technicalities
28:45 - Experiments
ERRATA:
- Dataset Distillation is done with respect to the training set, not the validation or test set.
Paper: https://arxiv.org/abs/2105.15183
Code coming soon
Abstract:
Automatic differentiation (autodiff) has revolutionized machine learning. It allows expressing complex computations by composing elementary ones in creative ways and removes the burden of computing their derivatives by hand. More recently, differentiation of optimization problem solutions has attracted widespread attention with applications such as optimization as a layer, and in bi-level problems such as hyper-parameter optimization and meta-learning. However, the formulas for these derivatives often involve case-by-case tedious mathematical derivations. In this paper, we propose a unified, efficient and modular approach for implicit differentiation of optimization problems. In our approach, the user defines (in Python in the case of our implementation) a function F capturing the optimality conditions of the problem to be differentiated. Once this is done, we leverage autodiff of F and implicit differentiation to automatically differentiate the optimization problem. Our approach thus combines the benefits of implicit differentiation and autodiff. It is efficient as it can be added on top of any state-of-the-art solver and modular as the optimality condition specification is decoupled from the implicit differentiation mechanism. We show that seemingly simple principles allow to recover many recently proposed implicit differentiation methods and create new ones easily. We demonstrate the ease of formulating and solving bi-level optimization problems using our framework. We also showcase an application to the sensitivity analysis of molecular dynamics.
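Illustrative sketch (not the paper's code): the idea in the abstract can be tried on a one-dimensional toy problem. Assume the inner problem is x*(theta) = argmin_x 0.5*(x - A)^2 + 0.5*theta*x^2, so the optimality condition is F(x, theta) = (x - A) + theta*x = 0. The implicit function theorem then gives dx*/dtheta = -(dF/dx)^(-1) * dF/dtheta, evaluated at the solution. Everything named below (A, F, Dual, solve_inner, implicit_derivative) is hypothetical and chosen for this example only. The tiny Dual class stands in for a real autodiff system; in JAX one would differentiate F with jax.grad or jax.jvp instead.

```python
A = 3.0  # hypothetical data constant of the toy inner problem


class Dual:
    """Minimal forward-mode dual number: a value plus its derivative."""

    def __init__(self, val, dot=0.0):
        self.val, self.dot = val, dot

    @staticmethod
    def lift(z):
        return z if isinstance(z, Dual) else Dual(z)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.val + other.val, self.dot + other.dot)

    __radd__ = __add__

    def __sub__(self, other):
        other = Dual.lift(other)
        return Dual(self.val - other.val, self.dot - other.dot)

    def __rsub__(self, other):
        return Dual.lift(other) - self

    def __mul__(self, other):
        other = Dual.lift(other)
        # Product rule for the derivative part.
        return Dual(self.val * other.val,
                    self.dot * other.val + self.val * other.dot)

    __rmul__ = __mul__


def F(x, theta):
    """Optimality condition: gradient of the inner objective w.r.t. x."""
    return (x - A) + theta * x


def partial_derivative(x, theta, wrt):
    """Differentiate F w.r.t. x (wrt=0) or theta (wrt=1) via dual numbers."""
    if wrt == 0:
        return F(Dual(x, 1.0), theta).dot
    return F(x, Dual(theta, 1.0)).dot


def solve_inner(theta, steps=500, lr=0.1):
    """Black-box inner solver: plain gradient descent, never differentiated."""
    x = 0.0
    for _ in range(steps):
        x -= lr * F(x, theta)
    return x


def implicit_derivative(theta):
    """dx*/dtheta from the implicit function theorem, without unrolling."""
    x_star = solve_inner(theta)
    dF_dx = partial_derivative(x_star, theta, wrt=0)
    dF_dtheta = partial_derivative(x_star, theta, wrt=1)
    return -dF_dtheta / dF_dx


if __name__ == "__main__":
    theta = 0.5
    eps = 1e-5
    finite_diff = (solve_inner(theta + eps) - solve_inner(theta - eps)) / (2 * eps)
    print("implicit derivative:", implicit_derivative(theta))
    print("finite difference:  ", finite_diff)
```

For this toy problem the closed form is x*(theta) = A / (1 + theta), so the exact derivative is -A / (1 + theta)^2, which is -4/3 at theta = 0.5. Note that the solver is only called to obtain x*; the derivative comes from F alone, which is why the solver can be swapped for any other method that reaches the same solution.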
Authors: Mathieu Blondel, Quentin Berthet, Marco Cuturi, Roy Frostig, Stephan Hoyer, Felipe Llinares-López, Fabian Pedregosa, Jean-Philippe Vert
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
Minds: https://www.minds.com/ykilcher
Parler: https://parler.com/profile/YannicKilcher
LinkedIn: https://www.linkedin.com/in/yannic-ki...
BiliBili: https://space.bilibili.com/1824646584
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n