Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained)
#reinforcementlearning #ai #explained
Exploration is one of the oldest challenges for Reinforcement Learning algorithms, with no clear solution to date. Especially in environments with sparse rewards, agents struggle to decide which parts of the environment to explore further. Providing intrinsic motivation in the form of a pseudo-reward is sometimes used to overcome this challenge, but it often relies on hand-crafted heuristics and can lead to deceptive dead-ends. This paper proposes using language descriptions of encountered states as a way of assessing novelty. In two procedurally generated environments, the authors demonstrate the usefulness of language, which is inherently concise and abstract and therefore lends itself well to this task.
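To make the idea of "novelty over language descriptions" concrete, here is a minimal sketch. It is not the paper's method (the paper extends AMIGo and NovelD); it is a simple count-based bonus, assumed here purely for illustration, that pays less each time the same description is seen again. The class name `LanguageNoveltyBonus`, the `scale` parameter, and the example messages are all hypothetical.

```python
import math
from collections import Counter


class LanguageNoveltyBonus:
    """Illustrative count-based intrinsic reward keyed on text descriptions.

    Assumption: the environment emits a short language description of the
    current state. This is a toy stand-in, not the paper's algorithm.
    """

    def __init__(self, scale=1.0):
        self.scale = scale
        self.counts = Counter()

    def reward(self, description):
        # Normalize case and whitespace so trivially different strings
        # map to the same abstract state.
        key = " ".join(description.lower().split())
        self.counts[key] += 1
        # The bonus decays as 1 / sqrt(visit count).
        return self.scale / math.sqrt(self.counts[key])


bonus = LanguageNoveltyBonus()
messages = ["You see a key.", "You see a key.", "You open the door."]
rewards = [bonus.reward(m) for m in messages]
```

Because the bonus is keyed on the description rather than the raw state, many low-level states that share one description also share one visit count, which is the kind of abstraction the description above refers to.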
OUTLINE:
0:00 - Intro
1:10 - Paper Overview: Language for exploration
5:40 - The MiniGrid & MiniHack environments
7:00 - Annotating states with language
9:05 - Baseline algorithm: AMIGo
12:20 - Adding language to AMIGo
22:55 - Baseline algorithm: NovelD and Random Network Distillation
29:45 - Adding language to NovelD
31:50 - Aren't we just using extra data?
34:55 - Investigating the experimental results
40:45 - Final comments
Paper: https://arxiv.org/abs/2202.08938
Abstract:
Reinforcement learning (RL) agents are particularly hard to train when rewards are sparse. One common solution is to use intrinsic rewards to encourage agents to explore their environment. However, recent intrinsic exploration methods often use state-based novelty measures which reward low-level exploration and may not scale to domains requiring more abstract skills. Instead, we explore natural language as a general medium for highlighting relevant abstractions in an environment. Unlike previous work, we evaluate whether language can improve over existing exploration methods by directly extending (and comparing to) competitive intrinsic exploration baselines: AMIGo (Campero et al., 2021) and NovelD (Zhang et al., 2021). These language-based variants outperform their non-linguistic forms by 45-85% across 13 challenging tasks from the MiniGrid and MiniHack environment suites.
Authors: Jesse Mu, Victor Zhong, Roberta Raileanu, Minqi Jiang, Noah Goodman, Tim Rocktäschel, Edward Grefenstette
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
LinkedIn: https://www.linkedin.com/in/ykilcher
BiliBili: https://space.bilibili.com/2017636191
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n