Premium Only Content
[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
#gpt4 #mit #ai
A new paper claims to use GPT-4 to solve 100% of a set of MIT university exercises. Some people are skeptic and their investigations reveal more than one problem with this paper...
OUTLINE:
0:00 - ChatGPT gives out Windows 10 keys
0:30 - MIT exam paper
2:50 - Prompt engineering
5:30 - Automatic grading
6:45 - Response by other MIT students
8:30 - Unsolvable questions
10:50 - Duplicates
13:30 - Cascading the heuristics
22:40 - Other problems
29:25 - OpenLLaMA 13B published
References:
https://twitter.com/immasiddtweets/status/1669721470006857729/photo/1
https://arxiv.org/abs/2306.08997
https://arxiv.org/pdf/2306.08997.pdf
https://flower-nutria-41d.notion.site/No-GPT4-can-t-ace-MIT-b27e6796ab5a48368127a98216c76864
https://github.com/idrori/MITQ/commit/3feee1026318e537c0ad27968001ef76e4a36890
https://twitter.com/hardmaru/status/1670246674760077312
https://twitter.com/giffmana/status/1670258748286472193
https://twitter.com/T3816440886465/status/1670127224131862531
https://twitter.com/qrdl/status/1669856336652414977
https://www.chegg.com/homework-help/questions-and-answers/consider-mdp-set-possible-states-mathcal-s-0-1-2-3-set-possible-actions-mathcal-b-c--rewar-q111042613
https://github.com/openlm-research/open_llama
https://huggingface.co/openlm-research/open_llama_13b
Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
-
4:20:21
Nerdrotic
8 hours ago $33.99 earnedDaradevil Born Again, Comics Industry CRASH, Neu-Hollywood REBUILD | Friday Night Tights #337
139K20 -
1:32:34
Glenn Greenwald
4 hours agoThe Future of Gaza With Abubaker Abed; Journalist Sam Husseini On His Physical Expulsion From Blinken’s Briefing & Biden’s Gaza Legacy | System Update #391
60.8K53 -
1:34:48
Roseanne Barr
7 hours ago $11.60 earnedWe are so F*cking Punk Rock! with Drea de Matteo | The Roseanne Barr Podcast #83
51.8K35 -
Man in America
8 hours ago🇨🇳 RedNote: A CCP Trojan Horse Deceiving Americans? w/ Levi Browde
15.8K13 -
LIVE
I_Came_With_Fire_Podcast
11 hours agoTrump SABOTAGE, LA FIRE CHIEF SUED, and BIDEN’S LAST F-U!
275 watching -
LIVE
Joker Effect
2 hours agoUkraine in a video game? Hardest thing I have done. S.T.A.L.K.E.R.2 Heart of Chornobyl,
878 watching -
1:15:22
Flyover Conservatives
22 hours agoEczema, Brain Fog, B.O., and Gas… Eating Steak and Butter Creates Ultimate Health Hack - Bella, Steak and Butter Gal | FOC Show
27.2K -
51:58
PMG
6 hours ago $0.94 earned"Can the Government Learn from Elon Musk’s 70% Labor Cut? A Deep Dive into Inefficient Agencies"
18.6K -
LIVE
Amish Zaku
6 hours agoRumble Spartans #10 - New Year New Maps
197 watching -
1:04:58
In The Litter Box w/ Jewels & Catturd
1 day agoNo Tax On Tips! | In the Litter Box w/ Jewels & Catturd – Ep. 722 – 1/17/2025
141K32