2 years agoSparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explainedykilcher