3 years ago[ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitableykilcher