This should be the default illustration of the power of scaling laws in large neural networks. It would also be interesting to illustrate sudden capability gains during training in the same way. https://t.co/ysTMHtrELk

— Arthur B. 🌮 (@ArthurB) June 23, 2022