This Paper Presents a Comprehensive Empirical Analysis of Algorithmic Progress in Language Model Pre-Training from 2012 to 2023
Marktechpost
MARCH 14, 2024
For instance, it was found that the compute required to reach a specific performance threshold has halved approximately every eight months between 2012 and 2023, a rate significantly faster than the improvements anticipated by Moore’s Law. If you like our work, you will love our newsletter.
Let's personalize your content