The Sequence Chat: Hugging Face's Leandro von Werra on StarCoder and Code Generating LLMs
TheSequence
MAY 24, 2023
Since a lot of developers are working on Python we continued to trainStarCoder for about 35B tokens (~3% of full training) on the Python subset which lead to a significant performance boost. data or auto-generated files). cell outputs) for code completion in Jupyter notebooks (see this Jupyter plugin ).
Let's personalize your content