Dia de las Secuelas (StarCoder, The Stack, Dune, SemiAnalysis)
HuggingFace and BigCode release StarCoder2-15B, a code model trained on over 600 programming languages using The Stack v2 dataset, with code covered by opt-out requests excluded from the training data.
HuggingFace/BigCode has released StarCoder v2, including the StarCoder2-15B model, trained on over 600 programming languages using The Stack v2 dataset. The release is presented as state of the art for models of this size, and code covered by opt-out requests was excluded from the training data.
A detailed technical report covers the model's capabilities and training methodology. Separately, a live event in San Francisco featuring Dylan Patel on GPU economics has been announced.
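For readers curious what "excluding opt-out requests" could look like in practice, here is a minimal Python sketch. It is not BigCode's actual pipeline: the record layout (a dict with a "repo" key), the function name exclude_opted_out, and the sample repository names are all hypothetical, chosen only to illustrate the idea of dropping files from repositories whose owners asked to be left out.

```python
from typing import Dict, Iterable, Iterator, Set


def exclude_opted_out(
    records: Iterable[Dict[str, str]],
    opted_out_repos: Set[str],
) -> Iterator[Dict[str, str]]:
    """Yield only records whose repository is not on the opt-out list.

    Assumption: each record is a dict with a "repo" key (hypothetical layout).
    Repository names are compared case-insensitively.
    """
    blocked = {name.lower() for name in opted_out_repos}
    for record in records:
        if record["repo"].lower() not in blocked:
            yield record


# Hypothetical sample data, for illustration only.
sample = [
    {"repo": "alice/private-tools", "path": "main.py"},
    {"repo": "bob/utils", "path": "strings.go"},
    {"repo": "carol/compiler", "path": "lexer.rs"},
]

kept = list(exclude_opted_out(sample, {"Alice/Private-Tools"}))
print([r["repo"] for r in kept])
```

Filtering at the repository level before any training run keeps the exclusion independent of model code; a real pipeline would also need to handle forks and renamed repositories, which this sketch ignores.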