StarCoder2
BigCode / Hugging Face · February 2024
● active · Open Source · decoder only · code
Parameters: 3B - 15B
Context Window: 16K tokens
Variants: 3B, 7B, 15B
Description
StarCoder2 is the successor to StarCoder, trained on The Stack v2, a larger and more diverse code dataset than its predecessor's that spans over 600 programming languages. Available in 3B, 7B, and 15B sizes, it improves on StarCoder across coding benchmarks while maintaining the same commitment to transparent, ethically sourced training data.
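As a rough illustration of how one of the three sizes might be used for code completion, here is a minimal sketch built on the Hugging Face `transformers` library. It assumes the checkpoints are published on the Hub as `bigcode/starcoder2-3b`, `bigcode/starcoder2-7b`, and `bigcode/starcoder2-15b`. The helper names `checkpoint_for` and `complete` are hypothetical and appear only for this example. Nothing here is taken from the card itself.

```python
# Assumed Hub repository ids for the three sizes listed on this card.
CHECKPOINTS = {
    "3B": "bigcode/starcoder2-3b",
    "7B": "bigcode/starcoder2-7b",
    "15B": "bigcode/starcoder2-15b",
}


def checkpoint_for(variant: str) -> str:
    """Map a size label such as '3B' or ' 15b ' to its assumed Hub id."""
    key = variant.strip().upper()
    if key not in CHECKPOINTS:
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {sorted(CHECKPOINTS)}"
        )
    return CHECKPOINTS[key]


def complete(prompt: str, variant: str = "3B", max_new_tokens: int = 64) -> str:
    """Continue a code prompt; the weights are downloaded on first use."""
    # Imported lazily so checkpoint_for works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = checkpoint_for(variant)
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # The decoded string includes the original prompt followed by the completion.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("def fibonacci(n):")` would fetch the 3B weights and return the prompt followed by the model's continuation. The larger variants follow the same pattern but need considerably more memory.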
Key Innovations
Code Gen: Ability to write, debug, and understand programming code across multiple languages.
Open Weight: Model weights are publicly released but training data/code may not be. Enables fine-tuning but not full reproduction.