CodeLlama

Meta · August 2023

● activeOpen Weightdecoder onlycode

Parameters7B - 70B

Context Window16K tokens

Variants7B, 13B, 34B, 70B, Python, Instruct

Why It Matters

Meta's proof that specialized code models derived from general-purpose LLMs could outperform dedicated coding models.

Description

Meta's code-specialized version of LLaMA 2, further trained on code-heavy datasets. Available in sizes from 7B to 70B parameters, with specialized variants for Python and instruction-following. Proved that taking a strong general-purpose model and continuing to train it on code could outperform models built from scratch for coding.

Key Innovations

Code Gen

Code GenAbility to write, debug, and understand programming code across multiple languages.

Instruction Tuning

Instruction TuningFine-tuning a model on instruction-response pairs so it follows user commands more reliably.

Family Tree

Built On

LLaMA 2

Lineage

LLaMA→LLaMA 2→CodeLlama

Related Research (1)

LLaMA 2Scaling

2023 · Meta AI

Provided the most detailed public documentation of how to train, fine-tune, and safety-align a large language model, including their full RLHF methodo…

External Links

Research Paper

More from Meta LLaMA

LLaMA2023-02 · 7B - 65B

LLaMA 22023-07 · 7B - 70B

LLaMA 32024-04 · 8B / 70B

LLaMA 3.12024-07 · 8B / 70B / 405B

LLaMA 3.22024-09 · 1B / 3B / 11B / 90B

LLaMA 3.32024-12 · 70B

LLaMA 42025-04 · 17B active (Scout) / larger (Maverick)

MusicGen2023-06 · 3.3B

PreviousLLaMA 2

NextLLaMA 3