phi-4 reasoning models #2047

Merged 11 commits into Lightning-AI:main on May 27, 2025
Conversation

ysjprojects
Collaborator

Models added:
Phi-4-mini-reasoning | 3.8B
Phi-4-reasoning | 14B
Phi-4-reasoning-plus | 14B

Relevant links:
https://arxiv.org/abs/2504.21233 (mini-reasoning)
https://arxiv.org/abs/2504.21318 (reasoning)
https://azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/

Highlight:

Despite their significantly smaller size, both 14B models achieve better performance than OpenAI o1-mini and DeepSeek-R1-Distill-Llama-70B on most benchmarks, including mathematical reasoning and Ph.D.-level science questions. They outperform the full DeepSeek-R1 model (671 billion parameters) on the AIME 2025 test, the 2025 qualifier for the USA Math Olympiad.
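Once merged, the models should be loadable through litgpt's Python API like any other supported checkpoint. A minimal sketch, assuming the checkpoints are published on Hugging Face under the IDs shown (e.g. `microsoft/Phi-4-mini-reasoning`) and that this PR registers them in litgpt's model list:

```python
# Hedged sketch: assumes litgpt's `LLM.load` / `generate` API and that the
# Hugging Face repo ID "microsoft/Phi-4-mini-reasoning" matches the
# checkpoint name registered by this PR.
from litgpt import LLM

# Downloads the checkpoint on first use, then loads it for inference.
llm = LLM.load("microsoft/Phi-4-mini-reasoning")

# Reasoning models are tuned for step-by-step answers, so give them
# a generous token budget.
text = llm.generate(
    "Solve step by step: what is the sum of the first 20 odd numbers?",
    max_new_tokens=512,
)
print(text)
```

The 14B variants (`Phi-4-reasoning`, `Phi-4-reasoning-plus`) can be swapped in the same way, hardware permitting.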

@Borda Borda enabled auto-merge (squash) May 22, 2025 12:11
@Borda Borda disabled auto-merge May 27, 2025 10:49
@Borda Borda merged commit 241bbd6 into Lightning-AI:main May 27, 2025
24 checks passed
3 participants