Dataset Library

Reasoning traces for distilling frontier models

Curated datasets built by querying Claude, GPT, Gemini and other frontier models with diverse coding, math, and reasoning prompts. Designed for training small open models that still think clearly.

What's included

Each dataset includes detailed reasoning traces, carefully filtered conversations, and metadata ready for fine-tuning. Listings are synced hourly from Hugging Face.

Source:
41 datasets

claude-4.5-opus-high-reasoning-250x

Distilled from Claude Opus 4.5

SIZE<1KJSONTEXT
5.6K downloads292 likes

gemini-3-pro-preview-high-reasoning-250x

Distilled from Gemini 3 Pro

SIZE<1KJSONTEXT
1.1K downloads33 likes

gpt-5.2-high-reasoning-250x

SIZE<1KJSONTEXT
1.0K downloads24 likes

gemini-3-pro-preview-high-reasoning-1000x

Distilled from Gemini 3 Pro

SIZE1K–10KJSONTEXT
698 downloads72 likes

claude-sonnet-4.5-high-reasoning-250x

Distilled from Claude Sonnet 4.5

SIZE<1KJSONTEXT
441 downloads34 likes

Pony-Alpha-15k

SIZE10K–100KJSONTEXT
413 downloads54 likes

convo-v1

SIZE<1KJSONTEXT
402 downloads8 likes

Step-3.5-Flash-2600x

SIZE1K–10KJSONTEXT
391 downloads16 likes

gpt-5.1-codex-max-1000x

Distilled from GPT-5.1

SIZE1K–10KJSONTEXT
377 downloads22 likes

claude-haiku-4.5-high-reasoning-1700x

SIZE1K–10KJSONTEXT
297 downloads5 likes

deepseek-v3.2-speciale-OpenCodeReasoning-3k

Distilled from DeepSeek v3.2 Speciale

SIZE1K–10KJSONTEXT
283 downloads8 likes

MiniMax-M2.1-Code-SFT

SIZE1K–10KJSONTEXT
271 downloads13 likes

deepseek-v3.2-speciale-openr1-math-3k

Distilled from DeepSeek v3.2 Speciale

SIZE1K–10KJSONTEXT
249 downloads6 likes

gemini-3-flash-preview

SIZE10K–100KJSONTEXT
245 downloads11 likes

deepseek-v3.2-speciale-1000x

Distilled from DeepSeek v3.2 Speciale

SIZE<1KJSONTEXT
235 downloads9 likes

glm-4.7-2000x

SIZE1K–10KJSONTEXT
222 downloads89 likes

claude-haiku-4.5-1700x

SIZE1K–10KJSONTEXT
197 downloads4 likes

MiniMax-M2.1-8800x

SIZE1K–10KJSONTEXT
190 downloads15 likes

gemini-2.5-flash-11000x

Distilled from Gemini 2.5 Flash

SIZE10K–100KTEXT
185 downloads5 likes

gpt-5.1-high-reasoning-1000x

Distilled from GPT-5.1

SIZE1K–10KJSONTEXT
180 downloads26 likes

kimi-k2-thinking-1000x

Distilled from Kimi K2

SIZE<1KJSONTEXT
176 downloads9 likes

sherlock-thinking-alpha-11000x

SIZE10K–100KJSONTEXT
157 downloads4 likes

minimax-m2.1-1000x

SIZE<1KJSONTEXT
148 downloads1 likes

gpt-5-codex-1000x

Distilled from GPT-5 Codex

SIZE<1KJSONTEXT
118 downloads5 likes

glm-4.7-350x

SIZE<1KJSONTEXT
117 downloads4 likes

gemini-3-flash-preview-1000x

SIZE1K–10KJSONTEXT
105 downloads4 likes

Gemini-3-Flash-Preview-VIBE

SIZE<1KJSONTEXT
104 downloads4 likes

gpt-5-codex-250x

Distilled from GPT-5 Codex

SIZE<1KJSONTEXT
97 downloads13 likes

grok-code-fast-1-1000x

Distilled from Grok

SIZE1K–10KJSONTEXT
92 downloads6 likes

brainstorm-v3.1-grok-4-fast-200x

Distilled from Grok

SIZE<1KJSONTEXT
92 downloads3 likes

mistral-small-creative-500x

Distilled from Mistral

SIZE<1KJSONTEXT
85 downloads2 likes

Aurora-Alpha-15.5k

SIZE10K–100KJSONTEXT
72 downloads5 likes

gemini-2.5-flash-lite-2509-preview-1000x

Distilled from Gemini 2.5 Flash

SIZE<1KJSONTEXT
60 downloads1 likes

polaris-alpha-1000x

SIZE1K–10KJSONTEXT
58 downloads8 likes

MiMo-V2-Flash-2300x

SIZE1K–10KJSONTEXT
57 downloads4 likes

glm-4.6-250x

Distilled from GLM 4.6

SIZE<1KJSONTEXT
49 downloads5 likes

open-moderator-v1

SIZE10K–100KJSONTEXT
47 downloads1 likes

kimi-k2-thinking-250x

Distilled from Kimi K2

SIZE<1KJSONTEXT
46 downloads3 likes

gemini-3-flash-preview-standalone-html-1k

SIZE1K–10KJSONTEXT
42 downloads0 likes

sherlock-think-alpha-1000x

SIZE1K–10KJSONTEXT
37 downloads1 likes

sherlock-dash-alpha-1000x

SIZE1K–10KJSONTEXT
27 downloads0 likes