Dataset Library
Reasoning traces for distilling frontier models
Curated datasets built by querying Claude, GPT, Gemini and other frontier models with diverse coding, math, and reasoning prompts. Designed for training small open models that still think clearly.
What's included
Each dataset includes detailed reasoning traces, carefully filtered conversations, and metadata ready for fine-tuning. Listings are synced hourly from Hugging Face.
claude-4.5-opus-high-reasoning-250x
Distilled from Claude Opus 4.5
gemini-3-pro-preview-high-reasoning-1000x
Distilled from Gemini 3 Pro
gpt-5.1-high-reasoning-1000x
Distilled from GPT-5.1
gpt-5.2-high-reasoning-250x
claude-haiku-4.5-high-reasoning-1700x
gemini-3-flash-preview
claude-sonnet-4.5-high-reasoning-250x
Distilled from Claude Sonnet 4.5
glm-4.7-2000x
deepseek-v3.2-speciale-openr1-math-3k
Distilled from DeepSeek v3.2 Speciale
deepseek-v3.2-speciale-1000x
Distilled from DeepSeek v3.2 Speciale
gpt-5.1-codex-max-1000x
Distilled from GPT-5.1
gpt-5-codex-1000x
Distilled from GPT-5 Codex
deepseek-v3.2-speciale-OpenCodeReasoning-3k
Distilled from DeepSeek v3.2 Speciale
gpt-5-codex-250x
Distilled from GPT-5 Codex
gemini-3-pro-preview-high-reasoning-250x
Distilled from Gemini 3 Pro
MiniMax-M2.1-8800x
MiMo-V2-Flash-2300x
gemini-3-flash-preview-1000x
minimax-m2.1-1000x
grok-code-fast-1-1000x
Distilled from Grok
glm-4.6-250x
Distilled from GLM 4.6
kimi-k2-thinking-1000x
Distilled from Kimi K2
gemini-2.5-flash-11000x
Distilled from Gemini 2.5 Flash
claude-haiku-4.5-1700x
kimi-k2-thinking-250x
Distilled from Kimi K2
polaris-alpha-1000x
glm-4.7-350x
gemini-3-flash-preview-standalone-html-1k
brainstorm-v3.1-grok-4-fast-200x
Distilled from Grok
sherlock-thinking-alpha-11000x
open-moderator-v1
gemini-2.5-flash-lite-2509-preview-1000x
Distilled from Gemini 2.5 Flash