Home › Language Models › DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek-R1-Distill-Qwen-1.5B - AI Language Models Tool

Overview

DeepSeek-R1-Distill-Qwen-1.5B is a distilled dense language model derived from Qwen2.5-Math-1.5B using the DeepSeek-R1 pipeline. It targets advanced reasoning, mathematical problem solving, and code generation, with evaluation metrics and deployment instructions published on Hugging Face.

Key Features

Distilled dense language model from Qwen2.5-Math-1.5B
Optimized via the DeepSeek-R1 pipeline
Designed for advanced reasoning tasks
Targets mathematical problem solving
Capable of code generation
Published evaluation metrics on Hugging Face
Available under an MIT license

Ideal Use Cases

Research into reasoning-focused language models
Automating mathematical problem solving workflows
Generating or assisting with code snippets
Deploying models with provided Hugging Face instructions

Getting Started

Open the model page on Hugging Face URL provided
Review the MIT license and evaluation metrics
Read the repository deployment instructions
Follow provided steps to download or deploy the model

Pricing

Pricing not disclosed. Model is released under an MIT license; hosting or compute costs are separate.

Key Information

Category: Language Models
Type: AI Language Models Tool

Visit Official Website

DeepSeek-R1-Distill-Qwen-1.5B - AI Language Models Tool

Overview

Key Features

Ideal Use Cases

Getting Started

Pricing

Key Information

Related Tools

Qwen2.5-7B

DeepSeek-V3

Llama 3

UNfilteredAI-1B

FLUX1.1 [pro]

Shuttle-3