DeepSeek-R1-Distill-Qwen-1.5B - AI Language Models Tool
Overview
DeepSeek-R1-Distill-Qwen-1.5B is a distilled dense language model derived from Qwen2.5-Math-1.5B using the DeepSeek-R1 pipeline. It targets advanced reasoning, mathematical problem solving, and code generation, with evaluation metrics and deployment instructions published on Hugging Face.
Key Features
- Distilled dense language model from Qwen2.5-Math-1.5B
- Optimized via the DeepSeek-R1 pipeline
- Designed for advanced reasoning tasks
- Targets mathematical problem solving
- Capable of code generation
- Published evaluation metrics on Hugging Face
- Available under an MIT license
Ideal Use Cases
- Research into reasoning-focused language models
- Automating mathematical problem solving workflows
- Generating or assisting with code snippets
- Deploying models with provided Hugging Face instructions
Getting Started
- Open the model page on Hugging Face URL provided
- Review the MIT license and evaluation metrics
- Read the repository deployment instructions
- Follow provided steps to download or deploy the model
Pricing
Pricing not disclosed. Model is released under an MIT license; hosting or compute costs are separate.
Key Information
- Category: Language Models
- Type: AI Language Models Tool