Mochi 1 Preview - AI Vision Models Tool
Overview
Mochi 1 Preview is an open text-to-video generation model from Genmo. It is a 10-billion-parameter diffusion model built on an Asymmetric Diffusion Transformer architecture, designed to produce high-fidelity videos from text prompts, and it is released under the Apache 2.0 license.
Key Features
- 10-billion-parameter diffusion model for text-to-video generation.
- Asymmetric Diffusion Transformer architecture for high-fidelity video synthesis.
- Open-source release under the Apache 2.0 license.
- Hosted on Hugging Face for model access and downloads.
Ideal Use Cases
- Generate short videos from text prompts for creative prototyping.
- Experiment with diffusion-based video synthesis research.
- Customize the base model for experimental video pipelines.
- Distribute and modify model artifacts under Apache 2.0.
Getting Started
- Open the model page on Hugging Face: https://huggingface.co/genmo/mochi-1-preview
- Read the model card and Apache 2.0 license details.
- Follow the examples provided on the model card to load the model into your environment.
- Provide text prompts and run inference to generate videos.
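The steps above can be sketched in Python. This is a minimal sketch, not the official quickstart: it assumes a recent release of the Hugging Face diffusers library (which provides MochiPipeline), PyTorch, and a GPU with enough memory. The helper names build_request and generate_video, the default frame count and frame rate, and the bf16 variant setting are illustrative assumptions; check them against the model card before relying on them.

```python
from pathlib import Path

# Repository ID taken from the Hugging Face link in the steps above.
MODEL_ID = "genmo/mochi-1-preview"


def build_request(prompt: str, num_frames: int = 84, fps: int = 30) -> dict:
    """Validate inputs and collect the settings for one generation run.

    Hypothetical helper; the default values are assumptions, not requirements.
    """
    prompt = prompt.strip()
    if not prompt:
        raise ValueError("prompt must not be empty")
    if num_frames < 1 or fps < 1:
        raise ValueError("num_frames and fps must be positive")
    return {"prompt": prompt, "num_frames": num_frames, "fps": fps}


def generate_video(prompt: str, output_path: str = "mochi.mp4",
                   num_frames: int = 84, fps: int = 30) -> Path:
    """Load the model, run inference on a text prompt, and save an MP4.

    Hypothetical helper. Downloads the full model weights on first use.
    """
    # Heavy imports are deferred so this module can be imported
    # on a machine without torch or diffusers installed.
    import torch
    from diffusers import MochiPipeline
    from diffusers.utils import export_to_video

    request = build_request(prompt, num_frames, fps)

    # Assumption: the repository ships a bf16 weight variant.
    pipe = MochiPipeline.from_pretrained(
        MODEL_ID, variant="bf16", torch_dtype=torch.bfloat16
    )
    # Memory-saving options; they trade speed for a lower peak VRAM footprint.
    pipe.enable_model_cpu_offload()
    pipe.enable_vae_tiling()

    # The pipeline returns a batch of videos; take the first one.
    frames = pipe(request["prompt"], num_frames=request["num_frames"]).frames[0]
    export_to_video(frames, output_path, fps=request["fps"])
    return Path(output_path)
```

Calling generate_video("A red fox running through fresh snow") would then write mochi.mp4 to the working directory. Input validation is kept in a separate function so that prompts can be checked without loading the model.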
Pricing
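Pricing is not disclosed. The model itself is released under the Apache 2.0 license and is available for download on Hugging Face.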
Limitations
- Preview release: features, performance, and the API may change.
Key Information
- Category: Vision Models
- Type: AI Vision Models Tool