DeepSeek V3 is an open-source 671B-parameter Mixture-of-Experts (MoE) model that activates 37B parameters per token. It features an innovative load-balancing strategy and multi-token prediction, and was trained on 14.8T tokens. The model achieves state-of-the-art results among open models across standard benchmarks, incorporates reasoning capabilities distilled from DeepSeek-R1, and supports a 128K context window.
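The "37B activated parameters" figure follows from sparse expert routing: a gating network scores all experts per token but only the top-k actually run. The sketch below illustrates that idea in miniature; the dimensions, weights, and top-2 routing are illustrative assumptions, not DeepSeek V3's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes: 8 experts, hidden width 16. DeepSeek V3 itself uses far
# more experts and larger widths -- these numbers are for illustration.
n_experts, d = 8, 16
experts = [rng.normal(size=(d, d)) for _ in range(n_experts)]  # expert weights
gate = rng.normal(size=(n_experts, d))                         # router weights

def moe_forward(x, k=2):
    """Sparse MoE forward pass: only the top-k experts run per token,
    which is how a model can be huge overall yet cheap per token."""
    scores = gate @ x                          # router affinity for each expert
    topk = np.argsort(scores)[-k:]             # indices of the k best experts
    w = np.exp(scores[topk] - scores[topk].max())
    w /= w.sum()                               # softmax over the selected experts
    # Combine only the chosen experts' outputs, weighted by the router.
    return sum(wi * (experts[i] @ x) for i, wi in zip(topk, w))

y = moe_forward(rng.normal(size=d))
```

Because unselected experts never execute, compute per token scales with k rather than with the total expert count, mirroring the 37B-active-of-671B split described above.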
Speed Improvement: DeepSeek V3 processes 60 tokens per second, representing a 3x speed increase over its predecessor
Enhanced Capabilities: The model demonstrates improved overall performance across various tasks
Architecture: A 671B-parameter Mixture-of-Experts (MoE) design, with 37B parameters activated per token
Training Scale: Trained on 14.8 trillion high-quality tokens
API Compatibility: Maintains compatibility with previous versions for seamless transition
Open Source: Both the model and associated research papers are freely available to the community.
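Given the API-compatibility point above, a request to DeepSeek V3 can reuse existing OpenAI-style client code. The sketch below builds such a request; the endpoint URL and the "deepseek-chat" model identifier are assumptions drawn from DeepSeek's public documentation, not from this text, so verify them before use.

```python
import json

API_KEY = "sk-..."  # placeholder; supply a real key
BASE_URL = "https://api.deepseek.com/chat/completions"  # assumed endpoint

# OpenAI-style chat-completions payload. "deepseek-chat" is the assumed
# model identifier for DeepSeek V3 -- check the provider docs.
payload = {
    "model": "deepseek-chat",
    "messages": [{"role": "user", "content": "Hello"}],
    "stream": False,
}
headers = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json",
}
body = json.dumps(payload)

# Send with any HTTP client, e.g.:
# import urllib.request
# req = urllib.request.Request(BASE_URL, data=body.encode(), headers=headers)
# print(urllib.request.urlopen(req).read().decode())
```

Because the request shape matches the OpenAI chat-completions format, switching an existing integration typically only requires changing the base URL, API key, and model name.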