Models & Pricing

The prices listed below are per 1M tokens. A token is the smallest unit of text that the model recognizes; it can be a word, a number, or even a punctuation mark. We bill based on the total number of tokens the model takes as input and produces as output.

Pricing Details

| MODEL | DESCRIPTION | CONTEXT LENGTH | MAX OUTPUT TOKENS | INPUT PRICE | OUTPUT PRICE |
|---|---|---|---|---|---|
| deepseek-chat (1) | Good at general tasks | 128K | 4K (8K Beta (2)) | $0.14 / 1M tokens | $0.28 / 1M tokens |
| deepseek-coder (1) | Good at coding and math tasks | 128K | 4K (8K Beta (2)) | $0.14 / 1M tokens | $0.28 / 1M tokens |

  • (1) The backend models of deepseek-chat and deepseek-coder have been updated to DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724, respectively. You can access them without changing the model name.
  • (2) The 8K output token limit of the Chat Completion API is in Beta and requires setting base_url="https://api.deepseek.com/beta". If base_url is not set to the Beta URL, or the max_tokens parameter is not set, the limit is 4K tokens.
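
The rule in note (2) can be sketched as a small helper that reports which output ceiling applies for a given client configuration. This is only an illustration: the function name `output_token_ceiling` is hypothetical, the non-Beta URL in the usage lines is an assumed example, and reading "4K" and "8K" as 4096 and 8192 tokens is an assumption rather than something this page states.

```python
from typing import Optional

BETA_BASE_URL = "https://api.deepseek.com/beta"  # Beta URL from note (2)
DEFAULT_CEILING = 4 * 1024  # assumption: "4K" means 4096 tokens
BETA_CEILING = 8 * 1024     # assumption: "8K" means 8192 tokens


def output_token_ceiling(base_url: str, max_tokens: Optional[int]) -> int:
    """Return the output token ceiling implied by note (2).

    The 8K ceiling applies only when the Beta base_url is used AND
    max_tokens is set explicitly; otherwise the 4K ceiling applies.
    """
    on_beta = base_url.rstrip("/") == BETA_BASE_URL
    if on_beta and max_tokens is not None:
        return BETA_CEILING
    return DEFAULT_CEILING


# Usage: the first call meets both conditions, the other two do not.
beta_with_max = output_token_ceiling("https://api.deepseek.com/beta", 8000)
beta_without_max = output_token_ceiling("https://api.deepseek.com/beta", None)
non_beta = output_token_ceiling("https://api.deepseek.com", 8000)  # assumed non-Beta URL
```

Both conditions must hold for the higher ceiling, so a client that points at the Beta URL but leaves max_tokens unset still falls back to 4K.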

Deduction Rules

Expense = number of tokens × price, where the price is the per-token rate (the listed price divided by 1M). Fees are deducted directly from your topped-up balance or granted balance; when both balances are available, the granted balance is used first.
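
As a rough worked example of the deduction rule, the sketch below computes the cost of a request at the listed prices and then draws on the granted balance before the topped-up balance. The function names `request_cost` and `deduct` are hypothetical, and the sketch assumes no rounding and a single deduction per call; how DeepSeek actually rounds or batches charges is not stated on this page.

```python
from decimal import Decimal

# Prices per 1M tokens, taken from the pricing table.
INPUT_PRICE_PER_M = Decimal("0.14")
OUTPUT_PRICE_PER_M = Decimal("0.28")
MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Expense = number of tokens x price, summed over input and output."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / MILLION


def deduct(cost: Decimal, granted: Decimal, topped_up: Decimal):
    """Spend the granted balance first, then the topped-up balance.

    Returns the remaining (granted, topped_up) balances.
    """
    from_granted = min(cost, granted)
    from_topped_up = cost - from_granted
    if from_topped_up > topped_up:
        raise ValueError("insufficient balance")  # assumed behavior for this sketch
    return granted - from_granted, topped_up - from_topped_up


# Usage: 200,000 input tokens and 50,000 output tokens cost $0.042.
cost = request_cost(200_000, 50_000)
remaining = deduct(cost, granted=Decimal("0.03"), topped_up=Decimal("1.00"))
```

Here the $0.03 granted balance is used up first and the remaining $0.012 comes from the topped-up balance, leaving $0.988. `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding error.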

Product prices may vary and DeepSeek reserves the right to adjust them. We recommend topping up based on your actual usage and regularly checking this page for the most recent pricing information.