deepseek-v3:671b-q4_K_M

1.4M 4 months ago

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

671b

4 months ago

5da0e2d4a9e0 · 404GB

deepseek2
·
671B
·
Q4_K_M
{{- range $i, $_ := .Messages }} {{- if eq .Role "user" }}<|User|> {{- else if eq .Role "assista
DEEPSEEK LICENSE AGREEMENT Version 1.0, 23 October 2023 Copyright (c) 2023 DeepSeek Section I: PR
{ "stop": [ "<|begin▁of▁sentence|>", "<|end▁of▁sentence|>",

Readme

Note: this model requires Ollama 0.5.5 or later.

DeepSeek-V3 achieves a significant breakthrough in inference speed over previous models. It tops the leaderboard among open-source models and rivals the most advanced closed-source models globally.

References

GitHub

Paper