DeepSeek-R1T-Chimera


Model merge of DeepSeek-R1 and DeepSeek-V3 (0324)

An open-weights model combining the intelligence of R1 with the token efficiency of V3.

Announcement on X

Model Details

  • Architecture: DeepSeek-MoE Transformer-based language model
  • Combination Method: Merged model weights from DeepSeek-R1 and DeepSeek-V3 (0324)
  • Release Date: 2025-04-27
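
The card says the weights of the two parent models were merged but does not say how. As a rough illustration of what merging weights can mean, the sketch below linearly interpolates two toy state dicts that share tensor names and shapes. This is an assumption made for illustration only: the function name `merge_state_dicts`, the `alpha` parameter, and the toy tensors are hypothetical, and the actual Chimera construction may differ substantially.

```python
from typing import Dict, List

# A toy "state dict": tensor name -> flat list of weights.
StateDict = Dict[str, List[float]]


def merge_state_dicts(a: StateDict, b: StateDict, alpha: float = 0.5) -> StateDict:
    """Return alpha * a + (1 - alpha) * b for every tensor.

    Hypothetical helper: plain linear interpolation, not the method
    documented for DeepSeek-R1T-Chimera (the card does not specify one).
    """
    if a.keys() != b.keys():
        raise ValueError("state dicts must contain the same tensor names")
    merged: StateDict = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"shape mismatch for tensor {name!r}")
        merged[name] = [
            alpha * x + (1.0 - alpha) * y for x, y in zip(a[name], b[name])
        ]
    return merged


# Toy stand-ins for the two parent checkpoints (made-up values).
r1 = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
v3 = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

chimera = merge_state_dicts(r1, v3, alpha=0.5)
print(chimera)  # {'layer.weight': [2.0, 3.0], 'layer.bias': [0.5]}
```

Merging of this kind only works when both checkpoints share an architecture, which holds here because both parents are DeepSeek-MoE models.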

Contact

Safetensors

  • Model size: 685B params
  • Tensor types: F32, BF16, F8_E4M3