EN
deepseek-ai/Engram
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Trend
3
4.3k 311 +4/wk
GitHub
DeepSeek-V3, DeepSeek-Coder, and open-weight LLMs
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
DeepSeek LLM: Let there be answers
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Janus-Series: Unified Multimodal Understanding and Generation Models