- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 258
- Magicoder: Source Code Is All You Need
  Paper • 2312.02120 • Published • 82
- Mixtral of Experts
  Paper • 2401.04088 • Published • 160
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 109
xiepengli
ginobiLi
AI & ML interests
LLM
Recent Activity
liked a model 3 days ago: meituan/DeepSeek-R1-Channel-INT8
liked a model 5 days ago: stduhpf/google-gemma-3-27b-it-qat-q4_0-gguf-small
liked a model 6 days ago: bartowski/google_gemma-3-27b-it-qat-GGUF
Organizations
Collections: 1
models: 0 (None public yet)
datasets: 0 (None public yet)