ldwang
AI & ML interests
LLM, MLLM, Infra
Recent Activity
new activity
about 9 hours ago
junnyu/DeepScaleR-1.5B-Preview-Reproduce:entory_loss
upvoted a paper
about 18 hours ago
Learning to Reason under Off-Policy Guidance
new activity
3 days ago
junnyu/DeepScaleR-1.5B-Preview-Reproduce:question about training updates