Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10605.0
TFLOPS
12
8
Adam Yanxiao Zhao
sdpkjc
Follow
qgallouedec's profile picture
fredericmenezes's profile picture
2 followers
·
9 following
https://sdpkjc.com
sdpkjc_adam
sdpkjc
yanxiao-zhao
AI & ML interests
Reinforcement Learning
Recent Activity
published
a dataset
2 days ago
sdpkjc/SATQuest-RFT-7k
updated
a dataset
2 days ago
sdpkjc/SATQuest-RFT-3k
published
a dataset
2 days ago
sdpkjc/SATQuest-RFT-3k
View all activity
Organizations
Papers
2
arxiv:
2403.00673
arxiv:
2402.03046
models
98
Sort: Recently updated
sdpkjc/Qwen2.5-1.5B-Instruct-FT-DPO
Text Generation
•
Updated
Jan 22
•
4
sdpkjc/SmolLM2-FT-DPO
Text Generation
•
Updated
Jan 22
sdpkjc/SmolLM2-FT-MyDataset
Text Generation
•
Updated
Jan 21
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed5
Reinforcement Learning
•
Updated
Jan 20, 2024
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed4
Reinforcement Learning
•
Updated
Jan 20, 2024
Expand 98 models
datasets
16
Sort: Recently updated
sdpkjc/SATQuest-RFT-7k
Updated
2 days ago
•
2
sdpkjc/SATQuest-RFT-3k
Viewer
•
Updated
2 days ago
•
3k
•
97
sdpkjc/SATQuest-RFT-1k
Viewer
•
Updated
6 days ago
•
1k
•
164
sdpkjc/SATQuest-Max
Viewer
•
Updated
9 days ago
•
14k
•
259
sdpkjc/SATQuest-Tiny
Viewer
•
Updated
9 days ago
•
10
•
95
sdpkjc/SATQuest
Viewer
•
Updated
12 days ago
•
140
•
676
sdpkjc/SATQuest-G
Viewer
•
Updated
Mar 28
•
963
•
75
sdpkjc/NumBase-N01-S2g-B2g
Viewer
•
Updated
Feb 26
•
983k
•
20
sdpkjc/NumBase-N01-S2g-B28
Viewer
•
Updated
Feb 26
•
459k
•
21
sdpkjc/NumBase-N01-S2g-B24
Viewer
•
Updated
Feb 26
•
197k
•
23
Expand 16 datasets