Kevin King PRO
NeoCodes-dev
AI & ML interests
Deep RL, RL for LLMs
Recent Activity
updated
a collection
about 7 hours ago
Research Papers
updated
a collection
about 7 hours ago
Research Papers
upvoted
a
collection
about 7 hours ago
Multimodal LLM
Organizations
Collections
18
models
20
NeoCodes-dev/Qwen2-0.5B-GRPO-test
Updated
NeoCodes-dev/SmolLM_135M_GRPO
Text Generation
•
Updated
NeoCodes-dev/Qwen2_7B-GRPO-test
Updated
NeoCodes-dev/Qwen2.5_3B-GRPO-test
Updated
NeoCodes-dev/codeparrot-ds
Updated
•
1
NeoCodes-dev/gemma-2-2B-it-thinking-function_calling-V0
Updated
NeoCodes-dev/Unit8_part1_V1
Reinforcement Learning
•
Updated
NeoCodes-dev/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
NeoCodes-dev/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
6
NeoCodes-dev/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
datasets
0
None public yet