L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

L3 Lab
university
Models: 5
Datasets: 6
l3lab/miniCTX-v2 • 668 • 203
l3lab/miniCTX • 662 • 463 • 2
l3lab/ntp-mathlib-instruct-context-fullproof • 144k • 55 • 1
l3lab/ntp-mathlib-instruct-context • 614k • 77 • 1
l3lab/ntp-mathlib • 213k • 80 • 2
l3lab/ntp-mathlib-instruct-st • 307k • 399