L1
Collection
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
7 items
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B