Maksim Sapronov's picture

6

Maksim Sapronov

sapromak

·

sapromak

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

upvoted a paper 2 months ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

updated a model 2 months ago

JetBrains-Research/OpenCoder-1.5B-Masked-Leak

View all activity

Organizations

upvoted a paper about 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27 • 20

upvoted 2 papers 2 months ago

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

Paper • 2508.21433 • Published Aug 29 • 7

On Pretraining for Project-Level Code Completion

Paper • 2510.13697 • Published Oct 15 • 6

upvoted a collection 2 months ago

📁 Repository-Level Pre-Trained OpenCoder 🧩

All the checkpoints from Table 3 of the paper “On Pretraining for Project-Level Code Completion.” • 33 items • Updated Oct 17 • 3

upvoted a paper 2 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 37

upvoted a collection 7 months ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 88