Yushi Bai's picture

Yushi Bai

bys0318

·

https://bys0318.github.io/

bys0318

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

zai-org/GLM-4.7

upvoted a paper 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

upvoted a paper 3 months ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

View all activity

Organizations

upvoted 4 papers 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13, 2025 • 14

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 52

upvoted 2 collections 3 months ago

GLM-4.6

7 items • Updated Nov 5, 2025 • 51

SIRI

Scaling Iterative Reinforcement Learning with Interleaved Compression • 5 items • Updated Sep 30, 2025 • 3

upvoted a paper 3 months ago

SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression

Paper • 2509.25176 • Published Sep 29, 2025 • 13

upvoted a paper 4 months ago

CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling

Paper • 2509.21114 • Published Sep 25, 2025 • 16

upvoted a paper 5 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 195

upvoted 2 collections 6 months ago

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11, 2025 • 252

GLM-4.1V-Thinking

5 items • Updated Jul 2, 2025 • 57

upvoted a paper 7 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

upvoted 2 collections 7 months ago

OpenSAE-LLaMA-3.1-8B

OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29, 2025 • 5

VerIF

RL trained models and datasets for instruction-following • 7 items • Updated Jun 12, 2025 • 5

upvoted 4 papers 7 months ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published Jun 11, 2025 • 5

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Paper • 2506.04142 • Published Jun 4, 2025 • 27

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4, 2025 • 33

Are Reasoning Models More Prone to Hallucination?

Paper • 2505.23646 • Published May 29, 2025 • 24

upvoted 2 papers 8 months ago

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models

Paper • 2505.20152 • Published May 26, 2025 • 11

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83