Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
updated
a model
22 days ago
togethercomputer/M1-3B
upvoted
a
paper
about 2 months ago
MambaByte: Token-free Selective State Space Model
upvoted
a
paper
about 2 months ago
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models