Hugging Face
Junxiong Wang
PRO
JunxiongWang
16 followers · 3 following
https://www.cs.cornell.edu/~junxiong/
jxiw
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
Updated a model 22 days ago: togethercomputer/M1-3B
Upvoted a paper about 2 months ago: MambaByte: Token-free Selective State Space Model
Upvoted a paper about 2 months ago: M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
JunxiongWang's models (51)
JunxiongWang/M1-3B • Text Generation • 3B • Updated Apr 16 • 1.44k • 1
JunxiongWang/M1-3B-SFT • Text Generation • 3B • Updated Apr 16 • 6 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH • 1B • Updated Feb 11 • 4
JunxiongWang/MambaInLlama3B_SFT_MATH • 3B • Updated Feb 7 • 8
JunxiongWang/MambaInLlama3B_DPO2 • 3B • Updated Feb 5 • 4
JunxiongWang/MambaInLlama3B_DPO1 • 3B • Updated Feb 5 • 3
JunxiongWang/MambaInLlama3B_Distill_MATH • 3B • Updated Jan 27 • 236
JunxiongWang/MambaInLlama3B_v3 • 3B • Updated Jan 25 • 3
JunxiongWang/MambaInLlama1B_Distill_MATH • 1B • Updated Jan 23 • 4
JunxiongWang/mamba_0_5_distill • Updated Dec 25, 2024 • 4
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated Nov 17, 2024 • 4
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated Nov 17, 2024 • 9
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated Nov 17, 2024 • 1.05k
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated Nov 17, 2024 • 65
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated Nov 17, 2024 • 12
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated Nov 17, 2024 • 4
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated Nov 17, 2024 • 232
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated Nov 17, 2024 • 4
JunxiongWang/MambaByte_Stories • Text Generation • Updated Sep 9, 2024 • 5 • 1
JunxiongWang/MambaByte_Arxiv • Text Generation • Updated Sep 9, 2024 • 6 • 3
JunxiongWang/MambaByte_PG19_353M • Text Generation • Updated Sep 9, 2024 • 27
JunxiongWang/MambaByte_Books • Text Generation • Updated Sep 9, 2024 • 6 • 2
JunxiongWang/MambaByte_Code • Text Generation • Updated Sep 9, 2024 • 3 • 2
JunxiongWang/MambaByte_PG19_972M • Text Generation • Updated Sep 9, 2024 • 7
JunxiongWang/Mamba2InLlama_1 • Updated Sep 2, 2024 • 4 • 1
JunxiongWang/Mamba2InLlama_0_50 • Updated Sep 2, 2024 • 8
JunxiongWang/Mamba2InLlama_0_75 • Updated Sep 2, 2024 • 7
JunxiongWang/MambaInLlama_0_50 • Updated Sep 2, 2024 • 8
JunxiongWang/MambaInLlama_0_75 • Updated Sep 2, 2024 • 6
JunxiongWang/MambaInLlama_0_875 • Updated Sep 2, 2024 • 5