Hugging Face
Mayank Mishra
mayank-mishra
64 followers · 33 following
https://mayank31398.github.io/
mayankmish98
mayank31398
mayank31398
AI & ML interests
Large Language Models, Distributed Training and Inference
Recent Activity
Upvoted a paper 13 days ago:
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
Authored a paper 3 months ago:
FlashFormer: Whole-Model Kernels for Efficient Low-Batch Inference
Authored a paper 3 months ago:
PaTH Attention: Position Encoding via Accumulating Householder Transformations
mayank-mishra's models (8)
mayank-mishra/granite-3b-code-glaive-20k • Text Generation • 3B • Updated Jun 5, 2024 • 3
mayank-mishra/granite-20b-code-instruct-Q4_K_M-GGUF • Text Generation • 20B • Updated May 19, 2024 • 6
mayank-mishra/starcoder-GPTQ-8bit-128g • Updated May 5, 2023 • 11
mayank-mishra/starcoder-GPTQ-4bit-128g • Updated May 5, 2023 • 16
mayank-mishra/starcoderbase-GPTQ-4bit-128g • Updated May 5, 2023 • 21
mayank-mishra/starcoderbase-GPTQ-8bit-128g • Updated May 5, 2023 • 3
mayank-mishra/santacoder-GPTQ-4bit-128g • Updated May 4, 2023 • 2
mayank-mishra/santacoder-GPTQ-8bit-128g • Updated May 4, 2023 • 1