Mayank Mishra

mayank-mishra
https://mayank31398.github.io/
  • mayankmish98
  • mayank31398
  • mayank31398

AI & ML interests

Large Language Models, Distributed Training and Inference

Recent Activity

upvoted a paper 13 days ago
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
authored a paper 3 months ago
FlashFormer: Whole-Model Kernels for Efficient Low-Batch Inference
authored a paper 3 months ago
PaTH Attention: Position Encoding via Accumulating Householder Transformations

Organizations

  • IBM
  • BigCode
  • Aurora-M/MDEL
  • Blog-explorers
  • Aurora-M
  • IBM Granite
  • IBM Research

mayank-mishra's collections

Dolomite Engine Sample
This collection contains a sample dataset and a model trained via dolomite-engine. Repo: https://github.com/ibm-granite/dolomite-engine/
  • mayank-mishra/glaive-code-assisstant-v3-20k

    Viewer • Updated Jun 5, 2024 • 20k • 7
  • mayank-mishra/granite-3b-code-glaive-20k

    Text Generation • 3B • Updated Jun 5, 2024 • 3