This collection contains a sample dataset and a model trained via dolomite-engine. Repo: https://github.com/ibm-granite/dolomite-engine/
Mayank Mishra
mayank-mishra
AI & ML interests
Large Language Models, Distributed Training and Inference
Recent Activity
Authored a paper 3 months ago: FlashFormer: Whole-Model Kernels for Efficient Low-Batch Inference
Authored a paper 3 months ago: PaTH Attention: Position Encoding via Accumulating Householder Transformations