Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

new activity 3 days ago

nanotron/ultrascale-playbook:typos

upvoted an article 5 days ago

Kimina-Prover-RL

upvoted an article 12 days ago

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

View all activity

Organizations

New activity in nanotron/ultrascale-playbook 3 days ago

typos

#119 opened 3 days ago by

upvoted an article 5 days ago

Article

Kimina-Prover-RL

By

and 18 others •

10 days ago

• 9

upvoted an article 12 days ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

By

and 4 others •

13 days ago

• 66

upvoted a paper 13 days ago

AlphaPO -- Reward shape matters for LLM alignment

Paper • 2501.03884 • Published Jan 7 • 2

upvoted an article 16 days ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By

and 4 others •

17 days ago

• 51

upvoted an article 17 days ago

Article

Vision Language Model Alignment in TRL ⚡️

By

and 4 others •

18 days ago

• 71

published an article 18 days ago

Article

Vision Language Model Alignment in TRL ⚡️

By

and 4 others •

18 days ago

• 71

upvoted an article 24 days ago

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

24 days ago

• 63

upvoted a paper 25 days ago

EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity

Paper • 2507.21848 • Published 26 days ago • 7

upvoted a paper 28 days ago

GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface

Paper • 2507.18546 • Published about 1 month ago • 18

published a model about 1 month ago

kashif/uld-llama-from-qwen

Updated about 1 month ago

liked 2 models about 1 month ago

autogluon/mitra-classifier

Tabular Classification • Updated 24 days ago • 44.1k • 24

autogluon/mitra-regressor

Tabular Regression • Updated 24 days ago • 1.15M • 13

upvoted a collection about 1 month ago

ULD Loss (Universal LLMs Distillation)

The ULD loss, based on optimal transport, enables distillation across different LLM families without requiring shared tokenizers. • 14 items • Updated Jul 15 • 2

upvoted a paper about 1 month ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 87

updated a dataset about 1 month ago

kashif/record-test

Viewer • Updated Jul 13 • 1.79k • 41

published a dataset about 1 month ago

kashif/record-test

Viewer • Updated Jul 13 • 1.79k • 41

liked a dataset about 1 month ago

HuggingFaceTB/smoltalk2

Viewer • Updated Jul 11 • 8.61M • 41k • 93

upvoted a collection about 2 months ago

ThinkPRM

Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated 26 days ago • 3

upvoted an article about 2 months ago

Article

Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure

By

•

Jul 8

• 10