arxiv:2502.00657
Rajdeep Haldar
rhaldar97
AI & ML interests
Adversarial Robustness
Computer Vision
LLM Human Alignment
Recent Activity
submitted a paper about 2 months ago: f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment
liked a dataset 12 months ago: argilla/distilabel-math-preference-dpo
updated a dataset over 1 year ago: rhaldar97/Safety_preference

Organizations
None yet