R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
authored
a paper
3 days ago
Question Translation Training for Better Multilingual Reasoning
authored
a paper
3 days ago
R-PRM: Reasoning-Driven Process Reward Modeling
authored
a paper
3 days ago
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language
Neurons Perspective