CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs • Paper 2505.24120 • Published May 30 • 49
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning • Paper 2504.16656 • Published Apr 23 • 58
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought • Paper 2504.05599 • Published Apr 8 • 86
High-resolution Piano Transcription with Pedals by Regressing Onset and Offset Times • Paper 2010.01815 • Published Oct 5, 2020
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation • Paper 2503.19611 • Published Mar 25