- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 110
- Self-Discover: Large Language Models Self-Compose Reasoning Structures
  Paper • 2402.03620 • Published • 118
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 62
- Do language models plan ahead for future tokens?
  Paper • 2404.00859 • Published • 3
Thomas Renkert
trenkert
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
liked a dataset about 2 months ago
StudyPal/education
upvoted a paper about 2 months ago
Kwai Keye-VL Technical Report