Multimodal Synthetic Dataset and Multi-Task Reinforcement Learning Document Parser
AI & ML interests
None defined yet.
Recent Activity
Multimodal Synthetic Dataset and Multi-Task Reinforcement Learning Document Parser
LLM-based dense retrieval models for EN & ZH (also effective in other languages)
Reinforcement Learning Document Parser and High-Quality Synthetic Dataset.
OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs.