-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 81 -
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper • 2408.02657 • Published • 36 -
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
Paper • 2508.10711 • Published • 135
Charles Cai
charlescai2016
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 19 hours ago
Intern-S1: A Scientific Multimodal Foundation Model
upvoted
a
paper
about 20 hours ago
MCP-Universe: Benchmarking Large Language Models with Real-World Model
Context Protocol Servers
upvoted
an
article
2 days ago
Model2Vec: Distill a Small Fast Model from any Sentence Transformer