Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts (arXiv:2406.12034, published Jun 17, 2024)
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (arXiv:2405.12981, published May 21, 2024)
Data Engineering for Scaling Language Models to 128K Context (arXiv:2402.10171, published Feb 15, 2024)