library_name: peft license: afl-3.0 datasets: - andrewt28/keystroke-typing-videos language: - en base_model: - Qwen/Qwen2.5-Omni-3B pipeline_tag: video-text-to-text
Fine-tuned on video and audio of typing to predict the typed text.