Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
efederici 
posted an update May 10, 2024
Post
2071
Finally, I can post! 🚀

I created a Capybara-inspired Italian dataset by translating the initial instruction and running it through a pipeline to generate conversations. I used Claude Sonnet for translation and instruction generation, and Opus for generating the answers.

I hope this dataset proves useful for people working on 🇮🇹 language models.

⛁ Open sourcing the dataset here: efederici/capybara-claude-15k-ita

Nice one!