🛕 Tamil Nadu Heritage Knowledge LLM

A domain-specific Language Model dedicated to capturing the rich cultural, architectural, and historical heritage of Tamil Nadu. This LLM is fine-tuned on a custom dataset and designed to answer questions or generate content related to Tamil Nadu's temples, history, monuments, and culture.


🚀 Project Status: Early Development Phase

This project is in its starting phase, and we welcome contributors to expand the dataset, improve the model, and help build a comprehensive open-source heritage model.


📦 Base Model


🗂 Dataset

  • Source: Boobalamurugan/tn-heritage-sites-dataset
  • Size: ~1.93K entries
  • Description: This dataset contains information about heritage sites in Tamil Nadu, including temples, historical monuments, kings, architecture, and festivals.

📢 The dataset is still small. Feel free to contribute and help us grow this valuable resource!


🧠 Model Objective

  • Build a knowledgeable and culturally aware assistant focused on Tamil Nadu heritage.
  • Answer factual questions about heritage sites, kings, festivals, inscriptions, architecture, etc.
  • Generate informative content or summaries for educational or cultural purposes.

✨ Features (Planned)

  • ✅ Q&A on Tamil Nadu temples, kings, and architecture
  • ✅ Context-aware content generation (e.g., temple descriptions, cultural significance)
  • 🔄 Summarization of historical texts (Coming Soon)
  • 🔄 Integration into web or mobile apps (Planned)

🤝 How to Contribute

📁 Dataset Contributions

  • Fork the dataset repository
  • Add more entries (temples, kings, historical facts)
  • Submit a pull request with the added data

🧠 Model Training

  • Fine-tune the model with additional data
  • Improve prompt formatting and pre-processing
  • Evaluate the model’s responses

📝 License

This project is licensed under the Apache 2.0 License — free for commercial and non-commercial use with proper attribution.


🌐 Tags

#tamil-heritage #open-source #language-model #llm #tamil-nadu #text-generation


📬 Stay Connected

For updates and discussions:

  • Follow the project creator: @Boobalamurugan
  • Join our community forum (Coming Soon)
Downloads last month
16
GGUF
Model size
8.03B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Boobalamurugan/TN_Heritage_LLM

Quantized
(1)
this model

Dataset used to train Boobalamurugan/TN_Heritage_LLM