Skip to main content

📊 Open-weight releases

RankModelPrimary focusLinkdownloads*Maintainer(s)
1IndicTrans2 (family)**Multilingual translation stack covering 22 Indic languageshttps://huggingface.co/AI4Bharat/indictrans2-en-indic-1B110,328AI4Bharat
2sarvamai/sarvam-translateGemma-3 based multimodal translation & assistant for Indic workflowshttps://huggingface.co/sarvamai/sarvam-translate25,127Sarvam AI
3sarvamai/sarvam-m24B chat-tuned LLM spanning 11 Indian languageshttps://huggingface.co/sarvamai/sarvam-m2,790Sarvam AI
4sarvamai/sarvam-1Llama-2 derived multilingual chat model (bn/en/gu/hi/kn/ml/mr/or/pa/ta/te)https://huggingface.co/sarvamai/sarvam-12,504Sarvam AI
5sarvamai/OpenHathi-7B-Hi-v0.1-BaseHindi-first 7B base model for Indic workloadshttps://huggingface.co/sarvamai/OpenHathi-7B-Hi-v0.1-Base1,664Sarvam AI • AI4Bharat • Bhashini
6sarvamai/sarvam-0.52B instruction-tuned model optimized for code-mixed Indic chathttps://huggingface.co/sarvamai/sarvam-0.51,293Sarvam AI
7aashay96/indic-gptGPT-2 scale baseline for Indic text generation experimentshttps://huggingface.co/aashay96/indic-gpt175Community (aashay96)
Aggregated across ai4bharat/indictrans2-en-indic-1B (55,506), ai4bharat/indictrans2-indic-en-1B (17,999), ai4bharat/indictrans2-indic-indic-1B (13,828), ai4bharat/indictrans2-indic-indic-dist-320M (8,304), ai4bharat/indictrans2-indic-en-dist-200M (7,623), ai4bharat/indictrans2-en-indic-dist-200M (7,068).

🚧 Closed / sovereign programs without public checkpoints

  • Hanooman — SML India, Seetha.ai, IIT Bombay: Multilingual suite (22 Indic + 10 global languages); enterprise deployments, no Hugging Face release.
  • Bhashini / AIRAWAT LLM — MeitY, C-DAC: National language mission building sovereign inference infrastructure; models gated within the NLTM stack.
  • Krutrim — Ola / Krutrim SI Designs: Full-stack AI platform with proprietary pretraining on Indian corpora; checkpoints not open-sourced.
  • Dhruva — NVIDIA, C-DAC, MeitY: AIRAWAT-backed compute cluster enabling large-scale sovereign model training rather than a single public LLM.
  • Kosmos India (AIRAWAT) — NIC, C-DAC Pune: Multimodal research initiative (text + vision) currently limited to internal partners.
  • BharatGPT — Reliance Jio, IIT Bombay: In-development enterprise LLM suite; benchmarks and weights remain undisclosed.

🧠 Govt & Research Infrastructure Enablers