Why Mamba 2/3, not Transformers.
Linear inference scaling and constant-memory decoding fit Tamil's agglutinative morphology and long classical document lengths. Transformer scaling doesn't.
The Tamil substrate. A bilingual foundation model on Mamba 2/3 - built so that everything else Tamil at Murai Labs has a real model to stand on.
Tamil has 80M+ speakers worldwide and a literary tradition going back 2,000 years. It also has no foundation model of its own.
Every "Tamil AI" today is an English-thinking model in Tamil clothing - bolted on, lossy, and culturally tone-deaf. TamilLM is the substrate I'm building so that doesn't have to keep being true.
Civilizational memory deserves better infrastructure than a translation API.
Linear inference scaling and constant-memory decoding fit Tamil's agglutinative morphology and long classical document lengths. Transformer scaling doesn't.
A 32K tokenizer designed for Tamil orthography - classical, devotional, literary, formal, colloquial, Tanglish, English. Existing English-biased BPE breaks Tamil word formation.
A Tamil model behind an API isn't cultural infrastructure. Quantized for Jetson Orin and Thor, it can live in homes, schools, and institutions - locally, offline, with no per-token cost.