madhavantechie
UPDATE: Complete Tamil tokenizer with 291 characters (was 286) - Full Unicode coverage including all vowels, consonants, uyirmei combinations, Grantha characters, digits, and symbols
1d5977d verified
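
The commit message does not say how the 291 characters break down. As a rough illustration only, the sketch below enumerates the standard 247-letter Tamil set (12 vowels, 18 consonants with pulli, 216 uyirmei combinations, and aytham) from Unicode code points. The function and constant names are hypothetical and not taken from this repository, and the Grantha characters, digits, and symbols that would account for the remaining entries are deliberately left out because their exact list is not given here.

```python
# Illustrative sketch, not the repository's tokenizer code.
# All names here (VOWELS, build_base_inventory, ...) are hypothetical.

VOWELS = "அஆஇஈஉஊஎஏஐஒஓஔ"            # 12 uyir (vowel) letters
CONSONANTS = "கஙசஞடணதநபமயரலவழளறன"   # 18 mei (consonant) base letters

# Dependent vowel signs in the same order as VOWELS; the inherent
# vowel "a" has no sign, hence the empty string in first position.
VOWEL_SIGNS = [
    "", "\u0BBE", "\u0BBF", "\u0BC0", "\u0BC1", "\u0BC2",
    "\u0BC6", "\u0BC7", "\u0BC8", "\u0BCA", "\u0BCB", "\u0BCC",
]
PULLI = "\u0BCD"    # virama: marks a pure consonant
AYTHAM = "\u0B83"   # the aytham letter


def build_base_inventory():
    """Return the 247 standard Tamil letters as a list of strings."""
    uyir = list(VOWELS)
    mei = [c + PULLI for c in CONSONANTS]
    # Each consonant combined with each vowel sign: 18 x 12 = 216.
    uyirmei = [c + s for c in CONSONANTS for s in VOWEL_SIGNS]
    return uyir + mei + uyirmei + [AYTHAM]


inventory = build_base_inventory()
print(len(inventory))  # 247
```

Most uyirmei entries are two code points long (consonant plus vowel sign), so a character-level tokenizer that treats them as single units has to match these sequences rather than split on individual code points.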