A simple, consistent and extendable toolkit for IndicTrans2
Updated Jan 16, 2025 - Python
MILU (Multi-task Indic Language Understanding Benchmark) is a comprehensive evaluation dataset designed to assess the performance of LLMs across 11 Indic languages.
Fine-tuned and compared 3 🤗 pre-trained Multilingual LLMs
Setu dashboard is an all-in-one Streamlit application that allows users to provide feedback on the outputs of the Setu data cleaning pipeline for @AI4Bharat
This repository contains Python implementations for processing multilingual text data. It addresses two distinct tasks, language classification and English translation, each of which poses different text-processing challenges.