Skip to content
AI4Bharat
  • Pages
    • AI4Bharat Public
      • Seminars
      • icon picker
        Publications
      • People
      • Models
    • AI4Bharat Admin
      • Members
      • Planning
      • Licensing
      • Meity Timelines
      • Hiring
        • AI4Bharat Summer of Code
    • IndicMining
      • Meeting Minutes
      • NeurIPS dataset paper plan
    • IndicASR
      • RNN-T
      • Multilingual ASR
        • Analysis
      • Adaptation in End-to-End Speech Recognition
      • Data Augmentation
      • Text Normalization for speech
    • Shoonya
      • Documentation - User Manual
        • Welcome Page
        • User-Roles on Shoonya
        • Getting Started with Workflow
          • Manager Workflow
          • Language-Experts Workflow
            • Annotation Workflow
            • Collection Workflow
        • Terminology
        • FAQs and Feedback
      • Management Dashboard
        • Language Experts
        • Annotation Tasks
      • Reporting and Analytics
        • Projects DataExports
        • Task Details
    • Shoonya Development Document
      • Shoonya Workflow
      • Software Architecture Diagrams
      • Technology Used
      • Shoonya Code Structure
      • Shoonya Deployment
    • Shoonya Forms
      • Feature Suggestions
      • Report Bugs for Shoonya
      • User Feedbacks
      • Stats-collection Forms
Share
Explore

Publications

List of Publications
2022
4
Towards Building ASR Systems for the Next Billion Users
Tahir Javed, Sumanth Doddapaneni, Abhigyan Raman, Kaushal Santosh Bhogale, Gowtham Ramesh, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M Khapra
AAAI
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages.
Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra
Transactions of the ACL
IndicBART: A Pre-trained Model for Natural Language Generation of Indic Languages.
Raj Dabre, Himani Shrotriya, Anoop Kunchukuttan, Ratish Puduppully, Mitesh M. Khapra, Pratyush Kumar
Findings of ACL
IndicNLG Suite: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Amogh Mishra, Mitesh M. Khapra, Pratyush Kumar
arXiv preprint arXiv:2203.05437
2021
1
A primer on pretrained multilingual language models
Sumanth Doddapaneni, Gowtham Ramesh, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M Khapra
arXiv preprint arXiv:2107.00676
2020
1
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages.
Divyanshu Kakwani, Anoop Kunchukuttan, Satish Golla, Gokul N.C., Avik Bhattacharyya, Mitesh M. Khapra, Pratyush Kumar
Findings of EMNLP

 
Want to print your doc?
This is not the way.
Try clicking the ··· in the right corner or using a keyboard shortcut (
CtrlP
) instead.