IEEE Open Journal of Signal Processing (Jan 2024)

Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech

  • Abhayjeet Singh
  • Amala Nagireddi
  • Anjali Jayakumar
  • Deekshitha G
  • Jesuraja Bandekar
  • Roopa R
  • Sandhya Badiger
  • Sathvik Udupa
  • Saurabh Kumar
  • Prasanta Kumar Ghosh
  • Hema A Murthy
  • Heiga Zen
  • Pranaw Kumar
  • Kamal Kant
  • Amol Bole
  • Bira Chandra Singh
  • Keiichi Tokuda
  • Mark Hasegawa-Johnson
  • Philipp Olbrich

DOI
https://doi.org/10.1109/OJSP.2024.3379092
Journal volume & issue
Vol. 5
pp. 790 – 798

Abstract

The Lightweight, Multi-speaker, Multi-lingual Indic Text-to-Speech (LIMMITS'23) challenge is organized as part of the ICASSP 2023 Signal Processing Grand Challenge. LIMMITS'23 aims to develop a lightweight, multi-speaker, multi-lingual Text-to-Speech (TTS) model using datasets in Marathi, Hindi, and Telugu, with at least 40 hours of data released for each of the male and female voice artists in each language. The challenge encourages the advancement of TTS in Indian languages as well as the development of techniques for TTS data selection and model compression. The three tracks of LIMMITS'23 have given researchers and practitioners around the world an opportunity to explore state-of-the-art techniques in TTS research.

Keywords