<aside> 💡
POC for JioStar
Led the research and development of speech-to-speech for JioStar
The core idea is to convert English audio to Indian regional languages
Researched multiple ways to built the speech-to-speech pipeline, prepared technical report explaining which is better focusing on three parameters — Latency, Speed, Output Accuracy
This POC is built on top of OpenAI’s Realtime API
English Input-
https://drive.google.com/file/d/1_RXCg-EUhTXpVWUEr0xVw2rb0Btyte4p/view?usp=drive_link
Hindi Output-
https://drive.google.com/file/d/1-3mZnhKyi2AjtrWvCGrtyzIvOfZ92ET-/view?usp=drive_link
Marathi Output-
https://drive.google.com/file/d/1C4kltqNbkcDLs78aB7kf8OjWEc4CGhsB/view?usp=drive_link
Punjabi Output-
https://drive.google.com/file/d/1E_0-KLLmbd0cnwRDpjfqK0kZzT2_ZsvE/view?usp=drive_link
Gujarati Output-