I work at Sea AI Lab as a research scientist now, leading the audio team and doing some fundamental audio-related research. We are hiring researchers and engineers to work on TTS, music generation, speech translation and audio-driven talking face generation. If interested, feel free to email me at renyi@sea.com.

I graduated from Chu Kochen Honors College, Zhejiang University (浙江大学竺可桢学院) with a bachelor’s degree and from the Department of Computer Science and Technology, Zhejiang University (浙江大学计算机科学与技术学院) with a master’s degree, advised by Zhao Zhou (赵洲). I also collaborate with Xu Tan (谭旭), Tao Qin (秦涛) and Tie-yan Liu (刘铁岩) from Microsoft Research Asia closely.

I won the Baidu Scholarship (10 candidates worldwide each year) and ByteDance Scholars Program (10 candidates worldwide each year) in 2020 and was selected as one of the top 100 AI Chinese new stars and AI Chinese New Star Outstanding Scholar (10 candidates worldwide each year).

My research interest includes speech synthesis, neural machine translation and automatic music generation. I have published more than 20 papers at the top international AI conferences such as NeurIPS, ICML, ICLR, KDD.

To promote the communication among the Chinese ML & NLP community, we (along with other 11 young scholars worldwide) founded the MLNLP community in 2021. I am honored to be one of the chairs of the MLNLP committee.

🔥 News

  • 2022.05: I join Sea AI Lab as the audio team leader. We are hiring researchers and engineers!
  • 2022.04: Three papers are accepted by IJCAI 2022:
    • SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech, Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu
    • EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling, Lichao Zhang, Zhou Zhao, Yi Ren, Liqun Deng
    • FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis, Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao
  • 2022.03: We release NeuralSVB, the code of our ACL 2022 work (singing voice beautifying). 🚧 ⛏️ 🛠️ 👷
  • 2022.02: I release a modern and responsive academic personal homepage template. Welcome to STAR and FORK!
  • 2022.02: 🎉🎉 Two papers are accepted by ACL 2022:
  • 2022.02: 🎉🎉 My google scholar citations have exceeded 1000!
  • 2022.02: We public a Non-Autoregressive Text-to-Speech (NAR-TTS) framework NATSpeech , including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022). 🎉🎉 It was shown on the Github Daily Trending List on 19 Feb 2022!

📝 Publications

🎙 Speech Synthesis

NeurIPS 2019
sym

FastSpeech: Fast, Robust and Controllable Text to Speech
Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Project

  • FastSpeech is the first fully parallel end-to-end speech synthesis model.
  • Academic Impact: This work is included by many famous speech synthesis open-source projects, such as ESPNet . Our work are promoted by more than 20 media and forums, such as 机器之心InfoQ.
  • Industry Impact: FastSpeech has been deployed in Microsoft Azure TTS service and supports 49 more languages with state-of-the-art AI quality. It was also shown as a text-to-speech system acceleration example in NVIDIA GTC2020.
ICLR 2021
sym

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu

Project

NeurIPS 2021
sym
AAAI 2022
sym

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao

Project | | | Hugging Face

👄 Lip Generation/Understanding

📚 Machine Translation

🎼 Music Generation

🧑‍🎨 Generative Model

🎖 Honors and Awards

  • 2021.10 Tencent Scholarship (Top 1%)
  • 2021.10 National Scholarship (Top 1%)
  • 2020.12 Baidu Scholarship (10 students in the world each year)
  • 2020.12 AI Chinese new stars (100 worldwide each year)
  • 2020.12 AI Chinese New Star Outstanding Scholar (10 candidates worldwide each year)
  • 2020.12 ByteDance Scholars Program (10 students in China each year)
  • 2020.10 Tianzhou Chen Scholarship (Top 1%)
  • 2020.10 National Scholarship (Top 1%)
  • 2015.10 National Scholarship (Undergraduate) (Top 1%)

📖 Educations

  • 2019.06 - 2022.04 (now), Master, Zhejiang University, Hangzhou.
  • 2015.09 - 2019.06, Undergraduate, Chu Kochen Honors College, Zhejiang Univeristy, Hangzhou.
  • 2012.09 - 2015.06, Luqiao Middle School, Taizhou.

💬 Invited Talks

  • 2022.02, Hosted MLNLP seminar | [Video]
  • 2021.06, Audio & Speech Synthesis, Huawei internal talk
  • 2021.03, Non-autoregressive Speech Synthesis, PaperWeekly & biendata | [video]
  • 2020.12, Non-autoregressive Speech Synthesis, Huawei Noah’s Ark Lab internal talk

💻 Internships