Best Paper |
FLEURS: FEW-SHOT LEARNING EVALUATION OF UNIVERSAL REPRESENTATIONS OF SPEECH |
Alexis Conneau, Min Ma, Simran Khanuja, Yu Zhang, Vera Axelrod, Siddharth Dalmia, Jason Riesa, Clara Rivera, Ankur Bapna |
Best Paper |
On the Utility of Self-supervised Models for Prosody-related Tasks |
Guan-Ting Lin, Chi Luen Feng, Wei-Ping Huang, Yuan Tseng, Chen An Li, Tzu-Han Lin, Hung-yi Lee, Nigel Ward |
Best Student Paper |
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation |
Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe, Nobutaka Ono |
Honorable Mention |
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models |
Yinghao A Li, Cong Han, Nima Mesgarani |
Honorable Mention |
PEPPANET: EFFECTIVE MISPRONUNCIATION DETECTION AND DIAGNOSIS LEVERAGING PHONETIC, PHONOLOGICAL, AND ACOUSTIC CUES |
Bi-Cheng Yan, Hsin-Wei Wang, Berlin Chen |