Publication

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Aug. 20, 2022

Projects

Voice Anonymization

Wonjune Kang, Mark Hasegawa-Johnson, and Deb Roy. "End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions." In Proc. Interspeech, pp. 2303-2307. 2023.

Abstract

Zero-shot voice conversion is becoming an increasingly popular research topic, as it promises the ability to transform speech to sound like any speaker. However, relatively little work has been done on end-to-end methods for this task, which are appealing because they remove the need for a separate vocoder to generate audio from intermediate features. In this work, we propose LVC-VC, an end-to-end zero-shot voice conversion model that uses location-variable convolutions (LVCs) to jointly model the conversion and speech synthesis processes. LVC-VC utilizes carefully designed input features that have disentangled content and speaker information, and it uses a neural vocoder-like architecture that utilizes LVCs to efficiently combine them and perform voice conversion while directly synthesizing time-domain audio. Experiments show that our model achieves especially well-balanced performance between voice style transfer and speech intelligibility compared to several baselines.

via ISCA Archive

Anonymization of Voices in Spaces for Civic Dialogue: Measuring Impact on Empathy, Trust, and Feeling Heard

Wonjune Kang, Margaret A. Hughes, and Deb Roy. "Anonymization of Voices in Spaces for Civic Dialogue: Measuring Impact on Empathy, Trust, and Feeling Heard." Proceedings of the ACM on Human-Computer Interaction 8, no. CSCW2 (2024): 1-22.

Article Research

Shared Voices, Shared Experiences with realtalk@MIT

Center for Constructive Communication project builds connections through conversations

Post Research

Deb Roy named to Aspen Institute Commission on Information Disorder

The Commission's members are drawn from government, research and academia, civil society, public service, and private industry.

Publication Research

Using Twitter Data to Understand Public Perceptions of Approved versus Off-label Use for COVID-19-related Medications

Hua, Yining*, Hang Jiang*, Shixu Lin, Jie Yang, Joseph M. Plasek, David W. Bates, and Li Zhou. "Using Twitter Data to Understand Public Perceptions of Approved versus Off-label Use for COVID-19-related Medications." Journal of the American Medical Informatics Association (2022). *Equal Contribution.