제목 On a Voice Conversion by using Prosodic Control
저자명

Jongkuk Kim, Min-Cheol Hong, Hernsoo Hahn
 

초   록

Voice conversion is a method that aims to transform the input speech signal such that the output signal will be perceived as produced by another speaker .Speech synthesizers using voice conversion technologies allow developers to create more voices from a single database and users to personalize the synthesizer to speak with any desired voice after a training period. In this paper, we present the method that converts time and pitch scaling using spectral mapping and PSOLA technique with OLA. This new synthesis scheme allows very flexible modifications of the pitch-scale, the time-scale and the spectral envelope characteristics while producing high-quality speech output. This synthesis scheme is thus well suited to voice conversion. Further work will be conducted on a matching method to correspond well with each phonetic information, and larger corpora to assess the robustness of the method.



원문 수록처 International Conference on Advanced Computer Science and Electronics Information (ICACSEI 2013) pp477-481
자료유형 International Journal