year 2, Issue 2 (Journal of Acoustical Engineering Society of Iran 2015)                   مجله انجمن علوم صوتی ایران (مهندسی صوتیات سابق) 2015, 2(2): 41-52 | Back to browse issues page

XML Persian Abstract Print


Abstract:   (7703 Views)

One of the most serious bottlenecks of voice conversion is the need of high number of training sentences from target speaker. Parallel voice conversion methods need 50 to 200 training sentences and nonparallel conversion methods need several times of it. In this research, a new method for construction of all Persian language vowels only with one training word of target speaker is introduced. In each word there is at least one syllable and in each syllable there is one vowel. Basic idea of research is designing direct transformations between each vowel and other vowels. Based on this idea, by offline training of sets of transformations, it is possible to construct all vowels from one arbitrary vowel. In Persian language there are 6 vowels. Because of it, 30 of such mentioned transformations must be designed. These transformations were designed with Gaussian conditional functions and trained by 10 speakers in supervised manner. With distortion distance criterion, average distance between real and artificial vowels was 0.4459.

Full-Text [PDF 140 kb]   (1115 Downloads)    
Type of Study: Research | Subject: Hydroacoustics
Received: 2014/07/12 | Accepted: 2014/12/2 | Published: 2015/03/19

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.