A novel approach to stuttered speech correction
Stuttered speech is a dysfluency rich speech, more prevalent in males than females. It has been associated with insufficient air pressure or poor articulation, even though the root causes are more complex. The primary features include prolonged speech and repetitive speech, while some of its seconda...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Faculty of Computer Science, Universitas Indonesia
2016
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/51544/ http://irep.iium.edu.my/51544/ http://irep.iium.edu.my/51544/1/p.sed.jiki.2016.382-1017-1-PB.pdf |
Summary: | Stuttered speech is a dysfluency rich speech, more prevalent in males than females. It has been associated with insufficient air pressure or poor articulation, even though the root causes are more complex. The primary features include prolonged speech and repetitive speech, while some of its secondary features include, anxiety, fear, and shame. This study used LPC analysis and synthesis algorithms to reconstruct the stuttered speech. The results were evaluated using cepstral distance, Itakura-Saito distance, mean square error, and likelihood ratio. These measures implied perfect speech reconstruction quality. ASR was used for further testing, and the results showed that all the reconstructed speech samples were perfectly recognized while only three samples of the original speech were perfectly recognized.
Shuttered speech adalah speech yang kaya dysfluency, lebih banyak terjadi pada laki-laki daripada
perempuan. Ini terkait dengan tekanan udara yang tidak cukup atau artikulasi yang buruk, meskipun
akar penyebabnya lebih kompleks. Fitur utama termasuk speech yang berkepanjangan dan berulangulang,
sementara beberapa fitur sekunder meliputi, kecemasan, ketakutan, dan rasa malu. Penelitian ini
menggunakan LPC analysis dan synthesis algoritma untuk merekonstruksi stuttered speech. Hasil
dievaluasi menggunakan jarak cepstral, jarak Itakura-Saito, mean square error, dan rasio likelihood.
Langkah-langkah ini terkandung kualitas speech reconstruction yang sempurna. ASR digunakan untuk
pengujian lebih lanjut, dan hasilnya menunjukkan bahwa semua sampel speech yang terekonstruksi
dikenali dengan sempurna sementara hanya tiga sampel dari speech asli dikenali dengan sempurna |
---|