ALT Speech Recognition System using F0 Improvement and Spectral Tilt Method
Inbanila K1, KrishnaKumar E2
1Inbanila K, CHRIST (Deemed to be University), Bangalore, Karnataka, India.
2KrishnaKumar E, East Point College of Engineering and Technology, Avalahalli, Bengaluru, Karnataka, India.
Manuscript received on February 05, 2019. | Revised Manuscript received on February 14 2019. | Manuscript published on August 30, 2019. | PP: 3556-3561 | Volume-8 Issue-6, August 2019. | Retrieval Number: F9348088619/19©BEIESP | DOI: 10.35940/ijeat.F9348.088619
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Human Beings use voice as the medium for communication. Human Speech is a very complex signal with multiple frequencies, amplitudes and intensities that mix up to convey specific information. In international terminology, voice disorders are described as dysphonia. Various dysphonia’s are clearly organic origin due to nervous, muscular, neuro or cellular degenerative disease affecting the body or it is from local laryngeal changes. Other dysphonia’s having no visible laryngeal causes are grouped as non organic involving habitual dysphonia’s that arise from faulty speaking habits or the psycho genic dysphonia’s that stem from emotional causes. This paper looks at a speech recognition system for disordered speech generated by Physically Disabled people using Artificial Larynx Transducer (ALT) device from the perspective of Speech Signal Processing. From the ALT speech features like formant, pitch and spectral tilt is estimated. For formant frequency estimation RNN technique is used. Before training the system pitch frequency improvement is accomplished. Now the features and homomorphic based coefficients are used for training the system. The same operation is performed during the test phase and compared with the training set. Comparison and decision making is accomplished using distance estimator.
Keywords: ALT speech, Formant frequency, Spectral Tilt, Disordered speech, Healthy speech (HE speech), DREL noise.