Research Article Open Access

Speech Compression for Noise-Corrupted Thai Expressive Speech

Suphattharachai Chomphan

Abstract

Problem statement: In speech communication, speech coding aims at preserving the speech quality with lower coding bitrate. When considering the communication environment, various types of noises deteriorates the speech quality. The expressive speech with different speaking styles may cause different speech quality with the same coding method. Approach: This research proposed a study of speech compression for noise-corrupted Thai expressive speech by using two coding methods of CS-ACELP and MP-CELP. The speech material included a hundredmale speech utterances and a hundred female speech utterances. Four speaking styles included enjoyable, sad, angry and reading styles. Five sentences of Thai speech were chosen. Three types of noises were included (train, car and air conditioner). Five levels of each type of noise were varied from 0-20 dB. The subjective test of mean opinion score was exploited in the evaluation process. Results: The experimental results showed that CS-ACELP gave the better speech quality than that of MP-CELP at all three bitrates of 6000, 8600-12600 bps. When considering the levels of noise, the 20-dB noise gave the best speech quality, while 0-dB noise gave the worst speech quality. When considering the speech gender, female speech gave the better results than that of male speech. When considering the types of noise, the air-conditioner noise gave the best speech quality, while the train noise gave the worst speech quality. Conclusion: From the study, it can be seen that coding methods, types of noise, levels of noise, speech gender influence on the coding speech quality.

Journal of Computer Science
Volume 7 No. 10, 2011, 1565-1573

DOI: https://doi.org/10.3844/jcssp.2011.1565.1573

Submitted On: 31 May 2011 Published On: 6 August 2011

How to Cite: Chomphan, S. (2011). Speech Compression for Noise-Corrupted Thai Expressive Speech. Journal of Computer Science, 7(10), 1565-1573. https://doi.org/10.3844/jcssp.2011.1565.1573

  • 2,833 Views
  • 2,365 Downloads
  • 0 Citations

Download

Keywords

  • Multi-Pulse based (MP-CELP)
  • code excited
  • linear predictive
  • speech coding
  • bitrate scalability
  • Linear Prediction (LP)
  • expressive speech
  • speech compression
  • speech quality
  • coding methods
  • speech signal
  • perceptual weighting