<< Chapter < Page | Chapter >> Page > |
The components of speech are the words and the voice .
Every phrase is a union of these two components - they are the foundations of the spoken language. One or the other does not mean much without its conterpart. Words without voice lackintonation, so they have no meaning. Voice without words is devoid of structure and cannot possibly transfer information. Only the fusion of the two can claim to be such a thing as speech.
In biology, the components of speech are produced in different organs. To speak, air is first released over the vocal cords, which expand and contract to give the air column structure. This isthe biological concept of words. The words are then passed through the vocal tract where they are shaped, givingthem intonation. This shaping of the words is the biological concept of voice. Such a biological process can be easily modeled.
So far, we have determined that speech is a collection of words shaped by voice. Here, we present a model of this. In this model, the words are called the source . Since the words are modified by voice, we say the source passes through a filter . This brings us to the source filter model of speech.
This model has possibility for application in many different fields. We will focus on the topic of signal processing here.
The source filter model can easily be extended to signal processing. The source is simply a signal . This signal isthe input to the filter and is called the excitation signal since it excites the vocal tract. The vocal tract is a filter similar to all filters we have studied so far: it is a lineartime-invariant system with impulse response . This is sometimes called the transfer function of speech since it is what transfers the excitation signal to speech - it adds voice to words. Speech is theoutput of thesource signal passed through the filter with impulse response . Thus, theoutput is given by . This is depicted below:
Since speech is simply a convolution of a source signal with a filter'sinput response , we can analyze these signals to determine the characeristics of a speech signal . However, wemust first deconvolve these signals so that they can be processed individually. This topic is exploredin the next section covering deconvolution .
Huckvale, Mark. "Lecture 8: Source-Filter Model of Speech Production." B214: Phonetic Science: Acoustics of Speech and Hearing. University College London. (External Link) .
Johnson, Don. Connexions module m0049: Modeling the Speech Signal .
Notification Switch
Would you like to follow the 'Methods for voice conversion' conversation and receive update notifications?