<< Chapter < Page Chapter >> Page >
The audio features used to characterize the sound signal and classify the sample by instrument.

How do we decide what parts of the spectrum are important? The CUIDADO project (2) provided a set of 72 audio features, and research1 has shown that some of the features aremore important in capturing the signal characteristics. We therefore decided to implement a small subset of thesefeatures:

Cepstral Features

  • Mel-Frequency Cepstrum Coefficients (MFCC), k = 2:13

Spectral Features

  • Slope
  • Roll-Off
  • Centroid
  • Spread
  • Skew
  • Kurtosis
  • Odd-to-Even Harmonic Energy Ratio (OER)
  • Tristimulus

Definitions

Cepstral coefficients have received a great deal of attention in the speech processing community, as they tryto extract the characteristics of the filter and model it independently of the signal being produced. This is ideal, as thefilter in our case is the instrument that we are trying to recognize. We work on a Mel scale because it more accurately modelshow the human auditory system perceives different frequencies, i.e. it gives more weight to changes at low frequencies as humans aremore adept at distinguishing low frequency changes.

The centroid correlates to the “brightness” of the sound and is often higher than expected due to the energyfrom harmonics above the fundamental frequency. The spread, skew, and kurtosis are based on the 2nd, 3rd, and 4th moments and, alongwith the slope, help portray spectral shape.

Odd-to-even harmonic energy ratio simply determines whether a sound consists primarily of odd harmonicenergy, of even harmonic energy, or whether the harmonic energy is equally spread.

The tristimulus measure energy as well and were introduced as the timbre equivalent to the color attributes ofvision. Like the OER, it provides clues regarding the distribution of harmonic energy, this time focusing on low, mid, and highharmonics rather than odd and even harmonics. This gives more weight to the first few harmonics, which are perceptually moreimportant.

How we chose features

MFCC have shown to work very well in monophonic environments, as they capture the shape of the spectrumvery effectively. Unfortunately, they are of less use in polyphonic recordings, as the MFCC captures the shape of a spectrum calculatedfrom multiple sources. Most of the work we have seen on this subject uses MFCC regardless, however. They are particularly usefulif only one instrument is playing or is relatively quite salient.

Most wind instruments have their harmonics evenly spread among the odd and even indices, but the clarinet isdistinct in that it produces spectra consisting predominantly of odd ratios, with very little even harmonics appearing at all. Thismakes sense from a physics standpoint, as when played, the clarinet becomes a closed cylinder at one end, therefore allowing only theodd harmonics to resonate. This feature was thus chosen primarily with clarinet classification in mind.

We chose the roll-off and tristimulus as our energy measures, as they were both easy to implement and judged tobe important (1) . Finally, the first four spectral moments and the spectral slope, in both perceptual and spectral models, were shownto be the top ten most important features in the same study and were therefore some of the first features added to our classificationsystem. We note that we had hoped to implement a perceptual model and thereby nearly double our features, but we could not find anaccurate filter model for the mid-ear and thus decided to forgo any features based on perceptual modeling.

For further discussion of these features, along with explicit mathematical formulas, please refer to (1) .

References

  • A.A. Livshin and X. Rodet. “Musical Instrument Identification in Continuous Recordings,” in Proc. of the 7th Int. Conference onDigital Audio Effects, Naples, Italy, October 5-8, 2004.
  • G. Peeters. “A large set of audio features for sound description (similarity and classification) in the CUIDADOproject,” 2003. URL: http://www.ircam.fr/anasyn/peeters/ARTICLES/Peeters_2003_cuidadoaudiofeatures.pdf.

Questions & Answers

profit maximize for monopolistically?
Usman Reply
what kind of demand curve under monopoly?
Mik Reply
what is the difference between inflation and scarcity ?
Abdu Reply
What stops oligopolists from acting together as a monopolist and earning the highest possible level of profits?
Mik
why economics is difficult for 2nd school students.
Siraj Reply
what does mean opportunity cost?
Aster Reply
what is poetive effect of population growth
Solomon Reply
what is inflation
Nasir Reply
what is demand
Eleni
what is economics
IMLAN Reply
economics theory describes individual behavior as the result of a process of optimization under constraints the objective to be reached being determined by
Kalkidan
Economics is a branch of social science that deal with How to wise use of resource ,s
Kassie
need
WARKISA
Economic Needs: In economics, needs are goods or services that are necessary for maintaining a certain standard of living. This includes things like healthcare, education, and transportation.
Kalkidan
What is demand and supply
EMPEROR Reply
deman means?
Alex
what is supply?
Alex
ex play supply?
Alex
Money market is a branch or segment of financial market where short-term debt instruments are traded upon. The instruments in this market includes Treasury bills, Bonds, Commercial Papers, Call money among other.
murana Reply
good
Kayode
what is money market
umar Reply
Examine the distinction between theory of comparative cost Advantage and theory of factor proportion
Fatima Reply
What is inflation
Bright Reply
a general and ongoing rise in the level of prices in an economy
AI-Robot
What are the factors that affect demand for a commodity
Florence Reply
price
Kenu
differentiate between demand and supply giving examples
Lambiv Reply
differentiated between demand and supply using examples
Lambiv
what is labour ?
Lambiv
Got questions? Join the online conversation and get instant answers!
Jobilize.com Reply

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Elec 301 projects fall 2005. OpenStax CNX. Sep 25, 2007 Download for free at http://cnx.org/content/col10380/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Elec 301 projects fall 2005' conversation and receive update notifications?

Ask