<< Chapter < Page | Chapter >> Page > |
The overall approach is shown in figure 4.1. A PIN consists of a username and recordings of four separate (though not necessarily unqiue) numbers. A user is added to the database by adding seven recordings of all PIN numbers.
A candidate is a specified username and a single recoding of all PIN numbers. Upon submission to the software, the relevant signal is extracted from the entire candidate recording.
From the database, all seven recordings are combined into an “average” signal. The average signal and the extracted candidate signal are the compared – this results in a metric value. Based on this metric, a decision is made as to whether or not the candidate is a match or an impostor.
Figure 4.1: Overall approach for speaker identification
Notification Switch
Would you like to follow the 'Analysis of speech signal spectrums using the l2 norm' conversation and receive update notifications?