<< Chapter < Page | Chapter >> Page > |
Consider the general hypothesis testing problem where we have -dimensional observations and hypotheses. If the data are real-valued, for example, then a hypothesistest is a mapping For every possible realization of the input, the test outputs a hypothesis. The test partitions the input space into a disjoint collection , where The sets are called decision regions . The boundary between two decision regions is a decision boundary . depicts these concepts when , , and .
If the distribution of the data under a certain hypothesis is fully known, we call it a simple hypothesis. All of the hypotheses in the examples above are simple. In many cases, however, we onlyknow the distribution up to certain unknown parameters. For example, in a Gaussian noise model we may not know thevariance of the noise. In this case, a hypothesis is said to be composite .
Consider the problem of detecting the signal where is an unknown delay parameter. Then is a binary test of a simple hypothesis ( ) versus a composite alternative. Here we are assuming , with known.
Often a test involving a composite hypothesis has the form where is fixed. Such problems are called two-sided because the composite alternative "lies on both sides of ." When is a scalar, the test is called one-sided . Here, both hypotheses are composite.
Suppose a coin turns up heads with probability . We want to assess whether the coin is fair( ). We toss the coin times and record ( means heads and means tails). Then is a binary test of a simple hypothesis ( ) versus a composite alternative. This is also a two-sided test.
In binary hypothesis testing, assuming at least one of the two models does indeed correspond to reality, thereare four possible scenarios:
Consider the general binary hypothesis testing problem If is simple, that is, , then the size (denoted ), also called the false-alarm probability ( ), is defined to be When is composite, we define For , the power (denoted ), or detection probability ( ), is defined to be The probability of a type II error, also called the miss probability , is If is composite, then is viewed as a function of .
The design of a hypothesis test/detector often involves constructing the solution to an optimizationproblem. The optimality criteria used fall into two classes: Bayesian and frequent.
Representing the former approach is the Bayes Risk Criterion . Representing the latter is the Neyman-Pearson Criterion . These two approaches are developed at length in separate modules.
The following table, adapted from Kay, p.65 , summarizes the different terminology for hypothesis testing from statistics and signal processing:
Statistics | Signal Processing |
---|---|
Hypothesis Test | Detector |
Null Hypothesis | Noise Only Hypothesis |
Alternate Hypothesis | Signal + Noise Hypothesis |
Critical Region | Signal Present Decision Region |
Type I Error | False Alarm |
Type II Error | Miss |
Size of Test ( ) | Probability of False Alarm ( ) |
Power of Test ( ) | Probability of Detection ( ) |
Notification Switch
Would you like to follow the 'Statistical signal processing' conversation and receive update notifications?