<< Chapter < Page | Chapter >> Page > |
This implies that for any with probability at least we have
X=[0,1] , Y={0,1}. Let , k=1, 2, ... denote the collection of histogram classification rules with k equal volumebins. We can use the following codebook for the index k.
And follow this codeword with bits to indicate which of the 2 possible histogram rules is under consideration. Thus for any for some k 1 there is a prefix code of length
It follows that for any with probability at least we have
where is the number of bins in histogram corresponding to . Contrast with the bound we had for the class of m bin histograms alone: with probability ,
Notice the bound for all histograms rules is almost as good as the bound for only the -bin rules. That is, when the bounds are within a factor of . On the other hand, the new bound is a big improvement, since it also gives us a guide for selecting thenumber of bins.
ProofWe will prove that for any binary prefix code, the codeword lengths , , ... satisfy . The converse is easy to prove also, but it not central to ourpurposes here (for a proof, see Cover Thomas '91). Consider a binary tree like the one shown below.
The sequence of bit values leading from the root to a leaf of the tree represents a codeword. The prefix condition implies that nocodeword is a descendant of any other codeword in the tree. Let c be the length of the longest codeword (also the number of branches to the deepest leaf) in the tree.
Consider a leaf in the tree at level c . This leaf would have 2 descendants at level c . Furthermore, for each leaf the set of possible descendants at level c is disjoint (since no codeword can be a prefix of another). Therefore,since the total number of possible leafs at level c is , we have
which proves the case when the number of codewords is finite.
Suppose now that we have a countably infinite number of codewords. Let b b ... b be the ith codeword and let
be the real number corresponding to the binary expansion of the codeword. We canassociate the interval with the ith codeword. This is the set of all real numbers whose binaryexpansion begins with b b ... b . Since this is a subinterval of , and all such subintervals corresponding to prefix codewords are disjoint, the sum of their lengths must beless than or equal to 1. This proves the case where the number of codewords is infinite.
Notification Switch
Would you like to follow the 'Statistical learning theory' conversation and receive update notifications?