where $q(x^n) = 2^{-l(x^n)}$ is the prior induced by the coding length function $l(\cdot)$.
Note that for any prior $w(\theta)$ and any length function $l$, the weighted redundancy cannot exceed the worst-case redundancy,
$$\int w(\theta)\, r_n(l,\theta)\, d\theta \;\le\; \max_{\theta}\, r_n(l,\theta).$$
Therefore, the max-min redundancy is never larger than the min-max redundancy, $R_n^- \le R_n^+$.
In fact, Gallager showed that $R_n^+ = R_n^-$. That is, the min-max and max-min redundancies are equal.
Let us revisit the Bernoulli source, where $\Pr(x_i = 1) = \theta$ for an unknown $\theta \in [0,1]$. From the definition of [link], which relies on a uniform prior for the sources, i.e., $w(\theta) = 1$ for $\theta \in [0,1]$, it can be shown that there exists a universal code with length function $l(\cdot)$ such that
$$l(x^n) \;\le\; n\, h_B\!\left(\frac{n_1}{n}\right) + \log(n+1),$$
where $h_B(\cdot)$ is the binary entropy and $n_1$ is the number of 1's in $x^n$. That is, the redundancy is approximately $\log(n)$ bits.
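As a quick numerical illustration of this bound, here is a minimal Python sketch; the sequence length, the Bernoulli parameter, and the variable names are illustrative choices. It draws a Bernoulli sequence, computes the uniform-prior mixture codelength $\log\big((n+1)\binom{n}{n_1}\big)$, and compares its excess over the empirical entropy $n\,h_B(n_1/n)$ with $\log(n+1)$.

```python
import numpy as np
from math import comb, log2

rng = np.random.default_rng(0)
n, theta = 1_000, 0.3
x = (rng.random(n) < theta).astype(int)
n1 = int(x.sum())

# uniform-prior mixture: q(x^n) = n1! (n - n1)! / (n + 1)! = 1 / ((n + 1) * C(n, n1))
codelength = log2((n + 1) * comb(n, n1))
p_hat = n1 / n
empirical = n * (-p_hat * log2(p_hat) - (1 - p_hat) * log2(1 - p_hat))   # n * h_B(n1/n)
print("codelength - n*h_B(n1/n):", codelength - empirical, "bits")
print("log2(n + 1)             :", log2(n + 1), "bits")
```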
Clarke and Barron [link] studied the weighting approach, and constructed a prior that precisely achieves the optimal redundancy for memoryless sources.
Theorem 5 [link] For a memoryless source with an alphabet of size $r$, parametrized by $\theta = (\theta_1,\ldots,\theta_K)$ with $K = r-1$ free parameters, the redundancy of the weighting approach with prior $w(\theta)$ satisfies
$$r_n(\theta) = \frac{K}{2}\log\frac{n}{2\pi e} + \log\frac{\sqrt{\det I(\theta)}}{w(\theta)} + o(1),$$
where $o(1)$ vanishes uniformly as $n \rightarrow \infty$ for any compact subset of the parameter space, and
$$I(\theta) = \left\{ -E\!\left[ \frac{\partial^2 \log p_\theta(x)}{\partial \theta_i\, \partial \theta_j} \right] \right\}_{i,j=1}^{K}$$
is Fisher's information.
Note that when the source is sensitive to changes in the parameter we have large $I(\theta)$, which increases the redundancy. That is, good sensitivity means bad universal compression.
Denote
$$w(\theta) = \frac{\sqrt{\det I(\theta)}}{\int \sqrt{\det I(\theta')}\, d\theta'};$$
this is known as Jeffreys' prior. Using $w(\theta)$, it can be shown that
$$r_n(\theta) = \frac{K}{2}\log\frac{n}{2\pi e} + \log \int \sqrt{\det I(\theta')}\, d\theta' + o(1).$$
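One way to sanity-check this expression is to estimate the max-min redundancy $R_n^-$ directly: it can be interpreted as the capacity of the "channel" from the parameter $\theta$ to the sequence $x^n$, which can be computed numerically with the Blahut-Arimoto algorithm. The sketch below is a rough numerical check under assumed choices (a Bernoulli source, an arbitrary parameter grid and iteration count, and helper names of my own); it compares the estimate with $\frac{1}{2}\log\frac{n}{2\pi e} + \log\pi$, the formula above for the Bernoulli case, where $\int_0^1 \sqrt{I(\theta)}\, d\theta = \pi$.

```python
import numpy as np
from scipy.stats import binom

n = 100                                    # sequence length
thetas = np.linspace(0.001, 0.999, 999)    # grid over the Bernoulli parameter
counts = np.arange(n + 1)                  # the number of ones n1 is a sufficient statistic
P = np.array([binom.pmf(counts, n, t) for t in thetas])   # P[i, k] = p(n1 = k | theta_i)

def divergences(w):
    """D(p(.|theta) || q) in bits for every grid point, where q is the mixture under prior w."""
    q = w @ P
    return np.sum(P * np.log2(np.where(P > 0, P, 1.0) / q), axis=1)

w = np.full(len(thetas), 1.0 / len(thetas))   # start from a uniform prior on the grid
for _ in range(200):                          # Blahut-Arimoto iterations
    D = divergences(w)
    w = w * np.exp2(D)                        # re-weight toward high-divergence parameters
    w = w / w.sum()

estimate = float(w @ divergences(w))          # numerical estimate of R_n^- (bits)
formula = 0.5 * np.log2(n / (2 * np.pi * np.e)) + np.log2(np.pi)
print("numerical max-min redundancy :", estimate)
print("(K/2)log(n/2pi e) + log(pi)  :", formula)
```

For moderate $n$ the two numbers should already be close, reflecting the $o(1)$ term in the theorem.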
Let us derive the Fisher information for the Bernoulli source. Here $\log p_\theta(x) = x\log\theta + (1-x)\log(1-\theta)$ for $x \in \{0,1\}$, so
$$-E\!\left[\frac{\partial^2 \log p_\theta(x)}{\partial \theta^2}\right] = E\!\left[\frac{x}{\theta^2} + \frac{1-x}{(1-\theta)^2}\right] = \frac{\theta}{\theta^2} + \frac{1-\theta}{(1-\theta)^2} = \frac{1}{\theta} + \frac{1}{1-\theta}.$$
Therefore, the Fisher information satisfies $I(\theta) = \frac{1}{\theta(1-\theta)}$.
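As a quick check of this derivation, the following sympy sketch (the symbol names are arbitrary) differentiates $\log p_\theta(x)$ twice and takes the expectation over $x \sim \mathrm{Bernoulli}(\theta)$.

```python
import sympy as sp

theta, x = sp.symbols('theta x')
log_p = x * sp.log(theta) + (1 - x) * sp.log(1 - theta)   # log p_theta(x) for x in {0, 1}
d2 = sp.diff(log_p, theta, 2)                             # second derivative in theta
# d2 is linear in x, so the expectation over x ~ Bernoulli(theta) is d2 evaluated at x = theta
fisher = sp.simplify(-d2.subs(x, theta))
print(fisher)   # equivalent to 1/(theta*(1 - theta))
```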
Recall the Krichevsky–Trofimov coding, which was mentioned in [link]. Using the definition of Jeffreys' prior [link], we see that
$$w(\theta) = \frac{1}{\pi\sqrt{\theta(1-\theta)}}.$$
Taking the integral over Jeffreys' prior,
$$q(x^n) = \int_0^1 w(\theta)\,\theta^{n_1}(1-\theta)^{n_0}\, d\theta = \frac{1}{\pi}\int_0^1 \theta^{\,n_1-\frac12}(1-\theta)^{\,n_0-\frac12}\, d\theta = \frac{\Gamma\!\left(n_1+\tfrac12\right)\Gamma\!\left(n_0+\tfrac12\right)}{\pi\,\Gamma(n+1)},$$
where we used the gamma function $\Gamma(\cdot)$, and $n_1$ and $n_0 = n - n_1$ denote the numbers of 1's and 0's in $x^n$. It can be shown that
$$l(x^n) \;\le\; n\, h_B\!\left(\frac{n_1}{n}\right) + \frac{1}{2}\log(n) + 1,$$
where $l(x^n) = \log\frac{1}{q(x^n)}$ is the coding length induced by $q$.
As before, this universal code can be implemented sequentially. It is due to Krichevsky and Trofimov [link], its redundancy satisfies Theorem 5 of Clarke and Barron [link], and it is commonly used in universal lossless compression.
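A minimal sketch of this code for binary input, with helper names of my own choosing: it computes $q(x^n)$ both from the closed-form gamma expression above and as a product of the standard sequential KT conditionals $q(x_{t+1}=1\,|\,x^t) = \frac{n_1(t)+1/2}{t+1}$, and then checks the codelength bound.

```python
import numpy as np
from math import lgamma, log, log2, pi

def kt_log2_prob_closed_form(x):
    """log2 q(x^n) from the closed-form gamma expression."""
    n, n1 = len(x), int(sum(x))
    n0 = n - n1
    ln_q = lgamma(n1 + 0.5) + lgamma(n0 + 0.5) - lgamma(n + 1) - log(pi)
    return ln_q / log(2)

def kt_log2_prob_sequential(x):
    """log2 q(x^n) accumulated symbol by symbol with q(next = 1 | x^t) = (n1(t) + 1/2) / (t + 1)."""
    n1, total = 0, 0.0
    for t, bit in enumerate(x):
        p_one = (n1 + 0.5) / (t + 1)
        total += log2(p_one if bit else 1.0 - p_one)
        n1 += int(bit)
    return total

rng = np.random.default_rng(0)
x = (rng.random(1000) < 0.3).astype(int)     # a Bernoulli(0.3) sequence of length n = 1000
n, n1 = len(x), int(x.sum())
p_hat = n1 / n
h = -p_hat * log2(p_hat) - (1 - p_hat) * log2(1 - p_hat)   # binary entropy of n1/n

print(kt_log2_prob_closed_form(x), kt_log2_prob_sequential(x))   # the two should agree
print("codelength:", -kt_log2_prob_sequential(x),
      "  bound:", n * h + 0.5 * log2(n) + 1)
```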
Let us consider, on an intuitive level, why the redundancy scales as $\frac{K}{2}\log(n)$. Expending $\frac{K}{2}\log(n)$ bits allows us to differentiate between $n^{K/2}$ parameter vectors. That is, we would differentiate between each of the $K$ parameters with $\sqrt{n}$ levels. Now consider a Bernoulli RV with (unknown) parameter $\theta$.
One perspective is that with $n$ drawings of the RV, the standard deviation in the number of 1's is $O(\sqrt{n})$. That is, $\sqrt{n}$ levels differentiate between parameter levels up to a resolution that reflects the randomness of the experiment.
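A small simulation of this first perspective, with an illustrative sequence length, parameter, and trial count: it draws the number of 1's many times and compares the empirical standard deviation with $\sqrt{n\theta(1-\theta)} = O(\sqrt{n})$.

```python
import numpy as np

rng = np.random.default_rng(1)
n, theta, trials = 10_000, 0.3, 2_000
ones = rng.binomial(n, theta, size=trials)       # number of 1's in each length-n experiment
print("empirical std of #1's  :", ones.std())
print("sqrt(n*theta*(1-theta)):", np.sqrt(n * theta * (1 - theta)))
```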
A second perspective is that of coding a sequence of Bernoulli outcomes with an imprecise parameter, where it is convenient to think of a universal code in terms of first quantizing the parameter and then using that (imprecise) parameter to encode the input $x^n$. For the Bernoulli example, the maximum likelihood parameter satisfies
$$\hat{\theta}_{ML} = \frac{n_1}{n}.$$
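A sketch of this two-part-code perspective under assumed choices (a uniform quantizer with about $\sqrt{n}$ levels and an illustrative Bernoulli parameter): describe the quantized parameter with roughly $\frac{1}{2}\log(n)$ bits, encode $x^n$ with that imprecise parameter, and compare the total excess over the empirical entropy $n\,h_B(n_1/n)$ with $\frac{1}{2}\log(n)$.

```python
import numpy as np
from math import ceil, log2, sqrt

rng = np.random.default_rng(2)
n, theta = 10_000, 0.3
x = (rng.random(n) < theta).astype(int)
n1 = int(x.sum())
p_ml = n1 / n                                  # maximum-likelihood parameter n1 / n

levels = int(ceil(sqrt(n)))                    # quantize theta to about sqrt(n) levels
grid = (np.arange(levels) + 0.5) / levels      # reproduction points of a uniform quantizer
p_q = float(grid[np.argmin(np.abs(grid - p_ml))])   # the (imprecise) quantized parameter

# two-part codelength: describe the level (~0.5*log2(n) bits), then code x^n with p_q
codelength = log2(levels) - n1 * log2(p_q) - (n - n1) * log2(1 - p_q)
empirical = -n1 * log2(p_ml) - (n - n1) * log2(1 - p_ml)   # n * h_B(n1/n)
print("two-part redundancy:", codelength - empirical, "bits")
print("0.5 * log2(n)      :", 0.5 * log2(n), "bits")
```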