<< Chapter < Page | Chapter >> Page > |
This shows that the Bernoulli distribution can be written in the form of Equation [link] , using an appropriate choice of , and .
Let's now move on to consider the Gaussian distribution. Recall that, when deriving linear regression, the value of had no effect on our final choice of and . Thus, we can choose an arbitrary value for without changing anything. To simplify the derivation below, let'sset . If we leave as a variable, the Gaussian distribution can also be shown to be in the exponential family, where is now a 2-dimension vector that depends on both and . For the purposes of GLMs, however, the parameter can also be treated by considering a more general definition of the exponential family: . Here, is called the dispersion parameter , and for the Gaussian, ; but given our simplification above, we won't need the more general definition for the examples we will consider here. We then have:
Thus, we see that the Gaussian is in the exponential family, with
There're many other distributions that are members of the exponential family: The multinomial (which we'll see later), the Poisson (for modelling count-data;also see the problem set); the gamma and the exponential (for modelling continuous, non-negative random variables, such as time-intervals); the beta and the Dirichlet(for distributions over probabilities); and many more. In the next section, we will describe a general “recipe” for constructing models in which (given and ) comes from any of these distributions.
Suppose you would like to build a model to estimate the number of customers arriving in your store (or number of page-views on your website) in any givenhour, based on certain features such as store promotions, recent advertising, weather, day-of-week, etc. We know that the Poisson distributionusually gives a good model for numbers of visitors. Knowing this, how can we come up with a model for our problem? Fortunately,the Poisson is an exponential family distribution, so we can apply a Generalized Linear Model (GLM).In this section, we will we will describe a method for constructing GLM models for problems such as these.
More generally, consider a classification or regression problem where we would like to predict the value of some random variable as a function of . To derive a GLM for this problem, we will make the following threeassumptions about the conditional distribution of given and about our model:
Notification Switch
Would you like to follow the 'Machine learning' conversation and receive update notifications?