<< Chapter < Page | Chapter >> Page > |
The human vision system perceives images in colour using receptors on the retina of the eye which respond to threerelatively broad colour bands in the regions of red, green and blue (RGB) in the colour spectrum (red, orange, yellow, green,blue, indigo, violet).
Colours in between these are perceived as different linear combinations of RGB. Hence colour TVs and monitors can formalmost any perceivable colour by controlling the relative intensities of R, G and B light sources. Thus most colourimages which exist in electronic form are fundamentally represented by 3 intensities (R, G and B) at each pictureelement (pel) position.
The numerical values used for these intensities are usually chosen such that equal increments in value result inapproximately equal apparent increases in brightness. In practise this means that the numerical value is approximatelyproportional to the log of the true light intensity (energy of the wave) - this is Weber's Law . Throughout this course, we shall refer to these numerical values as intensities, sincefor compression it is most convenient to use a subjectively linear scale.
The eye is much more sensitive to overall intensity (luminance) changes than to colour changes. Usually most of theinformation about a scene is contained in its luminance rather than its colour (chrominance).
This is why black-and-white (monochrome) reproduction was acceptable for photography and TV for many years untiltechnology provided colour reproduction at a sufficient cheap price to make its modest advantages worth having.
The luminance ( ) of a pel may be obtained from its RGB components as:
RGB representations of images are normally defined so that if , the pel is always some shade of gray, and if in these cases, the 3 coefficients in should sum to unity.
When defines the luminance of a pel, its chrominance is usually defined by and such that:
The transformation between RGB and YUV colour spaces is linear and may be achieved by a matrix and its inverse:
shows the sensitivity of the eye to luminance ( ) and chrominance ( , ) components of images. The horizontal scale is spatial frequency, and represents thefrequency of an alternating pattern of parallel stripes with sinusoidally varying intensity. The vertical scale is thecontrast sensitivity of human vision, which is the ratio of the maximum visible range of intensities to the minimumdiscernible peak-to-peak intensity variation at the specified frequency.
In we see that:
A colour demonstration on the computer will show this effect.
The 3 RGB samples at each pel are transformed into 3 YUV samples using .
Most image compression systems then subsample the and information by 2:1 horizontally and vertically so that there is one and one pel for each block of pels. The subsampled and pels are obtained by averaging the four and samples, from . The quarter-size and subimages are then compressed using the same techniques as the full-size image, except that coarser quantisation may be used for and , so the total cost of adding colour may only be about 25% increase in bit rate. Sometimes and are subsamples 4:1 each way (16:1 total), giving an even lower cost of colour.
From now on we will mostly be considering compression of the monochrome image, and assume that similar techniques will be used for the smaller and subimages.
A final feature of human vision, which is useful for compression, is that the contrast sensitivity to a givenpattern is reduced in the presence of other patterns (activity) in the same region. This is known as activitymasking.
It is a complicated subject as it depends on the similarity between the given pattern and the background activity. Howeverin general, the higher the variance of the pels in a given region (typically ~ 8 to 16 pels across), the lower is thecontrast sensitivity.
Hence compression schemes which adapt the quantisation to local image activity tend to perform better than those whichuse uniform quantisation.
A computer demonstration will show the effect of reduced sensitivity to quantisation effects when noise is added to animage.
Notification Switch
Would you like to follow the 'Image coding' conversation and receive update notifications?