<< Chapter < Page | Chapter >> Page > |
Solving the third problem posed in the introduction to these notes is rather different from the other two. Here we want to find an operator ormatrix that when multiplied by gives . Clearly a solution to this problem would not be unique as stated. In order to pose a better definedproblem, we generally give a set or family of inputs and the corresponding outputs . If these families are independent, and if the number of them is the same as the size of the matrix, a unique matrix isdefined and can be found by solving simultaneous equations. If a smaller number is given, the remaining degrees of freedom can be used to satisfysome other criterion. If a larger number is given, there is probably no exact solution and some approximation will be necessary.
If the unknown operator matrix is of dimension by , then we take inputs for , each of dimension and the corresponding outputs , each of dimension and form the matrix equation:
where is the by unknown operator, is the by input matrix with columns which are the inputs and is the by output matrix with columns . The operator matrix is then determined by:
if the inputs are independent which means is nonsingular.
This problem can be posed so that there are more (perhaps many more) inputs and outputs than with a resulting equation error which can be minimized with some form of pseudoinverse.
Linear regression can be put in this form. If our matrix equation is
where is a row vector of unknown weights and is a column vector of known inputs, then is a scaler inter product. If a seond experiment gives a second scaler inner product from a secondcolumn vector of known inputs, then we augment to have two rows and to be a length-2 row vector. This is continued for experiment to give [link] as a 1 by row vector times an by matrix which equals a 1 by row vector. It this equation is transposed, it is in the formof [link] which can be approximately solved by the pesuedo inverse to give the unknown weights for the regression.
Alternatively, the matrix may be constrained by structure to have less than degrees of freedom. It may be a cyclic convolution, a non cyclic convolution, a Toeplitz, a Hankel, or a Toeplitz plus Hankelmatrix.
A problem of this sort came up in research on designing efficient prime length fast Fourier transform (FFT) algorithms where is the data and is the FFT of . The problem was to derive an operator that would make this calculation using the least amount of arithmetic. We solved it using aspecial formulation [link] and Matlab.
Notification Switch
Would you like to follow the 'Basic vector space methods in signal and systems theory' conversation and receive update notifications?