![]() |
|
![]() |
![]() |
|
![]() |
![]() |
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Subscribe]
DM: RE: Why is Singular Vector Decomposition for OLS?From: Cunningham, Scott W Date: Thu, 9 Apr 1998 13:26:43 -0400 (EDT)
Krishnadas
Singular value decomposition DOES have clear advantages over the plain
inverse(X'X) X'Y or "normal equations."
The reason has to do with the so called "multi-collinearity" problem.
When
two independent variables are closely
correlated it is very difficult to accurately assess the correct
regression
parameters for each. The variance about the estimates is
artificially high.
In fact, some regression packages (such as Excel 97) will REFUSE to
perform
regressions when two variables are completely linearly dependent.
Statistically, there is nothing wrong with the procedure. In large
datasets
(where you can't eye for linear dependency) the refusal of the
software to
perform regression is a major difficulty.
Contrast this with singular value decomposition (SVD), as applied to
ordinary least squares. SVD finds linear combinations of the
variables so
that the resulting "eigenvectors" are linearly independent
(orthogonal) of
each other. Then when linear regression is performed it is
algorithmically
very clear as to which eigenvectors are responsible for which
percentage of
the original variance. The regression parameters on the eigenvectors
are
then converted back into regression parameters on the original
variables,
and the output is then returned to the user.
Both procedures return regression estimates. The difference: SVD
regression estimates are much more tightly bound.
An excellent book on the topic, which discusses "scientific computing"
rather than "statistics" is
Press, et al. (1992). Numerical Recipes in C: The Art of Scientific
Computing, Second Edition. Cambridge University Press: Cambridge.
It has a number of algorithms in C that are of special interest to
data
miners. There are versions of the book for other languages, including
Fortran.
Best wishes,
Scott Cunningham, D.Phil.
NCR Corporation
Human Interface Technology Center
-----Original Message-----
From: Krishnadas [SMTP:ckkrish@cyberspace.org]
Sent: Thursday, April 09, 1998 9:55 AM
To: Datamining Mailinng List
Subject: DM: Why is Singular Vector Decomposition for
OLS?
Hello,
Since SVD is used widely for OLS I guess it has clear
advantages
over
plain Inverse(X'X) X'Y. Can anyone tell me about it? Any
good
books
or references on the motivation for SVD and application of
other
matrix
decomposition in statistics?
Thanks.
-- Krishnadas
-----------------------------------------------------------------
C. K. Krishnadas c k krish at cyberspace dot o
r g
ckkrish@cyberspace.org
http://www.cyberspace.org/~ckkrish
na.kck@na-net.ornl.gov
-----------------------------------------------------------------
|
MHonArc 2.2.0