RE: Condition number
Hi Ayyappa,
I think the condition number was first proposed as a statistic to diagnose
multicollinearity in multiple linear regression analyses based on an
eigenvalue analysis of the X'X matrix. You can probably search the
statistical literature and multiple linear regression textbooks to find
various rules for the condition number as well as other statistics related
to the eigenvalue analysis. For the CN<1000 rule I typically reference the
following textbook:
Montgomery and Peck (1982). Introduction to Linear Regression Analysis.
Wiley, NY (pp. 301-302).
The condition number is good at detecting model instability but it is not
very good for identifying the source. Inspecting the correlation matrix for
extreme pairwise correlations is better suited for identifying the source of
the instability when it only involves a couple of parameters. It becomes
more challenging to identify the source of the instability
(multicollinearity) when the CN>1000 but none of the pairwise correlations
are extreme |corr|>0.95. Although when CN>1000 often we will find several
pairwise correlations that are moderately high |corr|>0.7 but it may be hard
to uncover a pattern or source of the instability without trying alternative
models that may eliminate one or more of the parameters associated with
these moderate to high correlations.
Best,
Ken
Kenneth G. Kowalski
Kowalski PMetrics Consulting, LLC
Email: [email protected]
Cell: 248-207-5082
Quoted reply history
-----Original Message-----
From: [email protected] [mailto:[email protected]] On
Behalf Of Ayyappa Chaturvedula
Sent: Tuesday, November 29, 2022 8:52 AM
To: [email protected]
Subject: [NMusers] Condition number
Dear all,
I am wondering if someone can provide references for the condition number
thresholds we are seeing (<1000) etc. Also, the other way I have seen when I
was in graduate school that condition number <10^n (n- number of parameters)
is OK. Personally, I am depending on correlation matrix rather than
condition number and have seen cases where condition number is large
(according to 1000 rule but less than 10^n rule) but correlation matrix is
fine.
I want to provide these for my teaching purposes and any help is greatly
appreciated.
Regards,
Ayyappa
--
This email has been checked for viruses by Avast antivirus software.
www.avast.com