RE: Condition number

From: Kenneth Kowalski Date: November 29, 2022 technical Source: mail-archive.com
Hi Ayyappa, I think the condition number was first proposed as a statistic to diagnose multicollinearity in multiple linear regression analyses based on an eigenvalue analysis of the X'X matrix. You can probably search the statistical literature and multiple linear regression textbooks to find various rules for the condition number as well as other statistics related to the eigenvalue analysis. For the CN<1000 rule I typically reference the following textbook: Montgomery and Peck (1982). Introduction to Linear Regression Analysis. Wiley, NY (pp. 301-302). The condition number is good at detecting model instability but it is not very good for identifying the source. Inspecting the correlation matrix for extreme pairwise correlations is better suited for identifying the source of the instability when it only involves a couple of parameters. It becomes more challenging to identify the source of the instability (multicollinearity) when the CN>1000 but none of the pairwise correlations are extreme |corr|>0.95. Although when CN>1000 often we will find several pairwise correlations that are moderately high |corr|>0.7 but it may be hard to uncover a pattern or source of the instability without trying alternative models that may eliminate one or more of the parameters associated with these moderate to high correlations. Best, Ken Kenneth G. Kowalski Kowalski PMetrics Consulting, LLC Email: [email protected] Cell: 248-207-5082
Quoted reply history
-----Original Message----- From: [email protected] [mailto:[email protected]] On Behalf Of Ayyappa Chaturvedula Sent: Tuesday, November 29, 2022 8:52 AM To: [email protected] Subject: [NMusers] Condition number Dear all, I am wondering if someone can provide references for the condition number thresholds we are seeing (<1000) etc. Also, the other way I have seen when I was in graduate school that condition number <10^n (n- number of parameters) is OK. Personally, I am depending on correlation matrix rather than condition number and have seen cases where condition number is large (according to 1000 rule but less than 10^n rule) but correlation matrix is fine. I want to provide these for my teaching purposes and any help is greatly appreciated. Regards, Ayyappa -- This email has been checked for viruses by Avast antivirus software. www.avast.com
Nov 29, 2022 Ayyappa Chaturvedula Condition number
Nov 29, 2022 Kenneth Kowalski RE: Condition number
Nov 29, 2022 Peter Bonate RE: Condition number
Nov 29, 2022 Jeroen Elassaiss-Schaap Re: Condition number
Nov 29, 2022 Kyun-Seop Bae Fwd: Condition number
Nov 30, 2022 Matt Fidler Re: Condition number
Nov 30, 2022 Kenneth Kowalski RE: Condition number
Nov 30, 2022 Leonid Gibiansky Re: Condition number
Nov 30, 2022 Peter Bonate Re: Condition number
Nov 30, 2022 Robert Bauer RE: Condition number
Nov 30, 2022 Bill Denney RE: Condition number
Dec 01, 2022 Kyun-Seop Bae Re: Condition number
Dec 01, 2022 Peter Bonate RE: Condition number
Dec 01, 2022 Kenneth Kowalski RE: Condition number
Dec 01, 2022 Ayyappa Chaturvedula Re: Condition number
Dec 01, 2022 Al Maloney Re: Condition number
Dec 01, 2022 Robert Bauer RE: [EXTERNAL] RE: Condition number
Dec 01, 2022 Robert Bauer Condition number
Dec 02, 2022 Robert Bauer Condition number