Re: Missing covariates

From: Lewis B. Sheiner Date: July 02, 2001 technical Source: cognigencorp.com

From: LSheiner <lewis@c255.ucsf.edu> Subject: Re: Missing covariates Date: Mon, 02 Jul 2001 09:32:04 -0700 This has been discussed before ... (the discussion at http://www.cognigencorp.com/nonmem/nm/99sep112000.html is most relevant) It is generally NOT a good idea to simply impute (the word used in the statistical literature) the missing covariates as you have described. There are at least three problems. 1. You are making up data and treating them as though they were observed. Your standard errors, etc., will reflect this, and hence will be too "optimistic"; that is, you will think you have learned more from the data than you in fact have. 2. By substituting the mean or median, you weaken any covariate relationship that really exists. Imagine, for example, that you were analyzing dose (X) vs concentration at a 6 hr. post dose (Y), but were missing X in half the cases. The correlation between X and Y estimated from the imputed data if all unknown doses were fixed to the median would be about half that estimated from only the "complete" cases ... Which do you think is more likely to be right? 3. The missing data may be missing for reasons that correlate with the values themselves (e.g., data sets with BQLs deleted: the data are missing BECAUSE their values are low), so called "non-ignorable missingness." If so, then the median of the observed values is actually a biased estimate of the median of the missing ones. There are ways to deal with missing data in a principled way (see above referenced archive site), avoiding problems 1 and 2 above, but they can be a bit tricky. Problem 3 is very difficult to deal with, usually requiring additional data or assumptions. LBS. -- _/ _/ _/_/ _/_/_/ _/_/_/ Lewis B Sheiner, MD (lewis@c255.ucsf.edu) _/ _/ _/ _/_ _/_/ Professor: Lab. Med., Bioph. Sci., Med. _/ _/ _/ _/ _/ Box 0626, UCSF, SF, CA, 94143-0626 _/_/ _/_/ _/_/_/ _/ 415-476-1965 (v), 415-476-2796 (fax)

Jul 02, 2001	Atul Bhattaram Venkatesh	Missing covariates
Jul 02, 2001	Jogarao Gobburu	Re: Missing covariates
Jul 02, 2001	Kenneth G. Kowalski	RE: Missing covariates
Jul 02, 2001	Diane Mould	Re: Missing covariates
Jul 02, 2001	Hui C. Kimko	RE: Missing covariates
Jul 02, 2001	Lewis B. Sheiner	Re: Missing covariates
Jul 02, 2001	Leonid Gibiansky	RE: Missing covariates
Jul 02, 2001	Jogarao Gobburu	Re: Missing covariates
Jul 02, 2001	Lewis B. Sheiner	Re: Missing covariates
Jul 02, 2001	Kenneth G. Kowalski	RE: Missing covariates
Jul 02, 2001	Leonid Gibiansky	RE: Missing covariates
Jul 05, 2001	Smith Brian P	Re: Missing covariates

`j` / `k`	Next / previous message
`o`	Open message
`f`	Search
`s`	Copy link
`t`	Filters
`c`	Copy message body
`r`	Related threads
`?`	This help
`Esc`	Close / clear

Re: Missing covariates

Thread

Keyboard Shortcuts