Missing value when modeling categorical data

5 messages 3 people Latest: Oct 09, 2009

Missing value when modeling categorical data

From: Wangx826 Date: October 07, 2009 technical
Dear NMUsers, I am modeling ordered-categorical PD data versus time, but since the clinical trial is ongoing, I don't currently have complete data set for each subject. In other words, for some subjects, I have 1 year's PD data, but for some others, I can only collect PD data for 1 month. For the 1 month's case, it is like missing data for the rest of time. But I need to consider time course of the proportion of subjects who got a certain PD score. In my case, if I use the general logistic regression model to fit the relationship between time and proportions of event, would there be any problem? If so, how can I avoid it? Or need I consider censoring like survival analysis? Any suggestion would be appreciated very much. Thanks in advance, Tianli ***************************************************************** Tianli Wang University of Minnesota

Missing value when modeling categorical data

From: Wangx826 Date: October 08, 2009 technical
Dear NMUsers, I am modeling ordered-categorical PD data versus time, but since the clinical trial is ongoing, I don't currently have complete data set for each subject. In other words, for some subjects, I have 1 year's PD data, but for some others, I can only collect PD data for 1 month. For the 1 month's case, it is like missing data for the rest of time. But I need to consider time course of the proportion of subjects who got a certain PD score. In my case, if I use the general logistic regression model to fit the relationship between time and proportions of event, would there be any problem? If so, how can I avoid it? Or need I consider censoring like survival analysis? Any suggestion would be appreciated very much. Thanks in advance, Tianli ***************************************************************** Tianli Wang University of Minnesota

Re: Missing value when modeling categorical data

From: Nick Holford Date: October 08, 2009 technical
Tianli, Your data sounds like it could be described by a survival analysis. A time to event model will give you the survivor function i.e. the prob of not having had the event as a function of time. The event in your case is defined by the 'certain PD score'. The censoring will of course be taken care of by a typical survival analysis. Nick [email protected] wrote: > Dear NMUsers, > > I am modeling ordered-categorical PD data versus time, but since the clinical trial is ongoing, I don't currently have complete data set for each subject. In other words, for some subjects, I have 1 year's PD data, but for some others, I can only collect PD data for 1 month. For the 1 month's case, it is like missing data for the rest of time. But I need to consider time course of the proportion of subjects who got a certain PD score. In my case, if I use the general logistic regression model to fit the relationship between time and proportions of event, would there be any problem? If so, how can I avoid it? Or need I consider censoring like survival analysis? > > Any suggestion would be appreciated very much. > > Thanks in advance, > Tianli > ***************************************************************** > Tianli Wang > University of Minnesota -- Nick Holford, Professor Clinical Pharmacology Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand [email protected] tel:+64(9)923-6730 fax:+64(9)373-7090 mobile: +64 21 46 23 53 http://www.fmhs.auckland.ac.nz/sms/pharmacology/holford
Tianli, Let me check whether I understood your data correctly: You have a trial where subjects enroll over time. You collected data up to some calendar date. Since subjects were enrolled over time, you have unequal amount of data for each subject. However, period of observation does not depend on the observed PD scores. If this is a correct description, then your data are missing completely at random (probability of an observation being missing does not depend on observed or unobserved measurements). In this case, you may analyze them ignoring missingness: just build your model with all the available data. ACoP conference that just has ended had a good section on missing data ( http://www.go-acop.org/acop2009/program ). You may look at the presentations there, or just on some literature. Thanks Leonid -------------------------------------- Leonid Gibiansky, Ph.D. President, QuantPharm LLC web: www.quantpharm.com e-mail: LGibiansky at quantpharm.com tel: (301) 767 5566 [email protected] wrote: > Dear NMUsers, > > I am modeling ordered-categorical PD data versus time, but since the clinical trial is ongoing, I don't currently have complete data set for each subject. In other words, for some subjects, I have 1 year's PD data, but for some others, I can only collect PD data for 1 month. For the 1 month's case, it is like missing data for the rest of time. But I need to consider time course of the proportion of subjects who got a certain PD score. In my case, if I use the general logistic regression model to fit the relationship between time and proportions of event, would there be any problem? If so, how can I avoid it? Or need I consider censoring like survival analysis? > > Any suggestion would be appreciated very much. > > Thanks in advance, > Tianli > ***************************************************************** > Tianli Wang > University of Minnesota

Re: Missing value when modeling categorical data

From: Nick Holford Date: October 09, 2009 technical
Tianli, Please look carefully at this: http://www.fmhs.auckland.ac.nz/sms/pharmacology/holford/teaching/pharmacometrics/_docs/modelling_likelihoods_using_NONMEM_VI.pdf It shows you how to do time to event models which also include PKPD variables which change as a function of dose and time. Nick [email protected] wrote: > Dear Nick, > > I think I didn't make it clear in my first email that my model is a PK-PD linked model, which means, the probability of a certain event is a function of drug exposure which can describe as a function of time and some other parameters like CL, V, etc. But it seems to me that the hazard function only takes time into account. I am not familiar with survival analysis so this might be a very basic question: How can I combine the PK/PD model with the time-to-event model? > > Thanks in advance, > Tianli >
Quoted reply history
> On Oct 8 2009, Nick Holford wrote: > > > Tianli, > > > > Your data sounds like it could be described by a survival analysis. A time to event model will give you the survivor function i.e. the prob of not having had the event as a function of time. The event in your case is defined by the 'certain PD score'. The censoring will of course be taken care of by a typical survival analysis. > > > > Nick > > > > [email protected] wrote: > > > > > Dear NMUsers, > > > > > > I am modeling ordered-categorical PD data versus time, but since the clinical trial is ongoing, I don't currently have complete data set for each subject. In other words, for some subjects, I have 1 year's PD data, but for some others, I can only collect PD data for 1 month. For the 1 month's case, it is like missing data for the rest of time. But I need to consider time course of the proportion of subjects who got a certain PD score. In my case, if I use the general logistic regression model to fit the relationship between time and proportions of event, would there be any problem? If so, how can I avoid it? Or need I consider censoring like survival analysis? > > > > > > Any suggestion would be appreciated very much. > > > > > > Thanks in advance, > > > Tianli > > > ***************************************************************** > > > Tianli Wang > > > University of Minnesota -- Nick Holford, Professor Clinical Pharmacology Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand [email protected] tel:+64(9)923-6730 fax:+64(9)373-7090 mobile: +64 21 46 23 53 http://www.fmhs.auckland.ac.nz/sms/pharmacology/holford