WRES AND OUTLIER IDENTIFICATION/EXCLUSION

35 messages 14 people Latest: Oct 03, 2006

WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Niyi Adedokun Date: September 25, 2006 technical
From: "NIYI ADEDOKUN" niyiadedokun@hotmail.com Subject: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Monday, September 25, 2006 3:50 PM Dear All, In conducting population PK analysis I have seen situations when outliers are excluded from a dataset based on weighted residuals>5. Is this justified and are there useful references to back up this practice. Regards, Jo

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: William Bachman Date: September 25, 2006 technical
From: "Bachman, William (MYD)" bachmanw@iconus.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, September 25, 2006 4:24 pm My particular preference is to remove the outliers (based on whatever criterion that you choose) and then, if warranted by the results without the outliers, be able to state that removing the outliers had NO significant influence on the results. Of course, if it does, I would start looking for WHY.

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Nick Holford Date: September 25, 2006 technical
From: Nick Holford n.holford@auckland.ac.nz Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Monday, September 25, 2006 4:56 PM Bill, If removing outliers didnt make any difference to the results then why would you want to remove them? It seems to me that one only removes outliers if it really makes a difference to the results. I agree that you then have to try to explain why the differences exist and why results without outliers are preferable to other results with outliers. This is a game I really try not to play unless outliers can clearly be shown by independent means to be data errors. Nick -- Nick Holford, Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand email:n.holford@auckland.ac.nz tel:+64(9)373-7599x86730 fax:373-7556 http://www.health.auckland.ac.nz/pharmacology/staff/nholford/

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mark Sale Date: September 25, 2006 technical
From: Mark Sale - Next Level Solutions mark@nextlevelsolns.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 25 Sep 2006 14:38:15 -0700 I'd like to add to Bill's comment: 1. Look at a time vs dv/pred plot for that person, if you have an otherwise reasonable profile, but one strange point, I'd be more comfortable (not that CROs ever mislabel samples, or record times wrong, or mix up tubes) 2. If all samples from a person are strange, it isn't a statistical outlier. Either, that person is different, or something went wrong (wrong dose, sample put in red tube instead of green tube, samples sat on counter over the weekend, assay technician did the dilution wrong etc etc...) You can still delete them all if you want to, but you're sort of obligated to ask if this person is different and why. If we always through away data that isn't consistent with our models, we won't learn very much (think pre 2D6 genotyping studies for tricyclic antidepressants). We only learn when our models DO NOT explain the data. Sort of Learn..Confirm OR Fail to confirm..Learn, to paraphrase LBS. 3. It is very hard to justify deleting more than 3% of your data as statistical outliers. At some point you have to say that the site|lab|data management messed up. Or, this person is strange, for unknown reason, should be included in the estimate of OMEGA, and you should write a grant to study why this is happening. To answer your question about references, not that I'm aware of. But, outlier handleing should be specified in the analysis plan, then you're covered. Mark Mark Sale MD Next Level Solutions, LLC www.NextLevelSolns.com

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: William Bachman Date: September 26, 2006 technical
From: "Bill Bachman" bachmanw@comcast.net Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 25 Sep 2006 20:26:14 -0400 That's exactly the point! It's better not to remove data!

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Andrew Hooker Date: September 26, 2006 technical
From: "Andrew Hooker" andrew.hooker@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 10:53:17 +0200 Hi Jo, If you are using the FOCE method then I dont think that using the weighted residuals as a criterion to exclude outliers is justified. The WRES are based on the FO approximation and can be severely biased even when the data is simulated from the same model you estimate from. Using some other form of metric to identify outliers (such as DV/PRED as suggested by Mark) seems preferable in this case. -Andy Andrew Hooker, Ph.D. Assistant Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Uppsala University Box 591 751 24 Uppsala Sweden Tel: +46 18 471 4355 www.farmbio.uu.se/research.php?avd=5

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Michael Fossler Date: September 26, 2006 technical
From: Michael.J.Fossler@GSK.COM Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 08:29:34 -0400 My personal preference is not to exclude any points based on outlier criteria. By doing so, you may be excluding important information. To take an extreme example, if you were modeling consecutive games played in the majors, would you exclude Cal Ripkin? He is clearly an outlier, and yet excluding him from the data-set would bias your model significantly. You would be trading model relevance for a better fit, which is not a good trade-off. Excluding data which are in error should be done, but those data are not outliers, they are errors. Apologies to my European colleagues for the baseball reference. Insert your favorite soccer example above (:^)) Mike ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Michael J. Fossler, Pharm. D., Ph. D., F.C.P. Director Clinical Pharmacokinetics, Modeling & Simulation GlaxoSmithKline (610) 270 - 4797 FAX: (610) 270-5598 Cell: (443) 350-1194 Michael_J_Fossler@gsk.com ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Leonid Gibiansky Date: September 26, 2006 technical
From: Leonid Gibiansky leonidg@metrumrg.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 09:16:31 -0400 It is known that methods implemented by NONMEM are not robust to the outliers (or data errors). Few bad observations can significantly damage convergence and result in the parameter estimate that reflect data errors rather than system description. Phase 3 data are known to be prone to data errors from non-compliance to incorrect dosing to sample mixed up, you name it. On the other hand, it is not feasible to check several hundred records (say 300-400 patients with 3-4 samples each = 1-2K records) manually/visually. In this situation it could be a valid strategy to exclude 20-30 (say, 1-2-3%) of records based on some diagnostic. WRES is one of the indicators of the mis-fit; large WRES are indicative of outliers/possible data errors. If you do not like WRES, you may try CWRES or any other diagnostic. Often you see the problem immediately: large cloud of reasonable WRES and several points far up or down the scale. Those few suspicious observations can be manually checked for problems: more often that not you may see that the observation is likely to be an error (much higher than the previous one without extra dose, or nearly zero followed by something reasonable, etc.). If so, decision to exclude can be backed by WRES as diagnostic + explanation based on the plots. Leonid

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Michael Fossler Date: September 26, 2006 technical
From: Michael.J.Fossler@gsk.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 09:47:39 -0400 Hi Leonid; I have to point out that you are mixing up "outliers" with bad data. The two are not the same. Errors in the data should be tracked down and corrected, but these are not outliers. Outliers are data points that do not fit your "priors" of how the data should look. I still maintain that excluding data based on the fact that they don't fit into your pre-conceived notions of how they should look is a bad idea and should not be done. I'm not sure I agree with you about the relative quality of Phase 3 data. Electronic point-of-care data capture is definitely minimizing these kinds of errors (If not, why are we doing it?). Also, thorough training of nursing staff will go a long way toward minimizing mistakes. I'd argue for more of that and less outlier exclusion; I'm not sure I'd want to go to the FDA and say, "well, you know Phase 3 data, always full of errors, this is the best we can do." I have an idea of what the response would be to that... Mike ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Michael J. Fossler, Pharm. D., Ph. D., F.C.P. Director Clinical Pharmacokinetics, Modeling & Simulation GlaxoSmithKline (610) 270 - 4797 FAX: (610) 270-5598 Cell: (443) 350-1194 Michael_J_Fossler@gsk.com ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: September 26, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 15:53:13 +0200 Hi, I believe that sensitivity to outliers/errors, when present, may well come from the assumption that residual error magnitude is the same across all subjects. This assumption is usually proven wrong when challenged and a model that allows variability in sigma between subjects is preferable. This can easily be accomplished with FOCEI and a coding of the type Y=F+EPS(1)*EXP(ETA(.)). Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Niyi Adedokun Date: September 26, 2006 technical
From: "NIYI ADEDOKUN" niyiadedokun@hotmail.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 14:19:20 +0000 Many thanks for responses to this posting. Clearly "true" outliers may give important information in model building but suspected "errors" in data can be very detrimental to convergence as well as a succesful covariance step. I have noticed this phenomenon when using FOCEI. Runs with FO or FOCE would converge with the data 'errors' while with FOCEI convergence is only possible when the data errors are excluded. I suspect that sensitivity to data errors is estimation method dependent. More bothersome is the fact that when convergence does occur with FOCEI, parameters could differ by as much as 25%. Since I have more confidence using FOCEI one is left to make a decison on what to do with the data "errors". As for comparing models with and without the 'errors', how can one compare parameters obtained from a converging and a non converging NONMEM run? In addition if one were to use CWRES as a diagnostic to exclude data "errors" what would be the acceptable cut off point? Regards Jo

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Alan Xiao Date: September 26, 2006 technical
From: "Xiao, Alan" alan_xiao@merck.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 10:37:08 -0400 I think the key is when to exclude the "outliers". If they are excluded at the beginning, you might lose some important information about your drug (and consequently important findings) as Mike mentioned below. If they are excluded after you got a final model and you want to refine the final model with the exclusion, you then lose much less information - you can not really well characterize the lost part anyway since your model can not capture them either because they are simply data errors or non-representativeness for a new population or other reasons you can imagine, although you still need an interpretation about the "outliers". In this case, your refined final model might be more reliable to data which are representative, but not covering the outliers. Appropriate documentation and interpretation (and implication) are very important. Alan

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Leonid Gibiansky Date: September 26, 2006 technical
From: Leonid Gibiansky leonidg@metrumrg.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 11:24:33 -0400 Hi Mike, You never know whether high WRES points are outliers or bad data points: these are not labeled on the tube "outliers". I am not talking about full profile - exclusion; the discussion is whether to exclude some points on the profile that for some unknown reasons cannot be described by the model, and mainly, on how to identify those points without looking on each and every PK profile of hundreds of patients. I tried the idea of using individual sigma values. Results of the quick experiments on the recent data set is below (number of excluded data points was about 2%): High WRES excluded, same sigma THETAs OM SIGMA Value 4.05E+00 2.90E-01 1.11E-02 1.29E-02 8.06E-01 4.82E-02 9.33E-02 SE 1.20E-01 1.21E-02 1.58E-03 4.74E-03 3.41E-02 6.61E-03 8.24E-03 High WRES included, same sigma THETAs OM SIGMA Value 4.37E+00 3.08E-01 1.08E-02 1.75E-02 7.68E-01 4.72E-02 2.04E-01 SE 2.12E-01 1.09E-02 1.11E-03 5.47E-03 3.65E-02 9.27E-03 4.84E-02 High WRES excluded, individual sigma THETAs OM SIGMA OM on SIGMA Value 4.04E+00 2.90E-01 1.11E-02 1.30E-02 8.06E-01 4.82E-02 9.20E-02 2.42E-03 SE 1.18E-01 1.22E-02 1.60E-03 4.82E-03 3.43E-02 6.61E-03 9.14E-03 1.18E-02 High WRES included, individual sigma THETAs OM SIGMA OM on SIGMA Value 4.04E+00 2.93E-01 1.05E-02 1.50E-02 8.21E-01 4.65E-02 8.27E-02 2.00E-01 SE 1.19E-01 1.20E-02 1.38E-03 6.23E-03 3.66E-02 7.49E-03 1.06E-02 7.41E-02 You may see that individualizing residual error is more or less equivalent to exclusion of high WRES: weight of those points is significantly reduced by assigning high residual error to patients with those points. It is more automatic, and do not require data exclusion, great idea, Mats, thanks. Leonid

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Michael Fossler Date: September 26, 2006 technical
From: Michael.J.Fossler@gsk.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 11:37:30 -0400 I routinely use the varying sigma residual model and usually see a great improvement in the fit. As Mats points out, it allows one to side-step the issue of data exclusion. Mike ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Michael J. Fossler, Pharm. D., Ph. D., F.C.P. Director Clinical Pharmacokinetics, Modeling & Simulation GlaxoSmithKline (610) 270 - 4797 FAX: (610) 270-5598 Cell: (443) 350-1194 Michael_J_Fossler@gsk.com ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Chuanpu Hu Date: September 26, 2006 technical
From: Chuanpu.Hu@sanofi-aventis.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 17:08:06 -0400 I think Mike pointed out an important distinction. As Mats suggested, most of the time outliers are model-dependent; i.e., outliers occurred because the appropriate model wasn't or couldn't be fitted. If the quality of the apparent outliers are questionable, this becomes a robustness/sensitivity issue. This can only be assessed by analyzing data in two ways, including and excluding the outliers. In any case, I see no rationale for simply excluding. Of course, excluding outliers makes the result look better, which can be very appealing... Chuanpu

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: September 26, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 26 Sep 2006 23:44:00 +0200 Hi Chuanpu, You wrote "As Mats suggested, most of the time outliers are model-dependent; i.e., outliers occurred because the appropriate model wasn't or couldn't be fitted." Actually, I don't think I said this. My general impression, which doesn't particularly influence how I handle these cases, is that once we've done a good job on the modeling, the remaining outliers are most likely errors in data, which would occur under any reasonable model. Further, you wrote: "This can only be assessed by analyzing data in two ways, including and excluding the outliers." Although I think that such contrasting analyses can be of great help to understand the impact of anomalous data, my point was the opposite, you don't necessarily have to do these contrasting analyses if you use a model that is more robust to outliers/errors. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Sima Sadray Date: September 27, 2006 technical
From: Dr Sima Sadray Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Wed, 27 Sep 2006 Dear All, I think the article below will help. Likelihood-based diagnostics for Influential Individuals in nonlinear mixed effects model selection. S Sadray,E.N. Jonsson and M O Karlsson, Pharmaceutical Research, Vol 16, No.8,1999 Sima Dr Sima Sadray, PharmD, PhD Division of Pharmacokinetics and Biopharmaceutics Department of Pharmaceutics, Faculty of Pharmacy Tehran University of Medical Sciences P.O.Box 14155 /6451 Tehran - IRAN Telfax: +98 21 66959054 Mobile: 0912 2022793 Fax: +98 21 66461178 E-mail: sadrai@sina.tums.ac.ir

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Partha Nandy Date: September 27, 2006 technical
From: "Nandy, Partha [PRDUS]" PNandy@prdus.jnj.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Wed, 27 Sep 2006 07:15:30 -0400 Hi All, Thanks for discussing this outlier issue. It is always very difficult to decide whether to keep the outlying data points in or remove those. I have a question though; For Additive error models I think what Mats suggested works great. What is your suggestion for a ADD+PROP Error models? Should one use OMEGAs on both ADD and PROP Errors? Also, bear in mind that if one is using AIC or any other such criteria to select models, one needs to now account for additional parameters.. I am interested in your opinion... Kind Regards, Partha

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Chuanpu Hu Date: September 27, 2006 technical
From: Chuanpu.Hu@sanofi-aventis.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Wed, 27 Sep 2006 09:25:53 -0400 Hi Mats, Sorry for not interpreting you correctly - thank you for clearifying. I agree with your second point, and I generally don't go about deleting outliers. My point was that if one does delete outliers, then the impact should be assessed. In particular, I have seen "assessments" been reported with changes in -2LL, which I think is misleading. Regarding your first point, I think sometimes in practice a "good job" is not done, for various reasons. For example, we know that dosing or sample collection times are inexact in phase III trials. Modeling can be attempted fot this, but not easilly. Could you comment on searching and dealing with outliers in this situation. Best regards, Chuanpu

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: September 27, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Wed, 27 Sep 2006 15:27:55 -0400 Hi Mats, Nmusers, Here are my two cents on this discussion. 1) For individual data-point outliers wouldn't the 'ETA on Epsilon' residual error model you propose effectively down-weight all of the observations within an individual and not just the suspected outlier data point? I certainly see value in the 'ETA on Epsilon' residual error model when the magnitude of the residual variation does not appear to be the same across all subjects. However, in using this model I would want to assess whether the apparent change in magnitude of the residual variation across subjects is being unduly influenced by a single observation within the subject's data. If it is, I don't think I would use this approach. Of course, it may be a challenge to discrimate statistical outliers vs. misspecification of the residual error model (e.g., non-homogenous variation across subjects) vs. lack of fit of the structural model. Note that a change in residual error model to accommodate outliers rather than excluding outliers is making an implicit set of assumptions so I don't think we can 'side-step' the issue of outlier assessment...we are just trading one set of assumptions for another. 2) Matt Hutmacher and I have been toying with the following idea to address individual data outliers. First, based on a prespecified set of criteria, identify suspected individual data outliers. Second, create a flag variable on the data set to identify these data outliers (i.e., FLAG=1 denotes outlier, FLAG=0 denotes non-outlier). Third, fit a residual error model with different sigmas for outliers and non-outliers. The following code for a constant CV error model might be considered: Y=F*(1+(1-FLAG)*EPS(1)+FLAG*EPS(2)) (If the outliers appear to be independent of F then one might postulate EPS(2) as an additive effect.) With this model sigma2 would be larger than sigma1 effectively down-weighting the suspected outliers without having to formally exclude them (i.e., giving zero weight to them). The degree of down-weighting can be determined from the ratio of the estimates of sigma2 to sigma1 and would increase as the magnitude of outlier deviations increases. One could compare the parameter estimates (thetas and omegas) from this model to that of the usual CV error model, Y=F*(1+EPS(1)), to determine how much leverage these outliers collectively have on the estimation. Any thoughts on this approach? We don't have any direct experience in applying this approach so if anyone would like to try it and report back their experiences we would certainly be interested in hearing about it. 3) For detecting individual data-point outliers (as opposed to outlying subjects) wouldn't the IWRES be a better diagnostic than WRES or CWRES? It would seem to better fit with the sentiment that when assessing individual data point outliers, they should be evaluated in context with the other observations for that individual, presumably with respect to their deviations from the IPRED. 4) Outlier assessment is a very contextual thing. It is nearly impossible to be completely objective in this assessment but at the same time we should be systematic and use sound reasoning in evaluating outliers and the actions we take. While we need to be cautious when considering the impact of exclusion or down-weighting individual outliers we also shouldn't take the position that we should never exclude them. These outliers can unduly inflate the variance components and mask our ability to detect important determinants (covariate effects) of the PK and PD responses. We need to rigorously evaluate the adequacy of our models with various diagnostic plots and rule out (whenever possible) various forms of model (structural and statistical) misspecification before proposing to exclude outliers. The totality of our diagnostics should help inform our decision on the models we postulate and any actions (including no action) we take regarding outliers. Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: September 27, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Wed, 27 Sep 2006 23:54:18 +0200 Ken, Regarding your comments: 1) I agree. According to the model I suggested, a single outlying data point would mean that the entire information content of that individual would be considered less than without that outlier. Of course this model makes assumptions too, even if it relaxes the assumption of everyone having the same residual variability. It still makes the assumption that residual error distribution is a (transformation of) normal distribution. 2) Maybe the idea has merits. Trying it and showing that the extra subjectivity and effort does pay off in terms of increased parameter precision is however something that I think needs to be shown. Also, with your approach I would think that even when you do identify the outliers correctly, the assumption of a normally distributed random error for the outliers is usually not appropriate. In my experience that is not what outliers/errors look like. E.g. often some observations are far too high (sampling in the wrong arm) or far too low (didn't take the dose), but rarely do the two equate to form a nice normal. Further, I'm not sure what you mean by "prespecified criteria". This could be tricky as outliers are usually not easy to foresee. Your suggestion seems to imply that these are identified before you fit a model to the data and then it is even harder to predict which are outliers. Last, it is not uncommon that one can see that one out of two data points are an outlier, but difficult to determine which of the two it is. 3) I tend to agree, but IWRES is not a panacea either. If data are sparse (compared to the number of parameters and especially etas), IWRES can be quite misleading due to overfit. 4) Your usual good advice that I would not want to disagree with. In relation to this, one of my former co-workers (Dr Sima Sadrai) reminded me in a mail that I think was intended for nmusers (copied below), that there may be some further help in inspecting the individual contribution to the likelihood. The idea is to investigate whether some individuals are driving or masking any model selection. The main idea was not in relation to errors/outlying data points, but maybe it has some merits there too. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se
From: "Elassaiss - Schaap, J. (Jeroen)" jeroen.elassaiss@organon.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Thu, 28 Sep 2006 08:18:30 +0200 Hi Ken, To my opinion your idea of flagging with an extra epsilon is a first step towards complete iterative weighting as demonstrated by Jan Freijer, see http://www.page-meeting.org/page/page2005/PAGE2005P76.pdf (I guess he won't chime in himself). His implementation has two pros: i) it is unsupervised and ii) it is gradual. He specified the example of errors in dosing and sampling time, but the approach seems general and therefore also applicable to other causes of outliers. I have used a similar approach during my PhD but in another field with smoothing rather than fitting, i.e. adaptive smoothing. It worked really well in removal of (electronic) artifacts observed in noisy, densily sampled time series and was simple to implement (iteration on a linear regression). Best regards, Jeroen J. Elassaiss-Schaap Scientist PK/PD Organon NV PO Box 20, 5340 BH Oss, Netherlands Phone: + 31 412 66 9320 Fax: + 31 412 66 2506 e-mail: jeroen.elassaiss@organon.com

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: September 28, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Thu, 28 Sep 2006 17:08:59 -0400 Mats, We appear to be in good agreement on all points. Thank you for your kind words regarding (4) and info on Dr. Sadray's (et al.) paper...I will certainly take a look at it. I just have a follow up with regards to your responses to (2). 2) Certainly work needs to be done to evaluate whether this approach indeed has merit. I agree there is no reason necessarily to believe outliers are normal, however, we most likely will lack suitable power to assess the distribution of these outlying data. There is precedence to consider a mixture model of normal distributions, referred to in the statistical literature as a contaminated normal distribution where Y is distributed as (1-p)N(mu,sigma) + (p)N(mu,k(sigma)) where p represents the fraction of outliers and k is the scale parameter for the increased variation in the outliers (see Barnett and Lewis, Outliers in Statistical Data, Wiley, 1978, pp 31-33, 127-130). We propose a two-stage approach to this contanimated normal distribution by first estimating p by use of a prespecified outlier criteria and fixing this through the use of the FLAG variable. In the second stage we estimate k which is the ratio of sigma2 to sigma1. The outlier criteria, which would ideally be specified in the analysis plan before starting the model development, might be something like "flag all data points as potential outliers for further evaluation where abs(IWRES)>5" (perhaps a reasonable criteria with dense data). Of course, we could look at a full likelihood mixture model approach were p and k are simultaneously estimated. There are other contaminated normal mixture models that allow for asymmetry (a shift in mu as well as a scale increase in sigma) and of course mixtures of different distributions between non-outliers and outliers. Whether we have enough power to discern between various contaminated distributions and how well they may perform in the context of PK/PD is certainly an area that could benefit from some research. Kind regards, Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: September 29, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Fri, 29 Sep 2006 16:45:56 +0200 Hi Ken, Good luck with the evaluations. When you write: " Of course, we could look at a full likelihood mixture model approach were p and k are simultaneously estimated.", do you know a software that could do that? NONMEM would not be able to handle it. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: September 29, 2006 technical
From: "Kowalski, Ken" Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Fri, 29 Sep 2006 12:17:45 -0400 Mats, It can be done in NONMEM...credit goes to Matt Hutmacher for figuring this out. You need to use the LIKELIHOOD or -2LL option. Here is an example of a $PRED code segment that Matt Hutmacher prepared and tested: $PRED MU=THETA(1)+ETA(1) MP=THETA(2) SIG1=THETA(3) SIG2=THETA(4) IW1=(DV-MU)/SIG1 IW2=(DV-MU)/SIG2 L1=-0.5*LOG(2*3.14159265)-LOG(SIG1)-0.5*(IW1**2) L2=-0.5*LOG(2*3.14159265)-LOG(SIG2)-0.5*(IW2**2) L=(1-MP)*EXP(L1)+MP*EXP(L2) Y=-2*LOG(L) Note that MU can be replaced with a more complex PK/PD model, MP is the mixing probability, SIG1 is the sigma for non-outliers, and SIG2 is the sigma for outliers. Regards, Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: September 30, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Sat, 30 Sep 2006 17:38:50 +0200 Ken, Nice. I guess it does not come without a price as the same mixture model is applied to subjects with and without outliers alike. If one were to take this estimation route, estimating a (interindividual) mixture model for the mixing component would be a way to address this: $MIX P(1)=THETA(5) ;Proportion of subjects with outliers P(2)=1-P(1) ;Proportion of subjects without outliers $PRED MU=THETA(1)+ETA(1) MP=THETA(2) ;MP for subjects with outliers IF(MIXNUM.EQ.2) MP=0 ;MP for subjects without outliers SIG1=THETA(3) SIG2=THETA(4) IW1=(DV-MU)/SIG1 IW2=(DV-MU)/SIG2 L1=-0.5*LOG(2*3.14159265)-LOG(SIG1)-0.5*(IW1**2) L2=-0.5*LOG(2*3.14159265)-LOG(SIG2)-0.5*(IW2**2) L=(1-MP)*EXP(L1)+MP*EXP(L2) Y=-2*LOG(L) If you have really contaminated data maybe you in addition want to add a (logit-transformed) ETA on MP for subjects with outliers. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Nick Holford Date: October 01, 2006 technical
From: Nick Holford n.holford@auckland.ac.nz Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 02 Oct 2006 12:59:33 +1300 Mats, Ken, I must be missing some subtle issue here -- why do you think it is necessary to code this using -2LL? Why not code it like this? $THETA (0,0.1,1) ; P prob of being an outlier (0,1,) ; SD of additive residual error (0,1,) ; K fractional difference in SD in outlier population $OMEGA 0.5 ; between subject variability in MYPRED $SIGMA 1 FIX ; unit random effect $MIX NSPOP=2 P(1)=THETA(1) ; P prob of being an outlier P(2)=1-P(1) $PRED ;MYPRED= ...any PKPD model you like with random effects e.g. MYPRED=DOSE/V*EXP(-CL/V*TIME)+ETA(1) ;SD= ... any residual error model you want expressed with THETAs e.g SD=THETA(2) ; additive residual error SD IF (MIXNUM.EQ.1) THEN ; outlier KOUT=THETA(3) ELSE KOUT=1 ENDIF Y=MYPRED+KOUT*SD*EPS(1) Nick -- Nick Holford, Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand email:n.holford@auckland.ac.nz tel:+64(9)373-7599x86730 fax:373-7556 http://www.health.auckland.ac.nz/pharmacology/staff/nholford/

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: October 02, 2006 technical
From: "Mats Karlsson" mats.karlsson@farmbio.uu.se Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 2 Oct 2006 08:25:22 +0200 Hi Nick, The intention is to have the mixture on the individual *observation* level, not on (or not only on) the individual subject level. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Nick Holford Date: October 02, 2006 technical
From: Nick Holford n.holford@auckland.ac.nz Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 02 Oct 2006 22:55:36 +1300 Mats, So you are using $MIX to switch between outliers and non-outlier type subjects? The mixing probability of an outlier is P (Ken's nomenclature). If its an outlier type subject then the -2LL code returns a fractional residual error contribution for an outlier (SIG2) or non-outlier observation (SIG1) depending on an observation level mixing fraction (MP). If its a non-outlier type subject then the residual error is determined by SIG1 alone (MP=0). SIG2/SIG1 is K (Ken's nomenclature). >>> > L1=-0.5*LOG(2*3.14159265)-LOG(SIG1)-0.5*(IW1**2) >>> > L2=-0.5*LOG(2*3.14159265)-LOG(SIG2)-0.5*(IW2**2) >>> > L=(1-MP)*EXP(L1)+MP*EXP(L2) >>> > Y=-2*LOG(L) PS The -0.5*LOG(2*3.14159265) is not really needed (because its a constant that doesn't change the minimum of -2LL) and indeed is inconsistent with the way that NONMEM typically calculates its objective function. -- Nick Holford, Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand email:n.holford@auckland.ac.nz tel:+64(9)373-7599x86730 fax:373-7556 http://www.health.auckland.ac.nz/pharmacology/staff/nholford/

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Mats Karlsson Date: October 02, 2006 technical
From: Mats Karlsson [mailto:mats.karlsson@farmbio.uu.se] Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Monday, October 02, 2006 5:08 PM Hi Ken, Allowing varying residual variability between subjects provide you with an estimate of which individuals' data for which the model doesn't fit, whether it is through error, outlier or model misspecification. If you use a prespecified criteria, I don't think this will change - your criteria (if it is any good) will still identify observations for which the model doesn't fit. These will represent errors, outliers or data where model misspecification is pronounced. I'm not sure why you think you could, through the somewhat ad hoc procedure of e.g. IWRES>x strike a better balance and only get errors and not model misspecification. Regarding classification of data points as outliers or not, I guess classification is always destroying information. Why would you want to classify? However, I guess that you could get the information to classified by "just" evaluating the OFV for the 2**Ni possibilities, where Ni is the number of observations for subject i. On the practical side, a drawback of the individual observation mixture model is of course that it requires the LAPLACIAN method (unless Matt has pulled off another stunt!). Thanks for the entertaining discussion, but tomorrow morning I'm off for vacation so don't expect any quick reply on further comments. Best regards, Mats Mats Karlsson, PhD Professor of Pharmacometrics Div. of Pharmacokinetics and Drug Therapy Dept. of Pharmaceutical Biosciences Faculty of Pharmacy Uppsala University Box 591 SE-751 24 Uppsala Sweden phone +46 18 471 4105 fax +46 18 471 4003 mats.karlsson@farmbio.uu.se

Re: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Nick Holford Date: October 02, 2006 technical
From: Nick Holford n.holford@auckland.ac.nz Subject: Re: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Monday, October 02, 2006 5:59 PM Ken, Thanks for confirming the distinction between outlier subjects (i.e. large ETA) and subjects with outliers (i.e. large EPS). While attempting to contribute to this thread I have realized the importance of distinguishing between 'outlier subjects' and 'subjects with outliers'. The $MIX code I posted earlier attempts to separate subjects with outliers from those without outliers based on the size of EPS (see again below). This is in the spirit of the discussion to identify subjects who do not fit well and therefore may be subjects with outliers regardless of any ETA adjustment. The fractional likelihood method you and Matt H. propose estimates some average fraction of outlier observations. I am still not very comfortable about the fractional likelihood method because it assumes a similar fraction of outlier and non-outlier observations in each subject (which might be addressed by having an ETA on this fraction) but more importantly (for this thread) it doesn't distinguish in a discrete way between outlier and non-outlier observations. I think we need to have something like $MIX that works at the observation level (instead of only at the subject level) because it produces a discrete classification. You refer to a "two stage approach" in your earlier emails but I am afraid I didnt really follow all the details. Here is a specific proposal using NONMEM: Step 1: Use the simple $MIX method (See Step 1 code below) to identify subjects with outliers from subjects without outliers. Save posthoc estimates of the ETA specific parameters for the next step. This step estimates 'P' (your nomenclature) i.e. the proportion of subjects with outliers and also discretely identifies (using MIXEST) which subject belongs to each group (ME1). Step 2: Code the data with ID changing for each observation and use the individual posthoc parameters (FIXED) to make subject specific predictions. Set up a $MIX model to identify the overall proportion of observation level outliers in the total set of observations (See Step 2 code below). You can estimate different residual error model parameters for outlier and non-outlier observations ('K') during this second step but the main outcome would to use MIXEST at the observation level to distinguish outlier observations (ME2). The second step can be used to to test the hypothesis that there are indeed two populations of observations i.e. outliers and non-outliers by fitting with and without $MIX and 'K'. Overall this 2 step procedure is not as elegant as doing it in one step but it could maybe achieve the goal of this thread i.e. to identify outlier observations with a somewhat objective and automatable procedure. The main problem I think is in the posthoc estimates of the parameters in Step 1 which will be somewhat contaminated with the misspecification of the outlier vs nonoutlier residual error. Perhaps it is here that the fractional likelihood method might help. Best wishes, Nick Caution: The following code has not been tested. It is intended primarily to demonstrate the ideas being discused. It might easily contain errors! ************* STEP 1 ****************** $THETA (0,0.1,1) ; P prob of being subject with outlier 1 (0,3,); Pop clearance 2 (0,10,); Pop volume 3 (0,1,) ; SD of additive residual error 4 (0,1,) ; K fractional difference in SD in subjects with outliers 5 $OMEGA 0.5 ; between subject variability in CL 0.5 ; between subject variability in V $SIGMA 1 FIX ; unit random effect $MIX NSPOP=2 P(1)=THETA(1) ; P prob of being subject with outlier P(2)=1-P(1) $PRED IF (NEWIND.EQ.0) OBID=0 ; counter for observations CL=THETA(2)*EXP(ETA(1)) V=THETA(3)*EXP(ETA(2)) MYPRED=DOSE/V*EXP(-CL/V*TIME) ;SD= ... any residual error model you want expressed with THETAs e.g SD=THETA(4) ; additive residual error SD IF (MIXNUM.EQ.1) THEN ; subject with outliers KOUT=THETA(3) ELSE KOUT=1 ENDIF Y=MYPRED+KOUT*SD*EPS(1) IF (MDV.EQ.0) OBID=OBID+1 ; create observation level ID ME1=MIXEST $TABLE ID ME1 OBID TIME DOSE CL V DV MDV NOAPPEND NOPRINT ONEHEADER FILE=step1.fit ************* STEP 2 ****************** $DATA step1.fit IGNORE=@ ;OBID is an observation level ID type of data item ;ICL and IV are the posthoc individual estimates of CL and V from Step 1 $INPUT SID ME1 ID=OBID TIME DOSE ICL IV DV MDV $THETA (0,0.1,1) ; proportion of obs which are outliers 1 (0,1,) ; SD of additive residual error 2 (0,1,) ; K fractional difference in SD in outlier population 3 $OMEGA 0 FIX ; keep NM-TRAN happy (no OMEGAs needed in this step $SIGMA 1 FIX ; unit random effect $MIX NSPOP=2 P(1)=THETA(1) ; proportion of obs which are outliers P(2)=1-P(1) $PRED MYPRED=DOSE/IV*EXP(-ICL/IV*TIME) SD=THETA(2) ; additive residual error SD IF (MIXNUM.EQ.1) THEN ; outlier observation KOUT=THETA(3) ELSE KOUT=1 ENDIF Y=MYPRED+KOUT*SD*EPS(1) ME2=MIXEST $TABLE SID ID ME1 ME2 -- Nick Holford, Dept Pharmacology & Clinical Pharmacology University of Auckland, 85 Park Rd, Private Bag 92019, Auckland, New Zealand email:n.holford@auckland.ac.nz tel:+64(9)373-7599x86730 fax:373-7556 http://www.health.auckland.ac.nz/pharmacology/staff/nholford/

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: October 02, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 2 Oct 2006 16:21:25 -0400 Mats, Nice suggestion. By incorporating an interindividual mixture model for subjects with at least one outlier we can output MIXEST for each individual as a diagnostic. This would allow us to identify which individuals have at least one outlier observation (and which individuals are "clean" of any outliers). Still, within an individual in the outlier population we wouldn't know which observations are being classified as outliers vs non-outliers without further post-processing of the individual observation contributions to the likelihood (don't ask me how to do this :) ). While this is an interesting approach (our original proposal with or without your interindividual mixture model suggestion) it is not clear to me how well this approach will work in practice. I suspect that any misspecification of the structural as well as statistical models may result in the estimation of subpopulations that may not exactly coincide with outlier vs non-outlier populations of observations. I would probably start with the two-stage approach and go to a full likelihood approach if one shows that the subpopulations coincide with the outlier vs non-outlier dichotomy based on a pre-specified criteria for outliers. If the outliers tend to have more positive rather than negative residuals or if they are not randomly distributed about the curve, this might be an indication of model misspecification which may have to be resolved before entertaining this full likelihood outlier approach and what form of contaminated residual error model to employ. Best regards, Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: October 02, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Mon, 2 Oct 2006 16:31:43 -0400 Nick, Sorry for the confusion. Mats' suggestion to identify subjects with at least one outlier observation (data point) via the $MIX code combined with the residual error mixture model should not be confused with the distinct but separate issue of "outlier subjects". Note that an "outlier subject" whose individual curve deviates substantially from the typical individual curve may not have any individual data point outliers. That is, conditional on the subject's ETA values, the deviations from IPRED may not be extreme (i.e., no extreme IWRES values). We are not dealing with outlier subjects whose curves may represent extreme deviations from PRED (e.g., extreme ETA values) but whose individual observations are reasonably consistent within the individual. Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: October 03, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 3 Oct 2006 17:31:45 -0400 Mats, Outlier detection requires some criteria for explicit assessment. In practice we often look at residual plots and make a judgement as to whether to classify an observation as an outlier or non-outlier. It is important to make the distinction that this classification is done only for diagnostic purposes. No matter how we choose to deal with outliers, we should be explicit, transparent, and systematic about our procedures for handling data outliers. Moreover, as we stated previously, we need to first rule out to the best of our ability any model misspecification before we take any actions with regards to outliers we detect (e.g., excluding outlier observations or fitting alternative models and down-weighting). We could use a similar approach to classify data outliers based on this full likelihood (residual error mixture model) to what NONMEM does with the MIXEST calculation to classify which subpopulation each subject belongs to. It is purely a post hoc calculation for diagnostic purposes; knowing which subpopulation each observation belongs to is not involved in the estimation. In this case it's just not built into NONMEM so we would have to do some post-processing to perform this post hoc classification. At this point we are certainly not recommending this interesting but untested approach. But it may be promising and worth evaluating with real data and simulation studies. Figuring out the post hoc calculations would certainly help improve the diagnostic value of this approach as well. I trust that you had or are having an enjoyable vacation. Best wishes, Ken

RE: WRES AND OUTLIER IDENTIFICATION/EXCLUSION

From: Kenneth Kowalski Date: October 03, 2006 technical
From: "Kowalski, Ken" Ken.Kowalski@pfizer.com Subject: RE: [NMusers] WRES AND OUTLIER IDENTIFICATION/EXCLUSION Date: Tue, 3 Oct 2006 17:38:09 -0400 Nick, You wrote: "I am still not very comfortable about the fractional likelihood method because it assumes a similar fraction of outlier and non-outlier observations in each subject..." The full likelihood approach using an observation-level mixture model (i.e., residual error mixture model) does not make this assumption. The original version without Mats' modification to use $MIX merely estimates the proportion of outlier observations from the total number of observations. With Mats' $MIX code modification, we estimate the proportion of 'subjects with outliers', thus, the mixing proportion in the residual error mixture model is now conditional on the total number of observations in the population of 'subjects with outliers' rather than the total number of observations. There is no constraint that forces the observation-level mixing proportion to be the same within each subject of the 'subjects with outliers' subpopulation. You wrote: "...more importantly (for this thread) it doesn't distinguish in a discrete way between outlier and non-outlier observations." True...but it doesn't mean we couldn't perform some post hoc calculations to classify observations as outlier or non-outlier based on this full likelihood approach. It is analogous to the situation with $MIX and the post hoc calculations that MIXEST performs. If we did not have the MIXEST capabilities built in to NONMEM we would have a harder time with our diagnostic evaluation of subject-level mixture models using the $MIX functionality. Same is true here with the observation-level mixture models. To fully evaluate and advocate this approach would require more work to determine the post-processing calculations that would allow us to classify the observations in the two populations (outliers vs non-outliers) for diagnostic purposes. Note that if we performed these post hoc calculations such that we could classify outliers vs non-outliers at the observation level we would no longer need to use the $MIX code to identify 'subjects with outliers'. The population of 'subjects with outliers' could be determined directly from these post hoc calculations of the observation-level classification of outliers and non-outliers. Kind regards, Ken _______________________________________________________