RE: Right skewness in bootstrap distribution

From: Martin Bergstrand Date: April 25, 2013 technical Source: mail-archive.com
Dear Sebastian, Felipe and all, Bootstraps and sample size in NLME is the topic of a poster abstract submitted to PAGE 2013 (www.page-meeting.org/?abstract=2899). I recommend individuals interested in the topic who are attending PAGE 2013 to stop by the poster. For those not attending PAGE the abstract is available via the link and the poster should be available at the same place after the conference. Niebecker R., Mats O. Karlsson M.O. Are datasets for NLME models large enough for a bootstrap to provide reliable parameter uncertainty distributions? PAGE 22 (2013) Abstr 2899 [www.page-meeting.org/?abstract=2899] Another PAGE 2013 abstract deals with an alternative method (SIR) for characterizing parameter uncertainty (www.page-meeting.org/?abstract=2907). This method might be of particular value in application to small sample sizes. Hang around towards the end of the PAGE program (Friday) and you can hear Anne-Gaëlle Dosne tell you more about it. Dosne A.G., Bergstrand M., Mats O. Karlsson M.O. Application of Sampling Importance Resampling to estimate parameter uncertainty distributions. PAGE 22 (2013) Abstr 2907 [www.page-meeting.org/?abstract=2907] Best regards, Martin Bergstrand, PhD Pharmacometrics Research Group Dept of Pharmaceutical Biosciences Uppsala University, Sweden Postal address: Box 591, 751 24 Uppsala, Sweden Phone +46 709 994 396 Fax + 46 18 4714003
Quoted reply history
From: [email protected] [mailto:[email protected]] On Behalf Of Sebastian Frechen Sent: den 25 april 2013 01:34 To: Felipe Hurtado; [email protected] Subject: AW: [NMusers] Right skewness in bootstrap distribution Dear All, let me also add some more thoughts. I do not think that it is a matter of a low value of the bootstrap itself considering a sample size of only 8 subjects. In this case, every estimate of a parameter and its related variability have to be treated with caution - that's clear. However, here is a very simple example that the bootstrap also may work for small sample sizes from a pure mathematical point of view: Let's say, we would like to estimate the true average weight in a population of interest. Our sample consists of n=8 observations, for example: X = (64, 68 ,72 ,75 ,76 ,83 ,91 ,93). Of course, the mean is a reasonable estimate (77.8) and we easily obtain a (slightly biased [<5%]) estimated standard error of sd(X)/sqrt(n)=3.7 Now let's run a bootstrap with 1000 samples by resampling X with replacement; I estimated in this case a standard error of 3.6 given by the standard deviation of my estimates for the mean in each bootstrap sample. At least in this example, this value corresponds to the value obtained by the ordinary method. The same applies for estimated confidence intervals. In our case, I would assume an overall normal distribution for weight. Thus, computing for the mean an exact 95%-CI based on the t-distribution with 7 degrees of freedom, we obtain (69.0 ; 86.5). Constructing a 95%-CI based on the percentiles of my bootstrap sample distribution, the CI is (70.8 ; 84.9) - slightly too narrow but still not too bad and still giving us a reasonable impression for the precision. Of course, this is a very easy example where the bootstrap does not really make sense since exact methods exist and also there may be constellations where the bootstrap does not produce sound results at all. It's clear that larger sample sizes significantly improve the bootstrap. But still, in my opinion especially in cases of small numbers, a bootstrap analysis might be even of greater value than results from asymptotic computations. Best regards, Sebastian ---------------------------------------------------------------------------- -------- Dr. Sebastian Frechen Department of Pharmacology, Clinical Pharmacology Cologne University Hospital ----------------------- Gleueler Str. 24 50931 Cologne Germany _____ Von: De Ridder, Filip [JRDBE] [[email protected]] Gesendet: Mittwoch, 24. April 2013 09:54 An: Sebastian Frechen; Felipe Hurtado; [email protected] Betreff: RE: [NMusers] Right skewness in bootstrap distribution Dear All, A few more additional thoughts regarding the use of the bootstrap here. Although I agree with what Sebastian writes, I think the value of the bootstrap is very limited when you have a sample size of only 8 subjects. Re-sampling with replacement from such a small number can easily result in replicates in which a certain subjects appears 2 or 3 even three times. In this way, the bootstrap distribution becomes quite discrete in nature and can easily start to look weird. I suspect the “subpopulation” Felipe mentions corresponds to bootstrap samples in which a subject with an extreme V1 occurs repeatedly. On a related note, did all of your bootstrap replicates yield a converged NONMEM run? Kind regards, Filip De Ridder Model-based Drug Development Janssen R&D From: [email protected] [mailto:[email protected]] On Behalf Of Sebastian Frechen Sent: Wednesday, 24 April 2013 1:29 AM To: Felipe Hurtado; [email protected] Subject: AW: [NMusers] Right skewness in bootstrap distribution Dear Felipe, I totally argree with Jakob. Maybe just some more comments on the bootstrap to support this. Each estimator comes up with its own sampling distribution reflecting the uncertainty for the obtained estimate given your data and model. You can do assumption on this distribution, for example the arithemtic mean as an estimate for the "true average in a population" follows in general a normal distribution if the sample size is suffiently large enough. However, this does not apply for every estimator! One of the basic ideas of the bootstrap is now that you do not know the underlying sampling distribution of your parameter estimate. But using the non-parametric bootstrap method (sample from you dataset with replacement), you construct this distribution (and it is not necessarily Gaussian) by estimating your parameter in each of the generated sample. This in turn gives you a fairly good feeling of how precise your estimate is given your model and the sample size. With respect to your volume: Have you tried fitting the data to one- or two-compartment models? How does the volume behave then? Why are you using a three-compartment model? Best regards, Sebastian
Apr 23, 2013 Felipe Hurtado Right skewness in bootstrap distribution
Apr 23, 2013 Jakob Ribbing RE: Right skewness in bootstrap distribution
Apr 24, 2013 Filip de Ridder RE: Right skewness in bootstrap distribution
Apr 25, 2013 Felipe Hurtado RE: Right skewness in bootstrap distribution
Apr 25, 2013 Martin Bergstrand RE: Right skewness in bootstrap distribution