RE: VPCs confidence intervals?

From: Kenneth Kowalski Date: March 14, 2019 technical Source: mail-archive.com
Hi All, I know what Bill is trying to say but it is not quite accurate the way he states it. A prediction interval makes inference on a statistic based on a future sample such as a sample mean of a future set of data. In contrast, a confidence interval makes inference on a parameter such as the population mean which is a fixed number. A prediction interval takes into account both the uncertainty in the existing data used to estimate the population parameter as well as the sampling variation to make inference on a sample statistic (e.g., sample mean for a future trial). A confidence interval only takes into account the uncertainty in the existing data used to estimate the parameter. Based on the Law of Large Numbers, the population mean can be thought of as taking the sample mean of an infinite sample size (i.e., sampling the entire population). For this reason, a prediction interval with an infinite sample size will collapse to a confidence interval. An interval based on VPCs is more akin to a prediction interval since it takes into account the sampling variation based on a finite sample size, however, one cannot assign a valid coverage probability (confidence level) to this interval unless it also takes into account the parameter uncertainty. With VPCs applied to existing data (i.e, an internal VPC) it is customary to not take into account this parameter uncertainty so many refer to such prediction intervals as degenerate as they place 100% certainty on the model parameter estimates used to obtain the VPC predictions. One could potentially call these intervals ‘degenerate prediction intervals’ but I tend to just call them ‘VPC intervals’ (e.g., a 90% VPC interval) so as to avoid misperception that these prediction intervals have a statistically valid coverage probability. However, when VPCs are applied to an independent dataset not used in the development of the model, it is often advised to take into account the parameter uncertainty when performing the VPCs to essentially reflect the trial-to-trial uncertainty of the independent data not used in the estimation of model (i.e., refitting the same model to a new set of trial data will not give the same set of estimates and hence reflects trial-to-trial variation). In this setting, where the VPCs take into account both the parameter uncertainty and sampling variation to predict on an independent (e.g., future) dataset, then one is on more solid ground to refer to these VPC intervals as prediction intervals with valid coverage probabilities. Kind regards, Ken Kenneth G. Kowalski Kowalski PMetrics Consulting, LLC Email: <mailto:[email protected]> [email protected] Cell: 248-207-5082
Quoted reply history
From: [email protected] [mailto:[email protected]] On Behalf Of Bill Denney Sent: Thursday, March 14, 2019 1:10 PM To: Soto, Elena <[email protected]>; [email protected] Subject: RE: [NMusers] VPCs confidence intervals? Hi Elena, VPCs are accurately called prediction intervals not confidence intervals. The difference is that a prediction interval shows what you would expect for the next individual in a study while a confidence interval shows what you would expect for the result of a statistic (often confidence intervals of a mean are shown). With many VPCs, the confidence interval of the median and the confidence interval of the 5th and 95th percentiles are shown. Also, when the lines indicate the median, 5th, and 95th percentiles of the simulations, that is the 90% prediction interval since it is the middle 90% of the data (not the 95% confidence interval). Thanks, Bill From: [email protected] <mailto:[email protected]> <[email protected] <mailto:[email protected]> > On Behalf Of Soto, Elena Sent: Thursday, March 14, 2019 12:49 PM To: [email protected] Subject: [NMusers] VPCs confidence intervals? Dear all, I have a question regarding visual predictive checks (VPCs). Most of VPCs used now, include a line representing the median and 5th and 95th percentiles of the data values and an area around the same percentiles that is commonly define as the 95% confidence interval (of the simulations). But is it correct, from the statistical point of view, to call confidence interval to this area? And if this is not the case how should we define them? Thanks, Elena Soto Elena Soto, PhD Pharmacometrician Pharmacometrics, Global Clinical Pharmacology Global Product Development Pfizer R&D UK Limited, IPC 096 CT13 9NJ, Sandwich, UK Phone : +44 1304 644883 _____ Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this e-mail by anyone else is unauthorised. If you are not an addressee, any disclosure or copying of the contents of this e-mail or any action taken (or not taken) in reliance on it is unauthorised and may be unlawful. If you are not an addressee, please inform the sender immediately. Pfizer R&D UK Limited is registered in England under No. 11439437 with its registered office at Ramsgate Road, Sandwich, Kent CT13 9NJ --- This email has been checked for viruses by Avast antivirus software. https://www.avast.com/antivirus
Mar 14, 2019 Elena Soto VPCs confidence intervals?
Mar 14, 2019 Bill Denney RE: VPCs confidence intervals?
Mar 14, 2019 Kenneth Kowalski RE: VPCs confidence intervals?
Mar 14, 2019 Mats Karlsson RE: VPCs confidence intervals?
Mar 18, 2019 Mike K Smith RE: VPCs confidence intervals?
Mar 18, 2019 Nick Holford RE: VPCs confidence intervals?