Articles

Tolerance intervals in statistical software and robustness under model misspecification

Cho, Kyung S.; Tony Ng, Hon K.

A tolerance interval is a statistical interval that covers at least 100ρ% of the population of interest with a 100(1−α)% confidence, where ρ and α are pre-specified values in (0, 1). In many scientific fields, such as pharmaceutical sciences, manufacturing processes, clinical sciences, and environmental sciences, tolerance intervals are used for statistical inference and quality control. Despite the usefulness of tolerance intervals, the procedures to compute tolerance intervals are not commonly implemented in statistical software packages. This paper aims to provide a comparative study of the computational procedures for tolerance intervals in some commonly used statistical software packages including JMP, Minitab, NCSS, Python, R, and SAS. On the other hand, we also investigate the effect of misspecifying the underlying probability model on the performance of tolerance intervals. We study the performance of tolerance intervals when the assumed distribution is the same as the true underlying distribution and when the assumed distribution is different from the true distribution via a Monte Carlo simulation study. We also propose a robust model selection approach to obtain tolerance intervals that are relatively insensitive to the model misspecification. We show that the proposed robust model selection approach performs well when the underlying distribution is unknown but candidate distributions are available.

Files

  • thumnail for 40488_2021_Article_123.pdf 40488_2021_Article_123.pdf application/pdf 403 KB Download File

Also Published In

Title
Journal of Statistical Distributions and Applications
DOI
https://doi.org/10.1186/s40488-021-00123-2

More About This Work

Published Here
September 22, 2023

Notes

Cauchy distribution, Maximum likelihood, Model selection, Model uncertainty