Skip Navigation

Annals of Occupational Hygiene 2007 51(7):611-632; doi:10.1093/annhyg/mem045
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (5)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Hewett, P.
Right arrow Articles by Ganser, G. H.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hewett, P.
Right arrow Articles by Ganser, G. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?


© The Author 2007. Published by Oxford University Press on behalf of the British Occupational Hygiene Society

A Comparison of Several Methods for Analyzing Censored Data

Paul Hewett1,* and Gary H. Ganser2

1 Exposure Assessment Solutions, Inc., Morgantown, West Virginia
2 Department of Mathematics, West Virginia University, Morgantown, West Virginia

* Author to whom correspondence should be addressed. e-mail: phewett_2006_07{at}oesh.com

The purpose of this study was to compare the performance of several methods for statistically analyzing censored datasets [i.e. datasets that contain measurements that are less than the field limit-of-detection (LOD)] when estimating the 95th percentile and the mean of right-skewed occupational exposure data. The methods examined were several variations on the maximum likelihood estimation (MLE) and log-probit regression (LPR) methods, the common substitution methods, several non-parametric (NP) quantile methods for the 95th percentile and the NP Kaplan–Meier (KM) method. Each method was challenged with computer-generated censored datasets for a variety of plausible scenarios where the following factors were allowed to vary randomly within fairly wide ranges: the true geometric standard deviation, the censoring point or LOD and the sample size. This was repeated for both a single-laboratory scenario (i.e. single LOD) and a multiple-laboratory scenario (i.e. three LODs) as well as a single lognormal distribution scenario and a contaminated lognormal distribution scenario. Each method was used to estimate the 95th percentile and mean for the censored datasets (the NP quantile methods estimated only the 95th percentile). For each scenario, the method bias and overall imprecision (as indicated by the root mean square error or rMSE) were calculated for the 95th percentile and mean. No single method was unequivocally superior across all scenarios, although nearly all of the methods excelled in one or more scenarios. Overall, only the MLE- and LPR-based methods performed well across all scenarios, with the robust versions generally showing less bias than the standard versions when challenged with a contaminated lognormal distribution and multiple LODs. All of the MLE- and LPR-based methods were remarkably robust to departures from the lognormal assumption, nearly always having lower rMSE values than the NP methods for the exposure scenarios postulated. In general, the MLE methods tended to have smaller rMSE values than the LPR methods, particularly for the small sample size scenarios. The substitution methods tended to be strongly biased, but in some scenarios had the smaller rMSE values, especially for sample sizes <20. Surprisingly, the various NP methods were not as robust as expected, performing poorly in the contaminated distribution scenarios for both the 95th percentile and the mean. In conclusion, when using the rMSE rather than bias as the preferred comparison metric, the standard MLE method consistently outperformed the so-called robust variations of the MLE-based and LPR-based methods, as well as the various NP methods, for both the 95th percentile and the mean. When estimating the mean, the standard LPR method tended to outperform the robust LPR-based methods. Whenever bias is the main consideration, the robust MLE-based methods should be considered. The KM method, currently hailed by some as the preferred method for estimating the mean when the lognormal distribution assumption is questioned, did not perform well for either the 95th percentile or mean and is not recommended.

Received March 5, 2007; in final form August 17, 2007


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Eur Respir JHome page
J. E. Lockey, T. J. Hilbert, L. P. Levin, P. H. Ryan, K. L. White, E. K. Borton, C. H. Rice, R. T. McKay, and G. K. LeMasters
Airway obstruction related to diacetyl exposure at microwave popcorn production facilities
Eur. Respir. J., July 1, 2009; 34(1): 63 - 71.
[Abstract] [Full Text] [PDF]


Home page
ANN OCCUP HYGHome page
J. Cocker, J. R. Cain, P. Baldwin, K. McNally, and K. Jones
A Survey of Occupational Exposure to 4,4'-methylene-bis (2-chloroaniline) (MbOCA) in the UK
Ann. Hyg., July 1, 2009; 53(5): 499 - 507.
[Abstract] [Full Text] [PDF]


Home page
ANN OCCUP HYGHome page
P. Logan, G. Ramachandran, J. Mulhausen, and P. Hewett
Occupational Exposure Decisions: Can Limited Data Interpretation Training Help Improve Accuracy?
Ann. Hyg., June 1, 2009; 53(4): 311 - 324.
[Abstract] [Full Text] [PDF]


Home page
ANN OCCUP HYGHome page
K. Krishnamoorthy, A. Mallick, and T. Mathew
Model-Based Imputation Approach for Data Analysis in the Presence of Non-detects
Ann. Hyg., April 1, 2009; 53(3): 249 - 263.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.