Annals of Occupational Hygiene Advance Access originally published online on May 20, 2009
Annals of Occupational Hygiene 2009 53(5):509-521; doi:10.1093/annhyg/mep034
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sampling Strategies for Occupational Exposure Assessment under Generalized Linear Model
1 Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan Town, Miaoli County 350, Taiwan
2 Institute of Occupational Medicine and Industrial Hygiene, National Taiwan University, College of Public Health, Taipei 100, Taiwan
* Author to whom correspondence should be addressed. Tel: +886-37-246-166 (ext. 36186); fax: +886-37-586-467; e-mail: ccchen{at}nhri.org.tw
Objectives: Occupational exposure assessment is a major task in industrial hygiene studies. Although statistical analyses for magnitudes and variations of exposures to various types of working populations based on existing data sets are extensive, relatively few discussions on study designs appear in the literature, especially for sample size determination and number of repeated measurements.
Methods: In this paper, we propose a general framework of sampling strategies on sample size requirement together with the number of repeated measurements using the mixed-effects generalized linear model (GLM). As illustrative examples, we discuss sampling strategies separately under the log-normal assumption for hypotheses testing on (i) mean exposure differences of multiple worker groups and (ii) presence of a long-term exposure trend.
Results: Given a specified alternative hypothesis, the desired significance level and statistical power, the number of repeated measurements, within-worker and between-worker variances, and a correlation structure, we have derived and tabulated an explicit sample size requirement of the two hypothetical cases under log-normal distribution assumption.
Conclusions: On the basis of the tabulated outcomes, the sample size requirement is much more dominant than the number of repeated measurements for a group exposure comparison. Thus, in this case, recruiting more workers with fewer repeated measurements may be more economical than the opposite approach. For testing the presence of a long-term exposure trend, the sample size required decreases substantially with the number of repeated measurements. Also, equally spaced sampling times would be optimal because the effect of between-worker variance is algebraically cancelled out in sample size calculations.
Keywords: exponential family distribution gamma distribution generalized estimating equation log-normal distribution long-term trend mixed model
Received August 12, 2008; in final form April 23, 2009