Document Type


Date of Degree

Spring 2017

Access Restrictions

Access restricted until 07/13/2019

Degree Name

PhD (Doctor of Philosophy)

Degree In


First Advisor

Brian J. Smith


Measurement error is a type of non-sampling error that could attenuate the effect of a risk factor on an outcome variable if no correction is made. Therefore, an effect might not be detectable, even if there is one. If a classical error type is present, then the power of the analysis will be lowered or a bigger sample size will be needed in order to maintain the desirable power. Thus, a correction should be made before drawing any conclusions from the analysis. The regression calibration and simulation extrapolation methods are some of the available methods developed to deal with this kind of problem.

This dissertation proposes a Bayesian method that uses a hierarchical approach to jointly model true radon exposure (measurement error model) and its effect on lung cancer (excess odds model). This method takes subject-specific characteristics into account when making the correction, and uses random effects when missing data are present. We carried out a simulation study in order to compare this method to the regression calibration and simulation extrapolation (SIMEX). Different scenarios were simulated and the simulated data were analyzed with the three methods. This is the first time that these three methods have been compared in the context of radon risk assessment.

The simulation results showed that the proposed Bayesian method had a consistent coverage through out the scenarios. However, the SIMEX method had the lowest bias and mean squared error and, most of the time, its coverage was the closest to the nominal coverage of 95\%. The regression calibration was the fastest method to be implemented, but it was outperformed by the other methods.

The dissertation finalizes by performing individual and pooled analyses using data from five case-control North America radon studies (Iowa, Missouri, Winnipeg, Connecticut, and Utah/South Idaho). The data from each study were analyzed individually, first without making any correction, and then using the three correction methods. Finally, the data were combined and the methods were applied to this bigger sample. To the best of our knowledge, regression calibration and SIMEX have not been implemented using this combined dataset.


xii, 100 pages


Includes bibliographical references (pages 97-100).


Copyright © 2017 Keyla Pagán-Rivera

Available for download on Saturday, July 13, 2019

Included in

Biostatistics Commons