Skip to main navigation Skip to search Skip to main content

An Improved Misclassification Simulation Extrapolation (MC-SIMEX) Algorithm

Research output: Contribution to journalArticlepeer-review

Abstract

Misclassification Simulation-Extrapolation (MC-SIMEX) is an established method to correct for misclassification in binary covariates in a model. It involves the use of a simulation component which simulates pseudo-datasets with added degree of misclassification in the binary covariate and an extrapolation component which models the covariate's regression coefficients obtained at each level of misclassification using a quadratic function. This quadratic function is then used to extrapolate the covariate's regression coefficients to a point of “no error” in the classification of the binary covariate under question. However, extrapolation functions are not usually known accurately beforehand and are therefore only approximated versions. In this article, we propose an innovative method that uses the exact (not approximated) extrapolation function through the use of a derived relationship between the naïve regression coefficient estimates and the true coefficients in generalized linear models. Simulation studies are conducted to study and compare the numerical properties of the resulting estimator to the original MC-SIMEX estimator. Real data analysis using colon cancer data from the MSKCC cancer registry is also provided.

Original languageEnglish
Article numbere70418
JournalStatistics in Medicine
Volume45
Issue number3-5
DOIs
StatePublished - Feb 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Algorithms
  • Colonic Neoplasms/epidemiology
  • Computer Simulation
  • Humans
  • Linear Models
  • Models, Statistical
  • Registries

Fingerprint

Dive into the research topics of 'An Improved Misclassification Simulation Extrapolation (MC-SIMEX) Algorithm'. Together they form a unique fingerprint.

Cite this