The Efficiency of Ranking Count Data with Excess Zeros

Deborah A. Kanda, Jingjing Yin

Research output: Contribution to conferencePresentation

Abstract

Data from public health studies often include count end-points that exhibit excess zeros and depending on the study objectives, hurdle or zero-inflated models are used to model such data. In this study, we propose to apply a sampling scheme that is based on ranking, which significantly reduces the sample size and thus study cost for count data with excess zero. The appeal of ranked set sampling is its ability to give more precise estimation than simple random sampling as ranked set samples (RSS) are more likely to span the full range of the population. Intensive simulations are conducted to compare the proposed sampling method using RSS with simple random samples (SRS), comparing the mean squared error (MSE), bias, variance, and power of the RSS with the SRS under various data generating scenarios. We also illustrate the merits of RSS on a real data set with excess zeros using data from the National Medical Expenditure Survey on demand for medical care. Results from data analysis and simulation study coincide and show the RSS outperforming the SRS in all cases, with the RSS showing smaller variances and MSE compared to the SRS.

Original languageAmerican English
StatePublished - Mar 25 2018
EventEastern North American Region International Biometric Society Spring Meeting (ENAR) -
Duration: Mar 25 2018 → …

Conference

ConferenceEastern North American Region International Biometric Society Spring Meeting (ENAR)
Period03/25/18 → …

Keywords

  • Count Data
  • Excess Zero

DC Disciplines

  • Biostatistics
  • Public Health

Fingerprint

Dive into the research topics of 'The Efficiency of Ranking Count Data with Excess Zeros'. Together they form a unique fingerprint.

Cite this