PT - JOURNAL ARTICLE AU - Christina G Bracamontes AU - Thelma Carrillo AU - Jane Montealegre AU - Leonid Fradkin AU - Michele Follen AU - Zuber D Mulla TI - Analysis of count data in the setting of cervical cancer detection AID - 10.1136/jim-2020-001381 DP - 2020 Aug 01 TA - Journal of Investigative Medicine PG - 1196--1198 VI - 68 IP - 6 4099 - http://hw-f5-jim.highwire.org/content/68/6/1196.short 4100 - http://hw-f5-jim.highwire.org/content/68/6/1196.full SO - J Investig Med2020 Aug 01; 68 AB - Women with an abnormal Pap smear are often referred to colposcopy, a procedure during which endocervical curettage (ECC) may be performed. ECC is a scraping of the endocervical canal lining. Our goal was to compare the performance of a naïve Poisson (NP) regression model with that of a zero-inflated Poisson (ZIP) model when identifying predictors of the number of distress/pain vocalizations made by women undergoing ECC. Data on women seen in the colposcopy clinic at a medical school in El Paso, Texas, were analyzed. The outcome was the number of pain vocalizations made by the patient during ECC. Six dichotomous predictors were evaluated. Initially, NP regression was used to model the data. A high proportion of patients did not make any vocalizations, and hence a ZIP model was also fit and relative rates (RRs) and 95% CIs were calculated. AIC was used to identify the best model (NP or ZIP). Of the 210 women, 154 (73.3%) had a value of 0 for the number of ECC vocalizations. NP identified three statistically significant predictors (language preference of the subject, sexual abuse history and length of the colposcopy), while ZIP identified one: history of sexual abuse (yes vs no; adjusted RR=2.70, 95% CI 1.47 to 4.97). ZIP was preferred over NP. ZIP performed better than NP regression. Clinicians and epidemiologists should consider using the ZIP model (or the zero-inflated negative binomial model) for zero-inflated count data.