TY  - JOUR
T1  - Analysis of count data in the setting of cervical cancer detection
JF  - Journal of Investigative Medicine
JO  - J Investig Med
SP  - 1196
LP  - 1198
DO  - 10.1136/jim-2020-001381
VL  - 68
IS  - 6
AU  - Christina G Bracamontes
AU  - Thelma Carrillo
AU  - Jane Montealegre
AU  - Leonid Fradkin
AU  - Michele Follen
AU  - Zuber D Mulla
Y1  - 2020/08/01
UR  - http://hw-f5-jim.highwire.org/content/68/6/1196.abstract
N2  - Women with an abnormal Pap smear are often referred to colposcopy, a procedure during which endocervical curettage (ECC) may be performed. ECC is a scraping of the endocervical canal lining. Our goal was to compare the performance of a naïve Poisson (NP) regression model with that of a zero-inflated Poisson (ZIP) model when identifying predictors of the number of distress/pain vocalizations made by women undergoing ECC. Data on women seen in the colposcopy clinic at a medical school in El Paso, Texas, were analyzed. The outcome was the number of pain vocalizations made by the patient during ECC. Six dichotomous predictors were evaluated. Initially, NP regression was used to model the data. A high proportion of patients did not make any vocalizations, and hence a ZIP model was also fit and relative rates (RRs) and 95% CIs were calculated. AIC was used to identify the best model (NP or ZIP). Of the 210 women, 154 (73.3%) had a value of 0 for the number of ECC vocalizations. NP identified three statistically significant predictors (language preference of the subject, sexual abuse history and length of the colposcopy), while ZIP identified one: history of sexual abuse (yes vs no; adjusted RR=2.70, 95% CI 1.47 to 4.97). ZIP was preferred over NP. ZIP performed better than NP regression. Clinicians and epidemiologists should consider using the ZIP model (or the zero-inflated negative binomial model) for zero-inflated count data.
ER  - 
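
The abstract's comparison of NP and ZIP by AIC can be illustrated with a minimal sketch. This is not the authors' analysis: it fits intercept-only models with no covariates, uses a small invented set of counts rather than the study data, and estimates the ZIP parameters with a simple EM loop. The function names (`fit_poisson`, `fit_zip`, `aic`) and the data are assumptions made for illustration only.

```python
import math

def poisson_loglik(counts, lam):
    """Log-likelihood of an intercept-only Poisson model with rate lam."""
    return sum(y * math.log(lam) - lam - math.lgamma(y + 1) for y in counts)

def zip_loglik(counts, pi, lam):
    """Log-likelihood of an intercept-only ZIP model.

    pi is the probability of a structural zero; lam is the Poisson rate.
    """
    total = 0.0
    for y in counts:
        pois = math.exp(y * math.log(lam) - lam - math.lgamma(y + 1))
        if y == 0:
            total += math.log(pi + (1 - pi) * pois)
        else:
            total += math.log((1 - pi) * pois)
    return total

def fit_poisson(counts):
    """The Poisson maximum-likelihood rate is the sample mean."""
    return sum(counts) / len(counts)

def fit_zip(counts, iterations=500):
    """Estimate (pi, lam) for a ZIP model with a basic EM loop."""
    n = len(counts)
    n_zero = sum(1 for y in counts if y == 0)
    total = sum(counts)
    if total == 0 or n_zero == 0:
        raise ValueError("sketch assumes both zero and positive counts")
    pi, lam = 0.5, total / (n - n_zero)
    for _ in range(iterations):
        # E-step: probability that an observed zero is a structural zero.
        z = pi / (pi + (1 - pi) * math.exp(-lam))
        expected_structural = n_zero * z
        # M-step: update the mixing weight and the Poisson rate.
        pi = expected_structural / n
        lam = total / (n - expected_structural)
    return pi, lam

def aic(loglik, n_params):
    """Akaike information criterion: lower values indicate a better fit."""
    return 2 * n_params - 2 * loglik

# Invented, illustrative counts with many zeros (NOT the study data).
counts = [0] * 22 + [1, 1, 2, 2, 3, 3, 4, 5]

lam_np = fit_poisson(counts)
pi_hat, lam_zip = fit_zip(counts)

print("NP  AIC:", aic(poisson_loglik(counts, lam_np), 1))
print("ZIP AIC:", aic(zip_loglik(counts, pi_hat, lam_zip), 2))
```

On data with excess zeros such as these, the ZIP model is expected to yield the lower AIC despite its extra parameter. A regression analysis like the one described in the abstract would additionally model the rate (and possibly the zero-inflation probability) as functions of the predictors, typically with a statistics package.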