PCCI’s AI/ML Model to Predict HIV Infection Risk
PCCI is also collaborating with DCHHS and Parkland in a Dallas County public health initiative targeted to adults who are at risk for exposure to HIV through an HIV-STI pre-exposure prophylaxis (PrEP) Initiative.
Dallas County ranks 2nd highest in HIV, 6th in Syphilis, 21st in Gonorrhea, and 26th in Chlamydia infection rates compared to the other 254 Texas counties. PCCI built a successful HIV ML prediction model using a Light Gradient Boosting Machine (LGBM) algorithm. LGBM is an ensemble of decision trees trained sequentially one after the other, improving from the errors of the predecessor to result in a strong boosting classifier.
Overall, this model uses 26 input variables to predict the individuals at increased likelihood of acquiring HIV and who are candidates for HIV PrEP, with the overall goal to both increase awareness and decrease rates of Sexually Transmitted Infections (STIs) (including HIV), to promote public health in Dallas County’s most affected ZIP Codes. This program and its results were published in the peer-reviewed publication, The Journal of Acquired Immune Deficiency Syndrome (JAIDS), co-authored by Arun Nethi, Data & Applied Scientist, and Albert Karam, MS, MBA, VP, Data Strategy Analytics.