ResearchPublications

Utilizing machine learning for predicting PrEP use status among sexual and gender minority young adults
Abstract

Pre-exposure prophylaxis (PrEP) is a highly effective biomedical prevention tool for HIV yet remains underutilized among key populations, particularly among young sexual and gender minorities (SGM). Recognizing the popularity of specific dating and social media apps among SGM young adults, we leveraged user data from these platforms to build a machine learning (ML) model that could inform targeted, data-driven interventions aimed at improving PrEP uptake and adherence. We adapted eWellness, an Android mobile app, to passively collect data from research participants capturing mobile app usage, keystroke patterns and logs, and GPS location data between 2021 and 2024. These data were used to train a ML model to predict self-reported PrEP use. Model accuracy was evaluated through F1 scores across different data types and feature combinations. Study protocols were developed in collaboration with community partners and adhered to strict ethical and privacy standards. A total of 82 SGM young adults participated, with 46 (56%) reporting PrEP use at baseline. Our machine learning model demonstrated good predictive accuracy for predicting PrEP use and non-use, achieving an F1 score of 0.84 (PrEP use) and 0.82 (non-use) outcomes when incorporating data from all mobile apps, including messaging, dating, and social media mobile apps. By contrast, predictions based solely on social media mobile app usage, language associated with sexual behavior and substance use risk, or location monitoring demonstrated worse accuracy (F1 scores of 0.79/0.75, 0.70/0.57, and 0.70/0.52, respectively). Additional feature extraction methods, as well as various combinations of these features, were also tested. However, none achieved predictive accuracy as well as the model incorporating all mobile app usage data combined. This study demonstrates the potential of machine learning to accurately predict PrEP use status among SGM young adults. The findings offer a foundation for developing more personalized PrEP promotion strategies, particularly among SGM young adults who use social media and dating apps. Future research should assess the model’s adaptability across diverse SGM subgroups to further inform intervention development.

Download PDF

Full citation:
Boka C, Yonko EA, Beikzadeh M, Kärkkäinen K, Hong C, Sarrafzadeh M, Holloway IW (2026).
Utilizing machine learning for predicting PrEP use status among sexual and gender minority young adults
Prevention Science [Epub 2026 Jan 10]. doi: 10.1007/s11121-025-01872-1.