MACHINE LEARNING IN PUBLIC HEALTH: PREDICTING ILLICIT DRUG USE AMONG BRAZILIAN ADOLESCENTS
DOI:
https://doi.org/10.53843/bms.v11i15.1148Keywords:
planejamento em saude, aprendizado de máquina, abuso oral de substânciasAbstract
INTRODUCTION: Machine Learning (ML) is a field of artificial intelligence that enables the development of algorithms capable of learning from data, without explicit programming. Advances in these techniques have enabled important applications in healthcare, including risk and behavior prediction. This study proposes the development of a predictive model to understand and predict illicit drug use among Brazilian teenagers aged 13 to 17, based on sociodemographic and behavioral patterns, with the aim of informing more effective public health policies. METHODOLOGY: This is an observational, cross-sectional study based on data from 165,838 students from the 2019 National School Health Survey (PeNSE). Different ML models were compared to identify the one with the best performance in predicting illicit drug use. Predictor variables included gender, age, future plans, alcohol and tobacco use, housing conditions, property ownership, parental educational backgrounds, and family habits. RESULTS: Among the models tested, Logistic Regression presented the highest AUC-ROC (0.90), demonstrating better overall performance. Random Forest, however, was used to assess the importance of variables due to its interpretive robustness. The main factors associated with risk were: alcohol use, maternal educational background, peer and parental support, and parental alcohol consumption. DISCUSSION: The findings confirm the potential of ML in identifying risk patterns, in line with recent studies and national epidemiological data. The inclusion of family and behavioral variables reinforces the relevance of preventive strategies targeted at the school and home environment. CONCLUSION: The application of ML models, especially Logistic Regression, proved to be valid for predicting the risk of illicit drug use in teenagers. These results can guide targeted public policies, prioritizing modifiable risk factors and optimizing the use of public health resources.
References
Nawi AM, Ismail R, Ibrahim F, Hassan MR, Manaf MRA, Amit N, et al. Risk and protective factors of drug abuse among adolescents: a systematic review. BMC Public Health. 2021 Nov 13;21(1):2088. doi: 10.1186/s12889-021-11906-2. PMID: 34774013; PMCID: PMC8590764.
Tinner L, Palmer JC, Lloyd EC, Caldwell DM, MacArthur GJ, Dias K, et al. Individual-, family- and school-based interventions to prevent multiple risk behaviours relating to alcohol, tobacco and drug use in young people aged 8–25 years: a systematic review and meta-analysis. BMC Public Health. 2022 Jun 3;22(1):1111. doi: 10.1186/s12889-022-13072-5. PMID: 35658920; PMCID: PMC9165543.
Afzali MH, Sunderland M, Stewart S, Masse B, Seguin J, Newton N, et al. Machine-learning prediction of adolescent alcohol use: a cross-study, cross-cultural validation. Addiction. 2019 Apr;114(4):662–71. doi: 10.1111/add.14504. PMID: 30461117.
Artero AS. Aprendizado de máquina: conceitos e algoritmos. In: SIBGRAPI 2009 – XXII Conference on Graphics, Patterns and Images. IEEE; 2009. p. 215–28.
Haykin S. Redes neurais: princípios e prática. Porto Alegre: Bookman Editora; 2001.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Mateus Zani De Nadai

This work is licensed under a Creative Commons Attribution 4.0 International License.
User licenses define how readers and the general public can use the article without needing other permissions. The Creative Commons public licenses provide a standard set of terms and conditions that creators and other rights holders can use to share original works of authorship and other material subjects to copyright and certain other rights specified in the public license available at https:// creativecommons.org/licenses/by/4.0/deed.pt_BR. Using the 4.0 International Public License, Brazilian Medical Students (BMS) grants the public permission to use published material under specified terms and conditions agreed to by the journal. By exercising the licensed rights, authors accept and agree to abide by the terms and conditions of the Creative Commons Attribution 4.0 International Public License.