MACHINE LEARNING IN PUBLIC HEALTH: PREDICTING ILLICIT DRUG USE AMONG BRAZILIAN ADOLESCENTS
DOI:
https://doi.org/10.53843/bms.v11i15.1148Keywords:
planejamento em saude, aprendizado de máquina, abuso oral de substânciasAbstract
INTRODUCTION: Machine Learning (ML) is a field of artificial intelligence that enables the development of algorithms capable of learning from data, without explicit programming. Advances in these techniques have enabled important applications in healthcare, including risk and behavior prediction. This study proposes the development of a predictive model to understand and predict illicit drug use among Brazilian teenagers aged 13 to 17, based on sociodemographic and behavioral patterns, with the aim of informing more effective public health policies. METHODOLOGY: This is an observational, cross-sectional study based on data from 165,838 students from the 2019 National School Health Survey (PeNSE). Different ML models were compared to identify the one with the best performance in predicting illicit drug use. Predictor variables included gender, age, future plans, alcohol and tobacco use, housing conditions, property ownership, parental educational backgrounds, and family habits. RESULTS: Among the models tested, Logistic Regression presented the highest AUC-ROC (0.90), demonstrating better overall performance. Random Forest, however, was used to assess the importance of variables due to its interpretive robustness. The main factors associated with risk were: alcohol use, maternal educational background, peer and parental support, and parental alcohol consumption. DISCUSSION: The findings confirm the potential of ML in identifying risk patterns, in line with recent studies and national epidemiological data. The inclusion of family and behavioral variables reinforces the relevance of preventive strategies targeted at the school and home environment. CONCLUSION: The application of ML models, especially Logistic Regression, proved to be valid for predicting the risk of illicit drug use in teenagers. These results can guide targeted public policies, prioritizing modifiable risk factors and optimizing the use of public health resources.
References
1. Tinner L, Palmer JC, Lloyd EC, Caldwell DM, MacArthur GJ, Dias K, et al. Individual-, family- and school-based interventions to prevent multiple risk behaviours relating to alcohol, tobacco and drug use in young people aged 8–25 years: a systematic review and meta-analysis. BMC Public Health. 2022;22(1):1111. doi:10.1186/s12889-022-13072-5.
2. Nawi AM, Ismail R, Ibrahim F, Hassan MR, Manaf MRA, Amit N, et al. Risk and protective factors of drug abuse among adolescents: a systematic review. BMC Public Health. 2021;21(1):2088. doi:10.1186/s12889-021-11906-2.
3. Instituto Brasileiro de Geografia e Estatística (IBGE). Pesquisa Nacional de Saúde do Escolar: 2019. Rio de Janeiro: IBGE; 2021. Disponível em: https://biblioteca.ibge.gov.br/visualizacao/livros/liv101852.pdf. Acesso em: 20 ago 2025.
4. Artero AS. Aprendizado de máquina: conceitos e algoritmos. In: XXII Conference on Graphics, Patterns and Images – SIBGRAPI. IEEE; 2009. p. 215–28.
5. Haykin S. Redes neurais: princípios e prática. 2a ed. Porto Alegre: Bookman; 2001.
6. Silva IN, Spatti DH, Flauzino RA. Redes neurais artificiais para engenharia e ciências aplicadas: curso prático. São Paulo: Artliber; 2010.
7. Afzali MH, Sunderland M, Stewart S, Massé B, Séguin JR, Newton N, et al. Machine-learning prediction of adolescent alcohol use: a cross-study, cross-cultural validation. Addiction. 2019;114(4):662–71. doi:10.1111/add.14504.
8. Vellido A. The importance of interpretability and visualization in machine learning for applications in medicine and health care. Neural Comput Appl. 2020;32:18069–83. doi:10.1007/s00521-019-04051-w.
9. Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med. 2019;380(14):1347–58. doi:10.1056/NEJMra1814259.
10. Beam AL, Kohane IS. Big data and machine learning in health care. JAMA. 2018;319(13):1317–8. doi:10.1001/jama.2017.18391.
11. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. doi:10.1038/s41591-018-0300-7.
12. Patil RC, Tavaragi MS, Sushma C. Inhalant abuse in adolescents in North Karnataka: a case series. J Psychiatry Spectrum. 2022;1(2):133–5. doi:10.4103/jopsys.jopsys_1_22.
13. Karlovšek MZ, Alibegović A, Balažic J. Our experiences with fatal ecstasy abuse (two case reports). Forensic Sci Int. 2005;147 Suppl:S77–80. doi:10.1016/j.forsciint.2004.09.084.
14. Fineschi V, Masti A. Fatal poisoning by MDMA (ecstasy) and MDEA: a case report. Int J Legal Med. 1996;108(5):272–5. doi:10.1007/BF01369826.
15. Salehi F, Hassanzadeh Taheri MM, Riasi H, Mehrpour O. Recurrent syncope following substance abuse: a case report. Emergency. 2017;5(1):e47. PMID:28286854; PMCID:PMC5325918.
16. Hoorn EJ; Paediatric Death Review Committee, Office of the Chief Coroner of Ontario. A fatal case of ecstasy poisoning. Paediatr Child Health. 2001;6(7):491. doi:10.1093/pch/6.7.491.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Mateus Zani De Nadai

This work is licensed under a Creative Commons Attribution 4.0 International License.
User licenses define how readers and the general public can use the article without needing other permissions. The Creative Commons public licenses provide a standard set of terms and conditions that creators and other rights holders can use to share original works of authorship and other material subjects to copyright and certain other rights specified in the public license available at https:// creativecommons.org/licenses/by/4.0/deed.pt_BR. Using the 4.0 International Public License, Brazilian Medical Students (BMS) grants the public permission to use published material under specified terms and conditions agreed to by the journal. By exercising the licensed rights, authors accept and agree to abide by the terms and conditions of the Creative Commons Attribution 4.0 International Public License.