Evaluating Large Language Models: Frameworks and Methodologies for AI/ML System Testing. International Journal of Scientific Research and Management (IJSRM), [S. l.], v. 12, n. 09, p. 1467–1486, 2024. DOI: 10.18535/ijsrm/v12i09.ec08. Available at: https://ijsrm.net/index.php/ijsrm/article/view/5693. Accessed on: 17 May 2025.