A Tool for Benchmarking Large Language Models' Robustness in Assessing the Realism of Driving Scenarios

A Tool for Benchmarking Large Language Models' Robustness in Assessing the Realism of Driving Scenarios

Authors: J. Wu, C. Lu, A. Arrieta and S. Ali
Status: Published
Publication type: Proceedings Refereed
Year of publication: 2025
Journal: 2nd ACM/IEEE International Conference on AI-powered Software (AIware 2025)
Publisher: ACM/IEEE
Citation key: 18493

Google Scholar BibTex