A Tool for Benchmarking Large Language Models' Robustness in Assessing the Realism of Driving Scenarios

A Tool for Benchmarking Large Language Models' Robustness in Assessing the Realism of Driving Scenarios

Authors: J. Wu
Status: Published
Publication type: Presentation
Year of publication: 2025
Journal: 2nd ACM/IEEE International Conference on AI-powered Software (AIware 2025)
Publisher: ACM/IEEE
Citation key: 18494

Google Scholar BibTex