Abstract: Understanding how large language models (LLMs) generalize across diverse traffic scenarios is critical for advancing autonomous driving systems. While previous studies have validated LLMs’ ...