Abstract: In 3 vs. 3 online basketball games, finite state machine (FSM)-based Game artificial intelligence (AI) has traditionally been employed. However, limitations such as repetitive behavior ...
Abstract: In this article, we present BenchING, a new benchmark for evaluating large language models (LLMs) on their ability to follow structured output format instructions in text-based procedural ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results