Adversarial Strategy Generation Integrating Expert Policies and Multi-Chain-of-Thought Reasoning
DOI:
CSTR:
Author:
Affiliation:

Clc Number:

TP18

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    To address the challenge of enhancing both accuracy and fine-grained decision-making in adversarial settings involving AI agents powered by large language models (LLMs), this study introduces a novel strategy generation approach that integrates expert policies with multi-chain-of-thought (CoT) reasoning. By fusing real-time visual input (images) with structured observational data (text), the method provides LLMs with richer situational awareness. Time-sensitive expert strategies and parallel reasoning chains are embedded into prompt design, significantly improving the agent’s control and tactical precision. Experimental validation in high-difficulty StarCraft II scenarios demonstrates that the method achieves a 95% win rate without any additional model training. Results indicate that the approach enables interpretable and fine-tuned decision outputs in highly dynamic adversarial environments, offering a compelling pathway for leveraging LLMs in strategic behavior generation under competitive conditions.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:April 14,2025
  • Revised:September 09,2025
  • Adopted:September 15,2025
  • Online:
  • Published:
Article QR Code