1.College of Intelligence Science and Technology, National University of Defense Technology, Changsha 410073 , China ;2.National Key Laboratory of Equipment State Sensing and Smart Support, Changsha 410073 , China ; 3.College of Aerospace Science and Engineering, National University of Defense Technology, Changsha 410073 , China
TP249
任君凯, 瞿宇珂, 罗嘉威, 等. 面向长序列自主作业的非对称Actor-Critic强化学习方法[J]. 国防科技大学学报, 2025, 47(4): 111-122.
Copy