1.National Innovation Institute of Defense Technology, Academy of Military, Beijing 100071 , China ; 2.The PLA Unit 32806, Beijing 100091 , China ; 3.Xi′an Satellite Control Center, Xi′an 710043 , China
TP181
李艺颖, 周伟. 元学习探索隐变量的强化学习方法[J]. 国防科技大学学报, 2025, 47(5): 197-205.
Copy