汤普森采样在线设计频率捷变雷达抗干扰策略
作者:
作者单位:

1.安徽大学 电子信息工程学院, 安徽 合肥 230601 ; 2.电子信息系统复杂电磁环境效应国家重点实验室, 河南 洛阳 471003 ;3.中山大学 电子与通信工程学院, 广东 深圳 518107 ; 4.军事科学院 国防科技创新研究院, 北京 100071

作者简介:

吴振华(1993—),男,安徽淮北人,副教授,博士,博士生导师,E-mail:zhwu@ahu.edu.cn

通讯作者:

中图分类号:

TN95

基金项目:

国家自然科学基金资助项目(62201007);中国博士后科学基金资助项目(2020M681992);电子信息系统复杂电磁环境效应国家重点实验室开放课题资助项目(CEMEE2022Z0302B)


Designing online anti-jamming strategy for agile frequency radar via Thompson sampling
Author:
Affiliation:

1.School of Electronic and Information Engineering, Anhui University, Hefei 230601 , China ; 2.State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System, Luoyang 471003 , China ; 3.School of Electronics and Communication Engineering, Sun Yat-sen University, Shenzhen 518107 , China ; 4.National Innovation Institute of Defense Technology, Academy of Military Sciences, Beijing 100071 , China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献()
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在有源干扰动态对抗背景下,基于在线学习理论中多臂赌博机模型,将雷达和干扰机工作频率作为对抗动作空间建模,通过对干扰环境状态不确定性进行多轮脉冲波形发射探索,搭建基于卷积神经网络的频率通道干扰识别器以得到频率通道干扰信念状态后验概率估计,利用汤普森采样求解算法高效求解多臂赌博机模型,实现探索与利用之间的平衡。仿真结果表明,相较于频率随机捷变及深度强化学习策略求解算法,该方法的对抗策略收敛性能更高,可适应动态快变干扰环境,充分发挥雷达波形发射主动方对抗优势。

    Abstract:

    In the context of dynamic countermeasures between radar and active jammer, the working frequency of radar and adversarial jammer were modeled as the combat action space based on the multi-arm bandit model in online learning theory. By exploring the uncertainty of the jamming environment state through multiple-round pulse transmission, a frequency channel jamming recognizer based on a convolutional neural network was constructed to obtain the posterior probability estimation of the belief state of each frequency channel. Then the Thompson sampling algorithm efficiently solved the built multi-arm bandit model, achieving a balance between exploration and exploitation. Simulation results show that compared with random frequency agility and deep reinforcement learning algorithms, the method had higher convergence performance and was more adaptable to dynamic fast-changing jamming environments, which can give full potential to the antagonism advantage of radar active waveform transmission.

    参考文献
    相似文献
    引证文献
引用本文

吴振华, 钱军, 张磊, 等. 汤普森采样在线设计频率捷变雷达抗干扰策略[J]. 国防科技大学学报, 2025, 47(5): 206-215.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-06-16
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-10-08
  • 出版日期:
文章二维码