引用本文: | 周敏,王少尉.频谱感知次序的在线最优选择.[J].国防科技大学学报,2020,42(4):24-29.[点击复制] |
ZHOU Min,WANG Shaowei.Online optimal selection of spectrum sensing order[J].Journal of National University of Defense Technology,2020,42(4):24-29[点击复制] |
|
|
|
本文已被:浏览 6922次 下载 5450次 |
频谱感知次序的在线最优选择 |
周敏,王少尉 |
(南京大学 电子科学与工程学院, 江苏 南京 210023)
|
摘要: |
动态频谱接入是解决无线电频谱资源短缺和频谱使用效率低下问题的有效方法,它允许次级用户在授权频谱空闲时动态地接入, 以进行数据传输。而频谱感知是实现动态频谱接入的关键挑战之一。由于次级用户的感知能力有限,为了获得更多的频谱接入机会,需要尽快找到频谱空闲概率最大的频段,并研究频谱感知次序问题。考虑到频谱空闲概率对次级用户是不可知的,并且会随时间变化,提出了在线学习框架,把频谱感知次序问题归纳成经典多摇臂赌博机问题,并利用在线学习方法——满意折现汤普森抽样算法处理优化问题。仿真结果表明,和其他算法相比,所提算法可以获得更多的频谱接入机会并且能够跟踪频谱空闲概率的变化。 |
关键词: 动态频谱接入 频谱感知 在线学习 满意折现汤普森抽样 |
DOI:10.11887/j.cn.202004005 |
投稿日期:2019-12-25 |
基金项目:国家自然科学基金资助项目(61671233,61801208, 61931023, U1936202) |
|
Online optimal selection of spectrum sensing order |
ZHOU Min, WANG Shaowei |
(School of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China)
|
Abstract: |
Dynamic spectrum access is deemed as an effective solution to the radio spectrum scarcity and spectrum usage in efficiency problem, which allows secondary users to access the spectrum dynamically for data transmission when the licensed spectrum is idle. However, spectrum sensing is one of the key challenges for dynamic spectrum access. Since the secondary user was equipped with limited sensing capability, in order to obtain more spectrum access opportunities, the spectrum sensing order problem was investigated to find the frequency band with the highest probability of being idle as soon as possible. Considering that the probability of the spectrum being idle was not available for the secondary users and changes over time, an online learning framework in which the spectrum sensing order problem was formulated as a classical multi-armed bandit problem was proposed, and it was addressed by using an online learning method, referred to as satisficing discounted Thompson sampling. Simulation results indicate that compared with other algorithms, the proposed algorithm yields more spectrum opportunities and can track the changes of the probability of the spectrum being idle. |
Keywords: dynamic spectrum access spectrum sensing online learning satisficing discounted Thompson sampling |
|
|
|
|
|