引用本文: | 杨甲森,孟新,王春梅.卫星遥测数据相关性知识发现方法.[J].国防科技大学学报,2019,41(5):71-78.[点击复制] |
YANG Jiasen,MENG Xin,WANG Chunmei.Correlation knowledge discovery method for satellite telemetry data[J].Journal of National University of Defense Technology,2019,41(5):71-78[点击复制] |
|
|
|
本文已被:浏览 6816次 下载 5130次 |
卫星遥测数据相关性知识发现方法 |
杨甲森1,2, 孟新1, 王春梅1 |
(1.中国科学院 国家空间科学中心 复杂航天系统电子信息技术重点实验室, 北京 100190;2.中国科学院大学, 北京 100049)
|
摘要: |
为快速发现海量遥测数据中的相关关系,提出一种基于改进最大信息系数(Maximal Information Coefficient, MIC)的遥测数据相关性知识发现方法。以Mini Batch K-Means聚类算法为前驱过程对数据进行网格划分;计算该网格划分下的互信息,并以信息熵代替原有最大熵对互信息进行归一化矫正得到信息系数;选择不同网格划分下MIC作为变量相关性的测度。采用量子卫星遥测数据进行试验,结果表明:与基于动态规划算法的MIC方法相比,所提方法可有效解决MIC测度偏向多值变量的问题,时间复杂度从O(n2.4)下降为O(n1.6),是一种适用于大规模遥测数据相关性分析的有效方法。 |
关键词: Mini Batch K-Means 信息熵 最大信息系数 遥测数据 相关性 量子卫星 |
DOI:10.11887/j.cn.201905011 |
投稿日期:2018-06-12 |
基金项目:中国科学院空间科学战略性先导专项资助项目(XDA04080201);中国科学院复杂航天系统电子信息技术重点实验室开放基金资助项目(N201708) |
|
Correlation knowledge discovery method for satellite telemetry data |
YANG Jiasen1,2, MENG Xin1, WANG Chunmei1 |
(1. Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center,Chinese Academy of Sciences, Beijing 100190, China;2. University of Chinese Academy of Sciences, Beijing 100049, China)
|
Abstract: |
To discover correlations in massive telemetry data efficiently, a novel correlation knowledge discovery method based on the improved MIC (maximal information coefficient) was proposed. The Mini Batch K-Means clustering algorithm was used to discretize data in the precursor process; the mutual information between two variables under this partition was calculated and normalized by information entropy instead of maximal entropy to obtain the information coefficient; the MIC was selected as the measure of variable correlation. Aflerwards, the method was applied to the correlation analysis of the quantum satellite telemetry data, and the results show that the proposed method can effectively solve the problem of MIC measure bias to multi valued variables compared with the method based on dynamic programming algorithm, the time complexity dropped from O(n2.4)to O(n1.6), and it is an effective method for large-scale telemetry data correlation analysis. |
Keywords: Mini Batch K-Means information entropy maximal information coefficient telemetry data correlation quantum satellite |
|
|