Particle swarm optimization based data imputation method for mixed features

Affiliation: Academy of Military Sciences, Beijing 100091, China

Author biographies: LIU Yi (born 1990), male, from Bengbu, Anhui, assistant researcher, Ph.D., E-mail: albertliu20th@163.com. Corresponding author: ZHENG Qibin (born 1990), male, from Lanzhou, Gansu, assistant researcher, Ph.D., E-mail: zhengqibin1990@163.com

CLC number: TP391

Funding: National Natural Science Foundation of China (91948303); Young Scientists Fund of the National Natural Science Foundation of China (61802426)

Abstract:

Traditional data imputation methods make poor use of label information and of the random information in missing data. To address this shortcoming, a particle swarm optimization based imputation method for mixed features is proposed. Values of each continuous feature are modeled as a Gaussian distribution, with the mean and standard deviation as the optimization parameters. For each categorical feature, the probabilities of its values are the optimization parameters. Classification accuracy is used as the optimization objective, so that label information and the random information in missing data are both fully exploited. Four statistics-based methods and two evolutionary algorithm based imputation methods serve as baselines in experiments on six typical classification datasets. The results show that the proposed method significantly outperforms the compared algorithms in classification accuracy while keeping a favorable time overhead, and that it effectively handles missing data with mixed features.
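The encoding described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy data, the leave-one-out 1-NN classifier used as the accuracy measure, the PSO coefficients (`w`, `c1`, `c2`), the initialisation range, and all function names are assumptions chosen for illustration. Each particle holds a mean and standard deviation per continuous feature and one weight per category of each categorical feature; missing entries are sampled from those distributions, and the particle's fitness is the classification accuracy on the imputed data.

```python
import random

# Hypothetical toy data: each row is [continuous, categorical]; None marks a missing value.
X = [[1.0, 0], [1.2, None], [None, 0], [0.9, 0],
     [3.0, 1], [None, 1], [3.2, None], [2.9, 1]]
y = [0, 0, 0, 0, 1, 1, 1, 1]
KINDS = [("cont", None), ("cat", 2)]  # the categorical feature has 2 categories

def dim(kinds):
    # Two parameters (mean, std) per continuous feature, one weight per category.
    return sum(2 if k == "cont" else n for k, n in kinds)

def impute(X, kinds, pos, seed=0):
    """Fill each missing entry by sampling from the distribution encoded in pos."""
    rng = random.Random(seed)  # fixed seed keeps the fitness deterministic (assumption)
    filled = [row[:] for row in X]
    i = 0
    for j, (k, n) in enumerate(kinds):
        if k == "cont":
            mu, sigma = pos[i], abs(pos[i + 1])
            i += 2
            for row in filled:
                if row[j] is None:
                    row[j] = rng.gauss(mu, sigma)
        else:
            weights = [abs(v) + 1e-9 for v in pos[i:i + n]]  # unnormalised probabilities
            i += n
            for row in filled:
                if row[j] is None:
                    row[j] = rng.choices(range(n), weights=weights)[0]
    return filled

def distance(a, b, kinds):
    # Squared difference for continuous features, 0/1 mismatch for categorical ones.
    return sum((u - v) ** 2 if k == "cont" else float(u != v)
               for u, v, (k, _) in zip(a, b, kinds))

def accuracy(X, y, kinds, pos):
    """Leave-one-out 1-NN accuracy on the imputed data (stand-in classifier)."""
    F = impute(X, kinds, pos)
    hits = 0
    for a in range(len(F)):
        nearest = min((b for b in range(len(F)) if b != a),
                      key=lambda b: distance(F[a], F[b], kinds))
        hits += y[nearest] == y[a]
    return hits / len(F)

def pso(X, y, kinds, n_particles=10, iters=20, w=0.7, c1=1.5, c2=1.5, seed=1):
    """Standard inertia-weight PSO that maximises classification accuracy."""
    rng = random.Random(seed)
    d = dim(kinds)
    P = [[rng.uniform(0.0, 4.0) for _ in range(d)] for _ in range(n_particles)]
    V = [[0.0] * d for _ in range(n_particles)]
    pbest = [p[:] for p in P]
    pfit = [accuracy(X, y, kinds, p) for p in P]
    g = max(range(n_particles), key=lambda i: pfit[i])
    gbest, gfit = pbest[g][:], pfit[g]
    for _ in range(iters):
        for i in range(n_particles):
            for k in range(d):
                V[i][k] = (w * V[i][k]
                           + c1 * rng.random() * (pbest[i][k] - P[i][k])
                           + c2 * rng.random() * (gbest[k] - P[i][k]))
                P[i][k] += V[i][k]
            f = accuracy(X, y, kinds, P[i])
            if f > pfit[i]:
                pbest[i], pfit[i] = P[i][:], f
                if f > gfit:
                    gbest, gfit = P[i][:], f
    return gbest, gfit

best_pos, best_acc = pso(X, y, KINDS)
X_filled = impute(X, KINDS, best_pos)
```

Here `best_pos` holds the learned distribution parameters and `X_filled` is the dataset with every missing entry replaced; observed values are left untouched. Fixing the sampling seed inside `impute` is a simplification that makes each particle's fitness repeatable; the paper's abstract does not say how sampling noise is handled.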

Cite this article

LIU Yi, QIN Wei, LI Gengsong, et al. Particle swarm optimization based data imputation method for mixed features[J]. Journal of National University of Defense Technology, 2024, 46(6): 107-112.

History
  • Received: 2022-07-15
  • Published online: 2024-12-02
  • Publication date: 2024-12-28