Particle swarm optimization based data imputation method for mixed features
CSTR:
Author:
Affiliation:

(Academy of Military Sciences, Beijing 100091, China)

Clc Number:

TP391

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Aiming at the deficiency of traditional data imputation methods in effectively using the label information and random characteristics of missing data, a particle swarm optimization based imputation method for mixed features was proposed. The value of continuous feature was modeled as Gaussian distribution, and the mean and standard deviation were used as optimization parameters. The value probability of categorical features was optimized as a parameter. The classification accuracy rate was used as the optimization target to make full use of random information of label information and missing data. Four statistical methods and two evolutionary algorithm based imputation methods were used to compare the results on six typical classification datasets. The results show that the proposed method significantly outperforms other comparison algorithms in terms of classification accuracy indicator, and has better time overhead at the same time, which can effectively solve the data missing problems of mixed features.

    Reference
    Related
    Cited by
Get Citation

LIU Yi, QIN Wei, LI Gengsong, LIU Kun, WANG Qiang, ZHENG Qibin, REN Xiaoguang. Particle swarm optimization based data imputation method for mixed features[J]. Journal of National University of Defense Technology,2024,46(6):107-112.

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:July 15,2022
  • Revised:
  • Adopted:
  • Online: December 02,2024
  • Published: December 28,2024
Article QR Code