Clustering is one of the focused problems in multimedia data mining, and similarity measurement among data is fundamental to clustering. In multimedia data clustering, the corresponding vector features are always of high dimensionality. Most traditional measurement methods, however, are only efficient for low dimensional data. This paper, based on an analysis of general characteristics of data presented in high dimensional spaces, proposes a new similarity measurement for multimedia data mining. It used a special strategy to split the original data space before computing the similarity among data points, thus efficiently avoiding the influence of noisy data in high dimensional dimensional spaces. Experiments show that the new method presented is effective.
参考文献
相似文献
引证文献
引用本文
贺玲,吴玲达,蔡益朝,等.多媒体数据挖掘中数据间的相似性度量研究[J].国防科技大学学报,2006,28(1):77-80. HE Ling, WU Lingda, CAI Yichao, et al. Research on Similarity Measurement in Multimedia Data Mining[J]. Journal of National University of Defense Technology,2006,28(1):77-80.