一种基于TF·IEF模型的在线新闻事件探测方法
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家部委资助项目;国家自然科学基金资助项目(61170158);湖南省自然科学基金资助项目(12JJ5028)


On-line news event detection based on TF·IEF model
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为了提升在线新闻事件探测的性能,提出一种基于TF·IEF模型的在线新闻事件探测方法。该方法受TF·IDF思想的启发,直接计算特征词表征事件的权重,建立新的增量事件模型,并将探测过程分为两个阶段:第一阶段利用Single-Pass将一定时段内收集到的报道聚成微簇;第二阶段将微簇与已有事件进行相似性匹配,然后通过重新计算事件向量实现模型更新。实验结果表明,该方法运算速度快,特征信息丢失少,提高了探测的效率和准确率。

    Abstract:

    According to the characters of web news stream, an on-line news event detection (ONED) method, based on the two-stage clustering, is proposed to solve the problem of repeated matching. A novel incremental event model was established by calculating terms weighting of events directly. Two stages are involved in our method. In the first stage, the similar reports collected in a certain period were clustered into micro-clusters. In the second, the micro-clusters were matched with existed events, and then this method updated the event model. Experiment shows that the proposed method improves the efficiency and accuracy of ONED with lower complexity and less feature information loss. 

    参考文献
    相似文献
    引证文献
引用本文

张辉,李国辉,贾立,等.一种基于TF·IEF模型的在线新闻事件探测方法[J].国防科技大学学报,2013,35(3):55-60.
ZHANG Hui, LI Guohui, JIA Li, et al. On-line news event detection based on TF·IEF model[J]. Journal of National University of Defense Technology,2013,35(3):55-60.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2012-03-05
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2013-07-04
  • 出版日期:
文章二维码