引用本文: | 于龙,尹浩.站点主题结构与导航归纳技术.[J].国防科技大学学报,2012,34(5):90-95.[点击复制] |
YU Long,YIN Hao.Website topic structure and navigation induction[J].Journal of National University of Defense Technology,2012,34(5):90-95[点击复制] |
|
|
|
本文已被:浏览 6729次 下载 6258次 |
站点主题结构与导航归纳技术 |
于龙, 尹浩 |
(解放军理工大学 通信工程学院,江苏 南京 210007)
|
摘要: |
站点主题描述了互联网站点中信息的聚合与分类,体现着信息逻辑结构,是分析站点信息的关键。分析站点逻辑结构是站点设计的逆向过程,为了准确分析站点中的主题,提出了站点主题结构的理论模型,以形式化的方式描述了站点中不同主题的组织形式、逻辑关系及相关性质,为面向主题的网络信息抽取提供必要的理论基础。在此基础上,进一步研究自动构建站点主题结构的技术,提出基于导航的主题结构归纳方法,并进行了算法描述和实验分析。实验结果证明,站点主题结构的理论模型概括了目前大多数站点的主题结构特征,基于导航的主题结构归纳方法能正确地建立站点的主题结构,并具有较快的运行时间。 |
关键词: 站点 主题结构 导航 |
DOI: |
投稿日期:2012-04-03 |
基金项目:国家自然科学基金资助项目(60903042);国家863高技术资助项目(2010AA) |
|
Website topic structure and navigation induction |
YU Long, YIN Hao |
(Institute of Communications Engineering,PLA University of Science and Technology, Nanjing 210007, China)
|
Abstract: |
Website topics, describing aggregation and classification of website information, embodying information logic structure, is crucial for website information analysis. Analysis of logical structure is the reverse process of website design. In order to accurately analyze the site topics, the research proposed a topic structure model describing the organizational forms, logic relations and related properties of different website’s topics in a formal way, providing the necessary theoretical basis for the topic oriented web information extraction. On this basis, navigation-based topic structure induction was proposed with algorithm and experimental analysis to automatically construct topic structure of websites. Experimental results show that topic structure model generalizes most of the site’s topic structural characteristics, while the navigation based topic structure induction can correctly establish the site's topic structure, and has a faster running time. |
Keywords: website topic hierarchy navigation |
|
|
|
|
|