面向蛋白质功能预测中有向无环图标记结构的多示例多标记学习
作者:
作者单位:

(1. 南京邮电大学 地理与生物信息学院, 江苏 南京 210023;2. 南京邮电大学 通信与信息工程学院, 江苏 南京 210003;3. 南京仁面集成电路技术有限公司, 江苏 南京 210088;4. 南京叁角加文化发展中心, 江苏 南京 210005)

作者简介:

吴建盛(1979—),男,江西临川人,副教授,博士,硕士生导师,E-mail:jansen@njupt.edu.cn

通讯作者:

中图分类号:

TP301.6

基金项目:

国家自然科学基金资助项目(61872198,61971216);江苏省科技厅基础研究计划面上资助项目(BK20201378)


Multi-instance multi-label learning for labels with directed acyclic graph structures in protein function prediction
Author:
Affiliation:

(1. School of Geographic and Biological Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China;2. School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;3. Nanjing Renmian Integrated Circuit Technology Limited Company, Nanjing 210088, China;4. Nanjing Triangular Plus Culture Development Centre, Nanjing 210005, China)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在多示例多标记学习问题中,标记之间往往是相互关联的,其中有向无环图结构是一种常见的层次关联结构,可见于蛋白质的基因本体学生物学功能预测的应用场景中。针对其标记间的有向无环图结构,提出了一种新的多示例多标记学习算法。算法从原始数据的特征空间训练出所有标记共享的低维子空间,通过随机梯度下降方法来降低模型排序损失,并融入标记间有向无环图结构关系对预测标记进行优化。将该算法应用于多个数据集的蛋白质功能预测中,实验结果表明,该算法具有更高的效率及预测性能。

    Abstract:

    In MIML (multi-instance multi-label learning) tasks, labels are often correlated with each other, and DAG (directed acyclic graph) is a common hierarchically structure which often occurs in the prediction of gene ontology biological functions of proteins. Considering the labels with directed acyclic graph structures in MIML, a novel algorithm named MIMLDAG (multi-instance multi-label directed acyclic graph) was proposed. MIMLDAG trained a low-dimensional subspace of shared labels from the feature space of original datasets, minimized the rank loss by a stochastic gradient descent method, and then incorporated the inner DAG hierarchical structure of labels for optimizing the output labels. MIMLDAG was applied to predict the protein functions in multiple datasets, and the results show that MIMLDAG possesses higher efficiency and predictive performance.

    参考文献
    相似文献
    引证文献
引用本文

吴建盛,唐诗迪,梅德进,等.面向蛋白质功能预测中有向无环图标记结构的多示例多标记学习[J].国防科技大学学报,2022,44(3):23-30.
WU Jiansheng, TANG Shidi, MEI Dejin, et al. Multi-instance multi-label learning for labels with directed acyclic graph structures in protein function prediction[J]. Journal of National University of Defense Technology,2022,44(3):23-30.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-06-22
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2022-06-02
  • 出版日期: 2020-06-28
文章二维码