引用本文: | 杜耀华,倪青山,王正志.大肠杆菌转录起始位点的计算定位方法.[J].国防科技大学学报,2006,28(4):88-92.[点击复制] |
DU Yaohua,NI Qingshan,WANG Zhengzhi.Computational Localization of Transcription Start Sites in Escherichia Coli Genomic Sequences[J].Journal of National University of Defense Technology,2006,28(4):88-92[点击复制] |
|
|
|
本文已被:浏览 7151次 下载 6364次 |
大肠杆菌转录起始位点的计算定位方法 |
杜耀华, 倪青山, 王正志 |
(国防科技大学 机电工程与自动化学院,湖南 长沙 410073)
|
摘要: |
根据已有的启动子识别算法,提出了一种基于滑动窗口的大肠杆菌转录起始位点(TSS)计算定位方法,通过在启动子信号特征中引入复合模式来改进识别分类器,并将其用于滑动窗口序列,在合理限定的TSS定位范围内依次计算各个序列位置的TSS似然得分,再利用TSS与翻译起始位点(TLS)的距离分布信息作为TSS的位置得分,两者相结合来进行位置预测。对大肠杆菌真实数据的测试表明,算法可以大幅度减少假阳性结果,实现对真实TSS位置的有效预测。 |
关键词: 大肠杆菌 转录起始位点 计算定位 复合模式 滑动窗口 |
DOI: |
投稿日期:2006-02-28 |
基金项目:国家自然科学基金资助项目(60471003) |
|
Computational Localization of Transcription Start Sites in Escherichia Coli Genomic Sequences |
DU Yaohua, NI Qingshan, WANG Zhengzhi |
(College of Mechatronics Engineering and Automation, National Univ. of Defense Technology, Changsha 410073, China)
|
Abstract: |
Although a large number of researches have been undertaken in the area of transcription start site (TSS) localization, the problem of TSS localization has not yet been fully resolved. According to the previous promoter prediction algorithm, a new sliding window based computational localization method for E. coli TSSs is proposed. The TSS-likelihood scores of each possible position in genomic sequences are calculated by the window classifier which is improved by introducing the composite motif model in the training procedure of original promoter classifier. The distribution of distances between TSSs and translation start sites (TLSs) is also utilized to calculate the TSS-position scores. Localization results are achieved from the final score profiles which combine TSS-likelihood scores and TSS-position scores. The test results on E. coli dataset show that the method can find the putative TSSs and decrease the number of false positives efficiently. |
Keywords: escherichia coli transcription start site (TSS) computational localization composite motif sliding window |
|
|