本文已被:浏览 1506次 下载 3164次
Received:February 22, 2019 Revised:March 08, 2019
Received:February 22, 2019 Revised:March 08, 2019
中文摘要: 野生小麦是异源六倍体,基因组规模较大(约14 GB),且包含大量重复序列.为了培育具有优良性状的新品种,首先要定位控制目标性状的基因,因此建立一个完整准确的基因组注释软件流程至关重要.传统的基因组注释方法基于数据库比对,具有三个明显的缺点:一是比对速度慢;二是难以发现新基因;三是软件选择没有统一标准.本文提出了一种新的生物信息学注释流程,结合了基因数据库比对、转录组高通量测序数据分析、全长转录组单分子测序数据分析等多种技术手段,实现了六倍体小麦科农9204基因组完整准确的注释,为揭示小麦生长发育规律和培育新品种提供了重要参考和软件技术支撑.
Abstract:Wild wheat is a heterologous hexaploid with a large genome size (about 14 GB) and a lot of repetitive sequences. In order to breed new varieties with good traits, we must first locate the genes that control the target traits. Therefore, it is important to establish a complete and accurate genome annotation process. Traditional genomic annotation method based on database alignment has three obvious disadvantages:first, the alignment runs slowly; second, it is difficult to discover new genes; third, there is no uniform standard for software selection. We propose a new analysis process that combines genetic database alignment, transcriptome high-throughput sequencing, and full-length transcriptome single-molecule sequencing data analysis to annotate hexaploid wheat KN9204 completely and accurately. The annotation of the genome provides an important reference and technical support for revealing the growth of wheat and cultivating new varieties.
keywords: genome genome annotation high-throughput sequencing bioinformatics full length transcriptome
文章编号: 中图分类号: 文献标志码:
基金项目:国家重点研发计划(2018YFB0203903,2016YFC0503607);中国科学院信息化专项(XXH13504-08);国家自然科学基金(31771466);青海省科技成果转化专项(2016-SF-127);中国科学院“百人计划”海外引进杰出人才择优支持(牛北方)
引用文本:
祝海栋,李瑞琳,何小雨,赵丹,韩鑫胤,牛北方.六倍体小麦基因组注释流程构建与优化.计算机系统应用,2019,28(8):222-228
ZHU Hai-Dong,LI Rui-Lin,HE Xiao-Yu,ZHAO Dan,HAN Xin-Yin,NIU Bei-Fang.Construction and Optimization of Hexaploid Wheat Genome Annotation Process.COMPUTER SYSTEMS APPLICATIONS,2019,28(8):222-228
祝海栋,李瑞琳,何小雨,赵丹,韩鑫胤,牛北方.六倍体小麦基因组注释流程构建与优化.计算机系统应用,2019,28(8):222-228
ZHU Hai-Dong,LI Rui-Lin,HE Xiao-Yu,ZHAO Dan,HAN Xin-Yin,NIU Bei-Fang.Construction and Optimization of Hexaploid Wheat Genome Annotation Process.COMPUTER SYSTEMS APPLICATIONS,2019,28(8):222-228