发明名称 METHOD AND SYSTEM FOR LOWERING TIME COMPLEXITY IN SHORT SEQUENCES ASSEMBLY
摘要 <p>The invention is applicable to the technical field of gene engineering, and provides a method for lowering time complexity in short sequences assembly and a system thereof. The method comprises the following steps: receiving sequencing sequences; respectively processing base sliding cutting to the received sequencing sequences one by one to obtain short strings with constant base length and to obtain left and right connection relations of the short strings; and storing the sequence values of the obtained all short strings, left and right connection relations and connection amount as one node of a de Bruijn graph, using a hash table to store the nodes of the de Bruijn graph, the hash key is the sequence value, the hash value is the node. Because of using the de Bruijn graph and applying the hash table for storing, it makes updating the connection relation of the nodes to be equal to searching nodes and updating the connection amount of bases having left and right connections for searched nodes. Thus, the searching and adding nodes and updating the connection relations of nodes can be finished during the time of 0(1). The lowering time complexity in the short sequences assembly can be realized and the short sequences of large genome can be assembled.</p>
申请公布号 WO2010066115(A1) 申请公布日期 2010.06.17
申请号 WO2009CN01427 申请日期 2009.12.11
申请人 SHENZHEN HUADA GENE INSTITUTE;LI, RUIQIANG;ZHU, HONGMEI;LI, SONGGANG;WANG, JUN;YANG, HUANMING;WANG, JIAN 发明人 LI, RUIQIANG;ZHU, HONGMEI;LI, SONGGANG;WANG, JUN;YANG, HUANMING;WANG, JIAN
分类号 C12Q1/68;G06F19/22 主分类号 C12Q1/68
代理机构 代理人
主权项
地址