发明名称 TRANSCRIPTOME ASSEMBLY METHOD AND SYSTEM
摘要 Provided is a transcriptome assembly method, comprising the following steps of: constructing a sequencing sample transcriptome read into a de Brujin graph; performing filtering and linearization processing on the de Brujin graph, so as to form continuous contigs; obtaining association among the contigs, and filtering association data; performing linearization processing on a continuous sequence without bifurcation; outputting a contig sequence; comparing the read and an end pairing read with the output contig sequence, so as to obtain information between the read and the contig; establishing connections among the contigs, so as to construct a graph with the contigs as points and the connections as edges; pre-processing and dividing the obtained graph, so as to obtain independent sub-graphs; and outputting a transcript according to the sub-graphs. Further provided is a transcriptome assembly system based on the method.
申请公布号 US2015120204(A1) 申请公布日期 2015.04.30
申请号 US201214394135 申请日期 2012.04.13
申请人 Wu Gengxiong;Huang Weihua;Xie Yinlong;Tang Jingbo;Wang Jun;Wang Jian;Yang Huanming 发明人 Wu Gengxiong;Huang Weihua;Xie Yinlong;Tang Jingbo;Wang Jun;Wang Jian;Yang Huanming
分类号 G06F19/22;G06F19/20 主分类号 G06F19/22
代理机构 代理人
主权项 1. A method for contig assembly, comprising following steps: (1) constructing a de Brujin graph based on transcriptomic reads obtained from a sample; (2) subjecting the de Brujin graph obtained in the step (1) to a first filtration and a first linearization, to form continuous contigs; (3) obtaining a connection relationship among the contigs, and subjecting the connection relationship to a second filtration; (4) subjecting continuous contigs without a fork to a second linearization; (5) repeating step the (3) and the step (4) until a sequence presents no changes, to obtain the sequence assembling with contigs.
地址 Shenzhen CN