发明名称 Systems and methods for processing nucleic acid sequence data
摘要 The present disclosure provides systems and methods for nucleic acid sequence analysis. A system for processing raw nucleic acid sequence data from a genomic sequencer comprises a data processing server having a housing contained therein one or more processing modules. The one or more processing modules can each comprise an electronic control unit programmed to align nucleic acid sequence data from a genomic sequencing device and perform one or more of variant analysis and structural variant analysis on the nucleic acid sequence data. The system can further comprise a computer server in communication with the processing server. The computer server can be programmed or otherwise configured to process and/or analyze the aligned nucleic acid sequence data.
申请公布号 US9600625(B2) 申请公布日期 2017.03.21
申请号 US201313868985 申请日期 2013.04.23
申请人 BINA TECHNOLOGIES, INC. 发明人 Asadi Narges Bani;Chong Jike;Chen Henry;Mohiyuddin Marghoob;Doupnik Austin
分类号 G06F19/00;G06F19/22 主分类号 G06F19/00
代理机构 Kilpatrick, Townsend & Stockton LLP 代理人 Kilpatrick, Townsend & Stockton LLP
主权项 1. A system for processing genetic sequence data from a genomic sequencing device, the system comprising: an input for receiving the genetic sequence data from a sequencing of a sample from a human subject by the genomic sequencing device; a plurality of nodes connected to each other, each node comprising: a computer server including one or more processors;a programmable logic device having an alignment engine that is programmed to perform alignment of the genetic sequence data to reference sequences retrieved by a memory controller from a first memory, thereby obtaining aligned genetic sequence data;a communication bus between the computer server and the programmable logic device for transferring sequence reads of the genetic sequence data from the computer server to the alignment engine and for transferring instructions to the memory controller configured to retrieve the reference sequences from the first memory; andnetwork connections to other nodes for transferring aligned reads identified by an alignment demultiplexer, wherein the alignment demultiplexer is configured to identify which node is responsible for a genomic region corresponding to an aligned read,wherein the computer server is configured to: sort, based on location, aligned reads generated by the node and received from other nodes, thereby obtaining a sorted list of aligned reads,split the sorted list into N sorted regions, andperform a multi-threaded execution of a variant analysis and a structural variant analysis on the aligned reads of the N sorted regions, each thread operating on aligned reads of one of the N sorted regions; wherein the computer servers are configured to perform the variant analysis and the structural variant analysis on said aligned genetic sequence data having at least about thirty times coverage of a human genome, wherein the alignment, variant analysis, and the structural analysis is performed in a time period less than or equal to about 4 hours.
地址 Redwood City CA US