发明名称 Methods and computer program products for compression of sequencing data
摘要 A compression method includes measuring a waveform associated with a chemical event occurring on a sensor array, wherein the waveform comprises at least one region associated with expected measured values and at least one region associated with unpredictable measured values; applying a first compression process to the waveform, the first compression process including an averaging of one or more frames in one or more portions of the waveform; and applying a second compression process to the waveform, the second compression process including a truncating of data corresponding to a portion of the waveform that is not related to a nucleotide incorporation component of the waveform.
申请公布号 US9515676(B2) 申请公布日期 2016.12.06
申请号 US201313754566 申请日期 2013.01.30
申请人 Life Technologies Corporation 发明人 Sugnet Charles;Cawley Simon;Gupta Mohit;Marjanovic Iztok;Rearick Todd;Beauchemin Mark;Hubbell Earl
分类号 G06F19/24;G11C17/00;H03M7/30 主分类号 G06F19/24
代理机构 代理人
主权项 1. A compression method, comprising: measuring a waveform associated with a chemical event occurring on a sensor array, the measuring including digitizing voltage signals using an analog to digital converter to produce a plurality of frames of measured values for the waveform, the voltage signals generated by the sensor array in response to the chemical event, wherein the chemical event is indicative of a number of nucleotide incorporations in a genetic sequencing reaction, wherein the waveform comprises at least one region associated with expected measured values and at least one region associated with unpredictable measured values; applying a first compression process to the waveform using a processor, the first compression process including an averaging of one or more frames in one or more portions of the waveform to form frame-averaged data, wherein a number of frames of frame-averaged data is less than a number of frames in the plurality of frames of measured values; applying a keyframe delta compression to the frame-averaged data using the processor, wherein the keyframe delta compression comprises calculating a difference between a current frame of the frame-averaged data and a previous frame of the frame-averaged data associated with the waveform; forming a compressed data structure including a keyframe of the frame-averaged data and a plurality of the calculated differences subsequent to the keyframe, wherein the compressed data structure represents the keyframe and the plurality calculated differences in a number of bytes that is less than an original number of bytes representing the frame-averaged data; determining compression information corresponding to one or more compressed data structures; storing the compression information and the one or more compressed data structures in a memory; and applying a second compression process to the waveform using the processor, the second compression process including a truncating of data corresponding to a portion of the waveform that is not related to a nucleotide incorporation component of the waveform.
地址 Carlsbad CA US