Title of Invention SYSTEMS AND TECHNIQUES FOR SEGMENTATION OF SEQUENTIAL DATA
Abstract An efficient method and associated systems for segmentation of high throughput sequential data, such as genomic datasets. The technique first utilizes dynamic programming to compute the significance for a large number of candidate segments. It then uses tree-based data structures to detect overlapping significant regions and update them simultaneously. Refinement and merging of significant segments are performed at the end to generate the final segmentation.
Publication No. US2014288847(A1) Publication Date 2014.09.25
Application No. US201414216416 Filing Date 2014.03.17
Applicant THE FLORIDA STATE UNIVERSITY RESEARCH FOUNDATION, INC. Inventors Zhang Jinfeng; Dennis Jonathan; Balaji Senthil
Classification No. G06F19/10 Main Classification No. G06F19/10
Agency Agent
Main Claim 1. A method for the segmentation of sequential data comprising the steps of: providing data representing a sequence of measurements or a set of measurements in a sequential order; selecting a representative set of segments from the data; computing a significance measure for each selected segment; detecting overlap between segments using two data structures, wherein the first data structure ranks segments by their significance measure and the second data structure stores the boundaries of each segment for overlap checking and wherein overlapping segments with a lower significance ranking (higher significance measure) than a highest ranked co-overlapping segment are deleted and all undeleted segments are retained as significant segments; and returning the set of retained significant segments.
Address TALLAHASSEE FL US
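
The overlap-detection step of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the significance measure behaves like a p-value (smaller means more significant, matching the claim's "lower ranking (higher significance measure)" wording), reads the deletion rule as a greedy pass from most to least significant, and stands in a binary heap and a sorted list for the tree-based structures mentioned in the abstract. The name select_non_overlapping and the (start, end, p_value) tuple layout are hypothetical.

```python
import bisect
import heapq
from typing import List, Tuple

# Hypothetical layout: (start, end_inclusive, p_value); smaller p_value = more significant.
Segment = Tuple[int, int, float]


def select_non_overlapping(segments: List[Segment]) -> List[Segment]:
    """Keep the most significant segment of every overlapping group (greedy reading)."""
    # Structure 1: min-heap keyed on p-value, so the top-ranked segment pops first.
    heap = [(p, start, end) for start, end, p in segments]
    heapq.heapify(heap)

    # Structure 2: boundaries of retained segments, kept sorted by start position.
    kept: List[Tuple[int, int]] = []
    result: List[Segment] = []

    while heap:
        p, start, end = heapq.heappop(heap)
        i = bisect.bisect_left(kept, (start, end))
        # Retained segments are disjoint, so only the two neighbours can overlap.
        overlaps_left = i > 0 and kept[i - 1][1] >= start
        overlaps_right = i < len(kept) and kept[i][0] <= end
        if overlaps_left or overlaps_right:
            continue  # a higher-ranked overlapping segment exists: delete this one
        kept.insert(i, (start, end))
        result.append((start, end, p))

    return sorted(result)


if __name__ == "__main__":
    candidates = [(0, 10, 0.01), (5, 15, 0.001), (20, 30, 0.05),
                  (12, 18, 0.2), (16, 25, 0.03)]
    print(select_non_overlapping(candidates))
```

Inserting into a Python list costs linear time, so a balanced search tree or interval tree would be the natural replacement for the sorted list when the number of candidate segments is large.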