Title of invention: Gene finding using ordered sets of distinct marker strings
Abstract: A method and system are disclosed for finding, in a DNA sequence, a gene represented by an ordered set of marker strings. Sub-strings in the DNA sequence that match each marker string are identified. The score and position of each sub-string whose score satisfies a matching constraint are recorded in a set ordered by the occurrence of the marker strings in the ordered set of marker strings. For each marker string except the last, directed links are created between each identified sub-string that matches the marker string and any identified sub-strings that match the subsequent marker string, provided the directed links satisfy an inter-marker length constraint. Using the directed links, all paths are traced that connect each identified sub-string matching the first marker string to an identified sub-string matching the last marker string. The paths that satisfy a sequence length constraint are stored in the memory of a computer system.
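The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not define the scoring function or the exact form of the constraints, so the sketch assumes a simple identity score (fraction of matching characters in a window the same length as the marker), a gap range between consecutive matches as the inter-marker length constraint, and a maximum overall span as the sequence length constraint. The names `find_hits`, `find_gene_paths`, `min_score`, `min_gap`, `max_gap`, and `max_span` are hypothetical.

```python
from typing import List, Tuple

Hit = Tuple[int, float]  # (start position in the DNA sequence, match score)


def find_hits(dna: str, marker: str, min_score: float) -> List[Hit]:
    """Record position and score of every window whose score meets min_score.

    Assumption: the score is the fraction of identical characters between
    the marker and an equally long window; markers are non-empty.
    """
    m = len(marker)
    hits = []
    for pos in range(len(dna) - m + 1):
        window = dna[pos:pos + m]
        score = sum(a == b for a, b in zip(window, marker)) / m
        if score >= min_score:  # assumed form of the matching constraint
            hits.append((pos, score))
    return hits


def find_gene_paths(dna: str, markers: List[str], min_score: float = 1.0,
                    min_gap: int = 0, max_gap: int = 50,
                    max_span: int = 200) -> List[List[int]]:
    """Return every chain of match positions, one position per marker, in order."""
    if not markers:
        return []

    # hits[i] holds the matches for markers[i], kept in marker order.
    hits = [find_hits(dna, mk, min_score) for mk in markers]

    # links[i][a] lists indices into hits[i + 1] reachable from hits[i][a].
    links = []
    for i in range(len(markers) - 1):
        level = []
        for pos, _ in hits[i]:
            end = pos + len(markers[i])
            # Assumed inter-marker constraint: gap between the end of this
            # match and the start of the next lies in [min_gap, max_gap].
            level.append([b for b, (nxt, _) in enumerate(hits[i + 1])
                          if min_gap <= nxt - end <= max_gap])
        links.append(level)

    paths: List[List[int]] = []

    def trace(i: int, a: int, path: List[int]) -> None:
        path = path + [hits[i][a][0]]
        if i == len(markers) - 1:
            # Assumed sequence length constraint: total span of the candidate.
            span = path[-1] + len(markers[-1]) - path[0]
            if span <= max_span:
                paths.append(path)
            return
        for b in links[i][a]:
            trace(i + 1, b, path)

    for a in range(len(hits[0])):
        trace(0, a, [])
    return paths


if __name__ == "__main__":
    print(find_gene_paths("ATGCCCGGGTTTTAA", ["ATG", "GGG", "TAA"]))  # [[0, 6, 12]]
```

Each returned path is a list of start positions, one per marker string. Lowering `min_score` below 1.0 admits approximate matches, and tightening `max_gap` or `max_span` prunes candidates whose markers lie too far apart.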
Publication number: US8738299(B2) Publication date: 2014.05.27
Application number: US20060534439 Filing date: 2006.09.22
Applicant: HUSSAN JAGIR R.; JHONEY ALBEE; INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor: HUSSAN JAGIR R.; JHONEY ALBEE
Classification: G06G7/58; C12Q1/68; G01N31/00; G01N33/48; G01N33/50; G06F19/00; G06F19/12; G06F19/22 Main classification: G06G7/58
Agency: Agent:
Principal claim:
Address: