Title of invention |
Gene finding using ordered sets of distinct marker strings |
Abstract |
A method and system are disclosed for finding, in a DNA sequence, a gene represented by an ordered set of marker strings. Sub-strings in the DNA sequence that match each marker string are identified. The score and position of each sub-string whose score satisfies a matching constraint are recorded in a set ordered by the occurrence of the marker strings in the ordered set of marker strings. For each marker string except the last, directed links are created between each identified sub-string that matches the marker string and any identified sub-strings that match the subsequent marker string, provided the directed links satisfy an inter-marker length constraint. Using the directed links, all paths are traced that connect each identified sub-string matching the first marker string to an identified sub-string matching the last marker string. The paths satisfy a sequence length constraint and are stored in the memory of a computer system. |
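The steps in the abstract can be sketched in Python as follows. This is only an illustrative reading, not the patented implementation: the scoring function (fraction of identical bases), the threshold `min_score`, the gap limit `max_gap`, and the span limit `max_length` are all assumptions chosen for the example, since the abstract does not define the matching, inter-marker length, or sequence length constraints in detail. Marker strings are assumed to be non-empty.

```python
from typing import List, Tuple


def match_score(window: str, marker: str) -> float:
    """Assumed score: fraction of positions where window and marker agree."""
    return sum(a == b for a, b in zip(window, marker)) / len(marker)


def find_hits(dna: str, marker: str, min_score: float) -> List[Tuple[int, float]]:
    """Record (position, score) for every sub-string meeting the matching constraint."""
    hits = []
    for pos in range(len(dna) - len(marker) + 1):
        score = match_score(dna[pos:pos + len(marker)], marker)
        if score >= min_score:
            hits.append((pos, score))
    return hits


def find_gene_paths(dna: str, markers: List[str], min_score: float,
                    max_gap: int, max_length: int) -> List[List[int]]:
    """Return paths of start positions, one position per marker, in marker order."""
    if not markers:
        return []
    # One hit list per marker, kept in the order of the marker set.
    hits = [find_hits(dna, m, min_score) for m in markers]

    # links[i][j] lists indices into hits[i + 1] reachable from hits[i][j].
    # Assumed inter-marker constraint: the next hit starts at or after the end
    # of the current hit, with at most max_gap bases in between.
    links = []
    for i in range(len(markers) - 1):
        layer = []
        for pos, _ in hits[i]:
            end = pos + len(markers[i])
            layer.append([k for k, (nxt, _) in enumerate(hits[i + 1])
                          if 0 <= nxt - end <= max_gap])
        links.append(layer)

    paths: List[List[int]] = []

    def trace(i: int, j: int, prefix: List[int]) -> None:
        path = prefix + [hits[i][j][0]]
        if i == len(markers) - 1:
            # Assumed sequence length constraint: total span from the first
            # marker start to the last marker end is at most max_length.
            span = path[-1] + len(markers[-1]) - path[0]
            if span <= max_length:
                paths.append(path)
            return
        for k in links[i][j]:
            trace(i + 1, k, path)

    for j in range(len(hits[0])):
        trace(0, j, [])
    return paths
```

For instance, `find_gene_paths("ATGATGCCTAG", ["ATG", "TAG"], 1.0, 5, 11)` follows both `ATG` hits to the single `TAG` hit, while tightening `max_gap` to 2 keeps only the path starting at the second `ATG`.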
Publication number |
US8738299(B2) |
Publication date |
2014.05.27 |
Application number |
US20060534439 |
Filing date |
2006.09.22 |
Applicant |
HUSSAN JAGIR R.;JHONEY ALBEE;INTERNATIONAL BUSINESS MACHINES CORPORATION |
Inventor |
HUSSAN JAGIR R.;JHONEY ALBEE |
Classification |
G06G7/58;C12Q1/68;G01N31/00;G01N33/48;G01N33/50;G06F19/00;G06F19/12;G06F19/22 |
Main classification |
G06G7/58 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|