Title of invention |
Computer method and apparatus for uniform representation of genome sequences |
Abstract |
A method and apparatus transform text string representations (i.e., sequences) of biological fragments, which typically differ in length, into uniform-length representations. A comparison database stores a predefined number of known biological sequences. A comparison routine compares and scores a subject sequence against each known sequence in the database. Each individual score (one per known sequence in the database) serves as one element of a fixed-length vector representation of the subject sequence, so the vector length equals the predefined number of known biological sequences in the database. Each score is either a probability or an occurrence count of the known biological sequence in the subject sequence.
|
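The transformation described in the abstract can be illustrated with a minimal sketch. It assumes the occurrence-count scoring variant and counts exact, possibly overlapping substring matches; the patent record does not specify the matching rule, and the names `count_occurrences`, `to_fixed_length_vector`, and the contents of `DATABASE` are hypothetical, chosen only for illustration.

```python
from typing import List


def count_occurrences(subject: str, known: str) -> int:
    """Count exact, possibly overlapping occurrences of `known` in `subject`.

    Assumption: exact substring matching stands in for the comparison routine.
    """
    if not known:
        return 0
    count = 0
    start = subject.find(known)
    while start != -1:
        count += 1
        # Advance by one character so overlapping matches are also counted.
        start = subject.find(known, start + 1)
    return count


def to_fixed_length_vector(subject: str, database: List[str]) -> List[int]:
    """Score `subject` against every known sequence; one vector element per entry."""
    return [count_occurrences(subject, known) for known in database]


# Hypothetical comparison database with a predefined number (4) of known sequences.
DATABASE = ["ACG", "GT", "TTT", "CA"]

short_vec = to_fixed_length_vector("ACGT", DATABASE)
long_vec = to_fixed_length_vector("ACGTACGTTTTCA", DATABASE)

print(short_vec)  # [1, 1, 0, 0]
print(long_vec)   # [2, 2, 2, 1]
```

Both subject sequences, although of different lengths, map to vectors whose length equals the number of database entries. The probability-based scoring variant mentioned in the abstract would replace `count_occurrences` with a routine that returns a probability instead of a count.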
Publication number |
US7047137(B1) |
Publication date |
2006.05.16 |
Application number |
US20000724269 |
Filing date |
2000.11.28 |
Applicant |
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. |
Inventors |
KASIF SIMON;LOGAN BETH T.;MORENO PEDRO J.;SUZEK BARIS |
Classification |
G06F19/00;C12Q1/68;G01N33/48 |
Main classification |
G06F19/00 |
Agency |
|
Agent |
|
Main claim |
|
Address |
|