Title of invention: Method for predicting regulatory elements in repetitive sequences using transcription factor binding sites
Abstract: Repeat sequences are the most abundant sequences in the extragenic regions of genomes, and a large number of regulatory elements are also found in these regions. The invention seeks to mine rules describing how combinations of individual binding sites are distributed in repeat sequences. The mined association rules would facilitate identifying gene classes regulated by similar mechanisms and accurately predicting regulatory elements. Combinations of transcription factor binding sites in the repeat sequences are obtained, and data mining techniques are applied to derive association rules from these combinations. The associations are then pruned to remove insignificant ones, yielding a set of discovered associations. The discovered association rules are used to partially classify the repeat sequences in the repeat sequence database.
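The mining and pruning steps described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the function name `mine_pair_rules`, the support and confidence thresholds, and the toy input are all assumptions made for illustration, and simple threshold filtering stands in for whatever significance-based pruning the invention actually uses. Each repeat sequence is represented as the set of binding-site names found in it, and only rules between pairs of sites are considered.

```python
from collections import Counter
from itertools import combinations


def mine_pair_rules(sequences, min_support=0.5, min_confidence=0.8):
    """Mine rules 'site A -> site B' from per-repeat binding-site sets.

    sequences: dict mapping a repeat id to the set of binding-site names
    found in that repeat (hypothetical input format).
    Returns a sorted list of (lhs, rhs, support, confidence) tuples.
    """
    n = len(sequences)
    item_counts = Counter()
    pair_counts = Counter()
    for sites in sequences.values():
        item_counts.update(sites)
        # Sort so each unordered pair is counted under one canonical key.
        pair_counts.update(combinations(sorted(sites), 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue  # prune pairs that co-occur too rarely
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_counts[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)


# Toy data, invented for illustration only (not taken from the patent).
repeats = {
    "rep1": {"SP1", "AP1", "GATA1"},
    "rep2": {"SP1", "AP1"},
    "rep3": {"SP1", "GATA1"},
    "rep4": {"SP1", "AP1"},
}
rules = mine_pair_rules(repeats)
for lhs, rhs, support, confidence in rules:
    print(f"{lhs} -> {rhs}  support={support:.2f}  confidence={confidence:.2f}")
```

With this toy input, only the rules whose left-hand site always co-occurs with the right-hand site survive the confidence threshold. Repeats could then be grouped by which surviving rules they satisfy, which is one possible reading of the partial classification step.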
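Publication number: US2003068617(A1) Publication date: 2003.04.10
Application number: US20010829291 Filing date: 2001.04.09
Applicant: HORNG JORNG-TZONG;CHAO WEN-FU Inventor: HORNG JORNG-TZONG;CHAO WEN-FU
Classification: C12Q1/68;G06F19/00;(IPC1-7):C12Q1/68 Main classification: C12Q1/68
Agency: Agent:
Main claim:
Address: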