发明名称 PREDICTION/CLASSIFICATION METHOD AND DEVICE FOR PROTEIN CODE AREA
摘要 <P>PROBLEM TO BE SOLVED: To provide an information processing method for predicting a protein code area regardless of a biological kind based on the normalized difference value of a codon 2 base pair, and for executing preservation, homologous level retrieval, classification in a state that data are compressed in order to check the structure and function of the area, and for finding out a homologous array from an unknown genome based on the predicated code area. <P>SOLUTION: A protein code area is discovered by using the facts that GA appears in the leading 2 bases of codon with the highest frequency (N1), that TG appears with the lowest frequency (N3), that TG appears in the tail 2 bases with the highest frequency (N4), and that the GA appears with the lowest frequency (N2). Therefore, the partial array of a reading frame where the appearance frequency (N1+N4-(N2+N3)) is made the largest is found out, and when the start codon and the end codon exist in the partial array, the protein code area can be estimated regardless of the biological kind. Those series of inputs, arithmetic operations, and outputs can be easily performed by using an information processor. In addition, the homologous level calculation is operated by using the normalized difference value of the 2 base pair so that the certainty of the code area can be estimated and classified. <P>COPYRIGHT: (C)2003,JPO
申请公布号 JP2003196287(A) 申请公布日期 2003.07.11
申请号 JP20010403147 申请日期 2001.12.27
申请人 ADACHI RIICHI 发明人 ADACHI RIICHI
分类号 C12N15/09;G06F17/30 主分类号 C12N15/09
代理机构 代理人
主权项
地址