Title of invention: System and method for determining three-dimensional structure of protein sequences
Abstract: The present invention pertains to a system and method for predicting the protein fold of a target amino acid residue sequence of unknown protein structure. A target sequence is represented by a sequence of residue variability types that utilizes positional variability information present in an associated family of sequences homologous to the target sequence. The use of the positional variability information increases the likelihood of matching the target sequence with a known protein structure. In a first preferred embodiment, a target sequence is mapped into a sequence of residue variability types that are based on the solubility variability present between amino acid residues in homologous sequences. In a second preferred embodiment, each residue variability type represents a cluster of residue types at each position of aligned sets of homologous protein sequences. Each distinct cluster represents a pattern of residue variability at various positions in sets of homologous protein sequences. The sequence of residue variability types is aligned with one or more environment strings, each of which represents a known protein structure in accordance with the degree of surface exposure for each amino acid position in the protein's structure. The alignment is performed using a threading procedure that determines a score for each alignment indicating the compatibility of the sequence to the structure. The protein structure associated with the highest score is deemed to be the most analogous structure to the target sequence.
Publication number: US5878373(A) Publication date: 1999.03.02
Application number: US19960761724 Filing date: 1996.12.06
Applicant: REGENTS OF THE UNIVERSITY OF CALIFORNIA Inventors: COHEN, FRED E.;DEFAY, THOMAS R.
Classification: C07K1/00;G06F17/30;G06F19/00;(IPC1-7):G06F19/00;G06F17/00 Main classification: C07K1/00
Agency: Agent:
Principal claim:
Address:
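
The pipeline outlined in the abstract (map aligned homologous sequences to residue variability types, thread that string against environment strings, keep the highest-scoring structure) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the three-letter type alphabet (`H`, `P`, `V`), the hydrophobic residue set, the two-letter environment alphabet (`B` for buried, `E` for exposed), the `SCORES` table values, the linear gap penalty, and the use of a plain global dynamic-programming alignment as the threading step.

```python
# Illustrative sketch only; alphabets, scores, and gap penalty are hypothetical.

# Assumed set of hydrophobic residues (one-letter codes).
HYDROPHOBIC = set("AVLIMFWC")

# Assumed compatibility scores for (variability type, environment) pairs.
SCORES = {
    ("H", "B"): 2, ("H", "E"): -2,   # consistently hydrophobic column
    ("P", "B"): -2, ("P", "E"): 2,   # consistently polar column
    ("V", "B"): 0, ("V", "E"): 1,    # column that varies in solubility
}


def variability_types(aligned_seqs):
    """Map each column of an alignment of homologous sequences to a type."""
    types = []
    for column in zip(*aligned_seqs):
        residues = [r for r in column if r != "-"]  # ignore gap characters
        flags = {r in HYDROPHOBIC for r in residues}
        if flags == {True}:
            types.append("H")
        elif flags == {False}:
            types.append("P")
        else:
            types.append("V")  # mixed (or all-gap) column
    return "".join(types)


def thread_score(types, env, gap=-1):
    """Best global alignment score of a type string against an environment string."""
    prev = [j * gap for j in range(len(env) + 1)]
    for i, t in enumerate(types, 1):
        curr = [i * gap]
        for j, e in enumerate(env, 1):
            curr.append(max(
                prev[j - 1] + SCORES[(t, e)],  # align type with environment
                prev[j] + gap,                 # gap in the environment string
                curr[j - 1] + gap,             # gap in the type string
            ))
        prev = curr
    return prev[-1]


def best_fold(types, library):
    """Return the name of the structure whose environment string scores highest."""
    return max(library, key=lambda name: thread_score(types, library[name]))


if __name__ == "__main__":
    family = ["LKAE", "IRVD", "VKLE"]            # toy aligned homologs
    library = {"fold_a": "BEBE", "fold_b": "EBEB"}  # toy environment strings
    types = variability_types(family)
    print(types, best_fold(types, library))
```

In this toy run the family maps to the type string `HPHP`, which matches the buried/exposed pattern of `fold_a` position by position, so `fold_a` is selected. A realistic scoring table would be derived from known structures rather than set by hand.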