Title of invention: PROTEIN FUNCTIONAL AND SUB-CELLULAR ANNOTATION IN A PROTEOME
Abstract: Techniques are disclosed for identifying the likely functionality and sub-cellular localization of individual proteins. First, a protein-protein interaction network is created, in which protein pairs are built from database records and experimental results, and potential interacting pairs are inferred where no data exist. Within each protein pair, mutual annotations of likely functionality and localization are made using the known functionalities and localizations of the two proteins. The resulting annotated proteins are clustered according to the similarity of their annotations. Within each cluster, iterative mutual annotation inside each protein pair enriches the previous functional annotations until no further functionality annotations can be made, yielding proteins with at least one assigned functionality and localization pair. The resulting assignments are ranked by their specificity and confidence.
Publication number: US2017076036(A1) Publication date: 2017.03.16
Application number: US201615361461 Filing date: 2016.11.27
Applicant: Theofilatos Konstantinos;Dimitrakopoulos Christos;Mavroudi Seferina;Korfiati Aigli;Alexakos Christos Inventor: Theofilatos Konstantinos;Dimitrakopoulos Christos;Mavroudi Seferina;Korfiati Aigli;Alexakos Christos
Classification: G06F19/18;G06N99/00;G06N7/00 Main classification: G06F19/18
Agency: Agent:
Main claim: 1. A method of predicting the functionality of the proteome of an organism, comprising: constructing a plurality of interacting protein pairs, where said plurality of interacting protein pairs are either weighted or un-weighted; assigning a first set of functionalities to proteins in said protein pairs, where each protein is assigned either at least one functionality or no functionality; clustering said proteins into at least one cluster using at least a first criterion; iteratively assigning at least a second set of functionalities to the proteins of the at least one cluster, where said second assignment is done by pairwise comparison of all interacting proteins in a cluster and where the first protein is assigned at least one functionality of the second protein, or no assignment is made if the second protein has no assigned functionality, and the second protein is assigned at least one functionality of the first protein, or no assignment is made if the first protein has no assigned functionality, and where said assignment of the second set of functionalities continues until either all proteins of said proteome have been assigned at least one functionality or no new functionality assignment can be made; assigning confidence values to said functionality assignments; comparing said confidence values with a first threshold; and keeping said confidence values that are larger than or equal to the first threshold and rejecting said confidence values that are smaller than the first threshold.
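The iterative assignment and thresholding steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions made only for illustration: the function name `propagate_functions`, the choice to pass the clusters in as ready-made sets (the clustering criterion itself is not modeled), pair weights in the range (0, 1], a confidence of 1.0 for every initially known functionality, and a confidence for a transferred functionality equal to the source confidence multiplied by the pair weight. The claim does not specify how confidence values are computed.

```python
from collections import defaultdict


def propagate_functions(pairs, initial, clusters, threshold=0.5):
    """Sketch of pairwise functionality transfer inside clusters.

    pairs:     dict mapping (protein_a, protein_b) -> weight in (0, 1]
               (use 1.0 for every pair to model an un-weighted network)
    initial:   dict mapping protein -> set of known functionalities
    clusters:  list of sets of proteins (assumed to be computed elsewhere)
    threshold: assignments with confidence below this value are rejected
    """
    # conf[protein][functionality] -> confidence value.
    # Assumption: known annotations start with confidence 1.0.
    conf = defaultdict(dict)
    for protein, funcs in initial.items():
        for func in funcs:
            conf[protein][func] = 1.0

    for cluster in clusters:
        # Only interacting pairs with both members in the cluster are compared.
        edges = [(a, b, w) for (a, b), w in pairs.items()
                 if a in cluster and b in cluster]
        changed = True
        while changed:  # stop when no new or improved assignment can be made
            changed = False
            for a, b, w in edges:
                # Mutual annotation: a receives from b, and b receives from a.
                for src, dst in ((a, b), (b, a)):
                    for func, c in list(conf[src].items()):
                        new_c = c * w  # hypothetical confidence rule
                        if new_c > conf[dst].get(func, 0.0):
                            conf[dst][func] = new_c
                            changed = True

    # Keep assignments with confidence >= threshold, reject the rest.
    return {protein: {f: c for f, c in funcs.items() if c >= threshold}
            for protein, funcs in conf.items()}


if __name__ == "__main__":
    example_pairs = {("P1", "P2"): 0.9, ("P2", "P3"): 0.5, ("P4", "P5"): 0.8}
    example_initial = {"P1": {"kinase"}}
    example_clusters = [{"P1", "P2", "P3"}, {"P4", "P5"}]
    print(propagate_functions(example_pairs, example_initial, example_clusters))
```

In this sketch a protein with no assigned functionality contributes nothing to its partner, which mirrors the "no assignment is made" branch of the claim, and the loop ends once a full pass over a cluster produces no change.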
Address: PATRA GR