发明名称 |
FEATURE VECTOR-BASED METHOD FOR REMOVING REDUNDANCY IN A TRAINING DATASET |
摘要 |
PURPOSE: A repetition removal method of training data based on a feature vector is provided to include a feature of protein and an RNA(RiboNucleic Acid) sequence in the feature vector as a composition element, thereby effectively predicting an RNA-combination amino acid existed in a protein sequence. CONSTITUTION: A protein-RNA combination part determines an RNA-combination amino acid which interacts with RNA using a hydrogen bond(S10). A tendency property of an amino acid triplet is calculated(S20). Various features of protein and RNA sequences are coded in a feature vector in order to predict the RNA-combination amino acid(S30). A training data set, which removes the unnecessary repetition of data, is constructed based on the coded feature vector(S40). [Reference numerals] (S10) Step of determining a protein-RNA combination part; (S20) Step of calculating a tendency property of an amino acid triplet; (S30) Step of coding in a feature vector; (S40) Step of constructing a training data set;
|
申请公布号 |
KR20130035732(A) |
申请公布日期 |
2013.04.09 |
申请号 |
KR20110100228 |
申请日期 |
2011.09.30 |
申请人 |
INHA-INDUSTRY PARTNERSHIP INSTITUTE |
发明人 |
HAN, KYUNG SOOK;CHOI, SUNG WOOK |
分类号 |
G06F19/10 |
主分类号 |
G06F19/10 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|