期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
RF-PSSM:A Combination of Rotation Forest Algorithm and Position-Specific Scoring Matrix for Improved Prediction of Protein-Protein Interactions Between Hepatitis C Virus and Human
1
作者 Xin Liu Yaping Lu +3 位作者 Liang Wang Wei Geng Xinyi Shi Xiao Zhang 《Big Data Mining and Analytics》 EI CSCD 2023年第1期21-31,共11页
The identification of hepatitis C virus(HCV)virus-human protein interactions will not only help us understand the molecular mechanisms of related diseases but also be conductive to discovering new drug targets.An incr... The identification of hepatitis C virus(HCV)virus-human protein interactions will not only help us understand the molecular mechanisms of related diseases but also be conductive to discovering new drug targets.An increasing number of clinically and experimentally validated interactions between HCV and human proteins have been documented in public databases,facilitating studies based on computational methods.In this study,we proposed a new computational approach,rotation forest position-specific scoring matrix(RF-PSSM),to predict the interactions among HCV and human proteins.In particular,PSSM was used to characterize each protein,two-dimensional principal component analysis(2DPCA)was then adopted for feature extraction of PSSM.Finally,rotation forest(RF)was used to implement classification.The results of various ablation experiments show that on independent datasets,the accuracy and area under curve(AUC)value of RF-PSSM can reach 93.74% and 94.29%,respectively,outperforming almost all cutting-edge research.In addition,we used RF-PSSM to predict 9 human proteins that may interact with HCV protein E1,which can provide theoretical guidance for future experimental studies. 展开更多
关键词 protein-protein interactions hepatitis C virus position specific scoring matrix two-dimensional principal component analysis rotation forest
原文传递
Improvements in the score matrix calculation method using parallel score estimating algorithm
2
作者 Geraldo F.D.Zafalon Evandro A.Marucci +3 位作者 Julio C.Momente Jose R.A.Amazonas Liria M.Sato Jose M.Machado 《Journal of Biophysical Chemistry》 2013年第2期47-51,共5页
The increasing amount of sequences stored in genomic databases has become unfeasible to the sequential analysis. Then, the parallel computing brought its power to the Bioinformatics through parallel algorithms to alig... The increasing amount of sequences stored in genomic databases has become unfeasible to the sequential analysis. Then, the parallel computing brought its power to the Bioinformatics through parallel algorithms to align and analyze the sequences, providing improvements mainly in the running time of these algorithms. In many situations, the parallel strategy contributes to reducing the computational complexity of the big problems. This work shows some results obtained by an implementation of a parallel score estimating technique for the score matrix calculation stage, which is the first stage of a progressive multiple sequence alignment. The performance and quality of the parallel score estimating are compared with the results of a dynamic programming approach also implemented in parallel. This comparison shows a significant reduction of running time. Moreover, the quality of the final alignment, using the new strategy, is analyzed and compared with the quality of the approach with dynamic programming. 展开更多
关键词 ALGORITHMS scoring matrix Parallel Programming Alignment Quality
下载PDF
New event detection based on sorted subtopic matching algorithm
3
作者 翟东海 CUI Jing-jing +1 位作者 NIE Hong-yu DU Jia 《Journal of Chongqing University》 CAS 2013年第4期179-186,共8页
How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paperpresents a new event detection method for the current new event det... How to quickly and accurately detect new topics from massive data online becomes a main problem of public opinion monitoring in cyberspace. This paperpresents a new event detection method for the current new event detection system, based on sorted subtopic matching algorithm and constructs the entire design framework. In this p^per, the subtopics contained in old topics (or news stories) are sorted in descending order according to their importance to the topic(or news stories), and form a sorted subtopic sequence. In the process of subtopic matching, subtopic scoring matrix is used to determine whether a new story is reporting a new event. Experimental results show that the sorted subtopic matching model improved the accuracy and effectiveness ofthenew event detection system in cyberspace. 展开更多
关键词 new event detection topic detection scoring matrix sorted subtopic matching model subtopic sequence
下载PDF
Entropy-based procedures for intuitionistic fuzzy multiple attribute decision making 被引量:6
4
作者 Xu Zeshui~(1,2) & Hu Hui~3 1.School of Economics and Management,Southeast Univ.,Nanjing 210096,P.R.China 2.Inst.of Sciences,PLA Univ.of Sciences and Technology,Nanjing 210007,P.R.China 3.Inst.of Communications Engineering,PLA Univ.of Sciences and Technology, Nanjing 210007,P.R.China 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第5期1001-1011,共11页
The class of multiple attribute decision making (MADM) problems is studied, where the attribute values are intuitionistic fuzzy numbers, and the information about attribute weights is completely unknown. A score fun... The class of multiple attribute decision making (MADM) problems is studied, where the attribute values are intuitionistic fuzzy numbers, and the information about attribute weights is completely unknown. A score function is first used to calculate the score of each attribute value and a score matrix is constructed, and then it is transformed into a normalized score matrix. Based on the normalized score matrix, an entropy-based procedure is proposed to derive attribute weights. Furthermore, the additive weighted averaging operator is utilized to fuse all the normalized scores into the overall scores of alternatives, by which the ranking of all the given alternatives is obtained. This paper is concluded by extending the above results to interval-valued intuitionistic fuzzy set theory, and an illustrative example is also provided. 展开更多
关键词 multiple attribute decision making intuitionistic fuzzy number score matrix ENTROPY additive weighted averaging operator.
下载PDF
Active motif finder-a bio-tool based on mutational structures in DNA sequences
5
作者 Mani Udayakumar Palaniyandi Shanmuga-priya +1 位作者 Kamalakannan Hemavathi Rengasamy Seenivasagam 《The Journal of Biomedical Research》 CAS 2011年第6期444-448,共5页
Active Motif Finder (AMF) is a novel algorithmic tool, designed based on mutations in DNA sequences. Tools available at present for finding motifs are based on matching a given motif in the query sequence. AMF descr... Active Motif Finder (AMF) is a novel algorithmic tool, designed based on mutations in DNA sequences. Tools available at present for finding motifs are based on matching a given motif in the query sequence. AMF describes a new algorithm that identifies the occurrences of patterns which possess all kinds of mutations like insertion, deletion and mismatch. The algorithm is mainly based on the Alignment Score Matrix (ASM) computation by com paring input motif with full length sequence. Much of the effort in bioinformatics is directed to identify these motifs in the sequences of newly discovered genes. The proposed bio-tool serves as an open resource for analysis and useful for studying polymorphisms in DNA sequences. AMF can be searched via a user-friendly interface. This tool is intended to serve the scientific community working in the areas of chemical and structural biology, and is freely available to all users, at http://www.sastra.edu/scbt/amf/. 展开更多
关键词 MUTATIONS alignment score matrix back track INDELS pattern occurrence DNA sequences.
下载PDF
Protein domain boundary prediction by combining support vector machine and domain guess by size algorithm
6
作者 董启文 Wang +2 位作者 Xiaolong Lin Lei 《High Technology Letters》 EI CAS 2007年第1期74-78,共5页
Successful prediction of protein domain boundaries provides valuable information not only for the computational structure prediction of muhi-domain proteins but also for the experimental structure determination. A nov... Successful prediction of protein domain boundaries provides valuable information not only for the computational structure prediction of muhi-domain proteins but also for the experimental structure determination. A novel method for domain boundary prediction has been presented, which combines the support vector machine with domain guess by size algorithm. Since the evolutional information of multiple domains can be detected by position specific score matrix, the support vector machine method is trained and tested using the values of position specific score matrix generated by PSI-BLAST. The candidate domain boundaries are selected from the output of support vector machine, and are then inputted to domain guess by size algorithm to give the final results of domain boundary, prediction. The experimental results show that the combined method outperforms the individual method of both support vector machine and domain guess by size. 展开更多
关键词 domain boundary prediction support vector machine domain guess by size positionspecific score matrix
下载PDF
PCA for predicting quaternary structure of protein
7
作者 Tong WANG Hongbin SHEN +2 位作者 Lixiu YAO Jie YANG Kuochen CHOU 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2008年第4期376-380,共5页
The number and arrangement of subunits that form a protein are referred to as quaternary structure.Knowing the quaternary structure of an uncharacterized protein provides clues to finding its biological function and i... The number and arrangement of subunits that form a protein are referred to as quaternary structure.Knowing the quaternary structure of an uncharacterized protein provides clues to finding its biological function and interaction process with other molecules in a biological system.With the explosion of protein sequences generated in the Post-Genomic Age,it is vital to develop an automated method to deal with such a challenge.To explore this prob-lem,we adopted an approach based on the pseudo position-specific score matrix(Pse-PSSM)descriptor,proposed by Chou and Shen,representing a protein sample.The Pse-PSSM descriptor is advantageous in that it can combine the evolution information and sequence-correlated informa-tion.However,incorporating all these effects into a descriptor may cause‘high dimension disaster’.To over-come such a problem,the fusion approach was adopted by Chou and Shen.A completely different approach,linear dimensionality reduction algorithm principal component analysis(PCA)is introduced to extract key features from the high-dimensional Pse-PSSM space.The obtained dimension-reduced descriptor vector is a compact repre-sentation of the original high dimensional vector.The jack-knife test results indicate that the dimensionality reduction approach is efficient in coping with complicated problems in biological systems,such as predicting the quaternary struc-ture of proteins. 展开更多
关键词 principal component analysis(PCA) qua-ternary structure of protein pseudo position-specific score matrix(Pse-PSSM) dimension reduction method
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部