Detecting the boundaries of protein domains is an important and challenging task in both experimental and computational structural biology. In this paper, a promising method for detecting the domain structure of a pro...Detecting the boundaries of protein domains is an important and challenging task in both experimental and computational structural biology. In this paper, a promising method for detecting the domain structure of a protein from sequence information alone is presented. The method is based on analyzing multiple sequence alignments derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence. Then they are combined into a single predictor using support vector machine. What is more important, the domain detection is first taken as an imbal- anced data learning problem. A novel undersampling method is proposed on distance-based maximal entropy in the feature space of Support Vector Machine (SVM). The overall precision is about 80%. Simulation results demonstrate that the method can help not only in predicting the complete 3D structure of a protein but also in the machine learning system on general im- balanced datasets.展开更多
The maximal entropy ordered weighted averaging (ME-OWA) operator is used to aggregate metasearch engine results, and its newly analytical solution is also applied. Within the current context of the OWA operator, the...The maximal entropy ordered weighted averaging (ME-OWA) operator is used to aggregate metasearch engine results, and its newly analytical solution is also applied. Within the current context of the OWA operator, the methods for aggregating metasearch engine results are divided into two kinds. One has a unique solution, and the other has multiple solutions. The proposed method not only has crisp weights, but also provides multiple aggregation results for decision makers to choose from. In order to prove the application of the ME-OWA operator method, under the context of aggregating metasearch engine results, an example is given, which shows the results obtained by the ME-OWA operator method and the minimax linear programming ( minimax-LP ) method. Comparison between these two methods are also made. The results show that the ME-OWA operator has nearly the same aggregation results as those of the minimax-LP method.展开更多
Blindness which is considered as degrading disabling disease is the final stage that occurs when a certain threshold of visual acuity is overlapped. It happens with vision deficiencies that are pathologic states due t...Blindness which is considered as degrading disabling disease is the final stage that occurs when a certain threshold of visual acuity is overlapped. It happens with vision deficiencies that are pathologic states due to many ocular diseases. Among them, diabetic retinopathy is nowadays a chronic disease that attacks most of diabetic patients. Early detection through automatic screening programs reduces considerably expansion of the disease. Exudates are one of the earliest signs. This paper presents an automated method for exudates detection in digital retinal fundus image. The first step consists of image enhancement. It focuses on histogram expansion and median filter. The difference between filtered image and his inverse reduces noise and removes background while preserving features and patterns related to the exudates. The second step refers to blood vessel removal by using morphological operators. In the last step, we compute the result image with an algorithm based on Entropy Maximization Thresholding to obtain two segmented regions (optical disk and exudates) which were highlighted in the second step. Finally, according to size criteria, we eliminate the other regions obtain the regions of interest related to exudates. Evaluations were done with retinal fundus image DIARETDB1 database. DIARETDB1 gathers high-quality medical images which have been verified by experts. It consists of around 89 colour fundus images of which 84 contain at least mild non-proliferative signs of the diabetic retinopathy. This tool provides a unified framework for benchmarking the methods, but also points out clear deficiencies in the current practice in the method development. Comparing to other recent methods available in literature, we found that the proposed algorithm accomplished better result in terms of sensibility (94.27%) and specificity (97.63%).展开更多
In this paper, we consider a fuzzy c-means (FCM) clustering algorithm combined with the deterministic annealing method and the Tsallis entropy maximization. The Tsallis entropy is a q-parameter extension of the Shanno...In this paper, we consider a fuzzy c-means (FCM) clustering algorithm combined with the deterministic annealing method and the Tsallis entropy maximization. The Tsallis entropy is a q-parameter extension of the Shannon entropy. By maximizing the Tsallis entropy within the framework of FCM, membership functions similar to statistical mechanical distribution functions can be derived. One of the major considerations when using this method is how to determine appropriate q values and the highest annealing temperature, Thigh?, for a given data set. Accordingly, in this paper, a method for determining these values simultaneously without introducing any additional parameters is presented. In our approach, the membership function is approximated by a series of expansion methods and the K-means clustering algorithm is utilized as a preprocessing step to estimate a radius of each data distribution. The results of experiments indicate that the proposed method is effective and both q and Thigh can be determined automatically and algebraically from a given data set.展开更多
In this paper,complexity analysis and dynamic characteristics of electroencephalogram(EEG) signal based on maximal overlap discrete wavelet transform(MODWT) has been exploited for the identification of seizure onset.S...In this paper,complexity analysis and dynamic characteristics of electroencephalogram(EEG) signal based on maximal overlap discrete wavelet transform(MODWT) has been exploited for the identification of seizure onset.Since wavelet-based studies were well suited for classification of normal and epileptic seizure EEG,we have applied MODWT which is an improved version of discrete wavelet transform(DWT).The selection of optimal wavelet sub-band and features plays a crucial role to understand the brain dynamics in epileptic patients.Therefore,we have investigated MODWT using four different wavelets,namely Haar,Coif4,Dmey,and Sym4 sub-bands until seven levels.Further,we have explored the potentials of six entropies,namely sigmoid,Shannon,wavelet,Renyi,Tsallis,and Steins unbiased risk estimator(SURE) entropies in each sub-band.The sigmoid entropy extracted from Haar wavelet in sub-band D4 showed the highest accuracy of 98.44% using support vector machine classifier for the EEG collected from Ramaiah Medical College and Hospitals(RMCH).Further,the highest accuracy of 100% and 94.51% was achieved for the University of Bonn(UBonn) and CHB-MIT databases respectively.The findings of the study showed that Haar and Dmey wavelets were found to be computationally economical and expensive respectively.Besides,in terms of dynamic characteristics,MODWT results revealed that the highest energy present in sub-bands D2,D3,and D4 and entropies in those respective sub-bands outperformed other entropies in terms of classification results for RMCH database.Similarly,using all the entropies,sub-bands D5 and D6 outperformed other sub-bands for UBonn and CHB-MIT databases respectively.In conclusion,the comparison results of MODWT outperformed DWT.展开更多
In this paper,we introduce thickr-sensitivity,multi-r-sensitivity and block thick r-sensitivity for r≥2.We first give a characterization of a minimal system which is block thickly r-sensitive.Then we obtain a suffici...In this paper,we introduce thickr-sensitivity,multi-r-sensitivity and block thick r-sensitivity for r≥2.We first give a characterization of a minimal system which is block thickly r-sensitive.Then we obtain a sufficient condition of a minimal system which is thickly r-sensitive.The maximal pattern entropy of a multi-r-sensitive topological dynamical system is also discussed.展开更多
Given a free ergodic action of a discrete abelian group G on a measure space (X, 7), the crossed product LX (X, 7)p G contains two distinguished maximal abelian subalgebras. We discuss what kind of information about t...Given a free ergodic action of a discrete abelian group G on a measure space (X, 7), the crossed product LX (X, 7)p G contains two distinguished maximal abelian subalgebras. We discuss what kind of information about the action can be extracted from the positions of these two subalgebras inside the crossed product algebra.展开更多
We introduce the notion of measurable n-sensitivity for measure preserving systems,and study the relation between measurable n-sensitivity and the maximal pattern entropy.We prove that,if(X,ℬ,μ,T)is ergodic,then(X,ℬ,...We introduce the notion of measurable n-sensitivity for measure preserving systems,and study the relation between measurable n-sensitivity and the maximal pattern entropy.We prove that,if(X,ℬ,μ,T)is ergodic,then(X,ℬ,μ,T)is measurable n-sensitive but not measurable(n+1)-sensitive if and only if h_(μ)^(*)(T)=log n,where h_(μ)^(*)(T)is the maximal pattern entropy of T.展开更多
基金National Natural Science Foundation of China (Grant No. 60433020, 60673099, 60673023)"985" project of Jilin University
文摘Detecting the boundaries of protein domains is an important and challenging task in both experimental and computational structural biology. In this paper, a promising method for detecting the domain structure of a protein from sequence information alone is presented. The method is based on analyzing multiple sequence alignments derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence. Then they are combined into a single predictor using support vector machine. What is more important, the domain detection is first taken as an imbal- anced data learning problem. A novel undersampling method is proposed on distance-based maximal entropy in the feature space of Support Vector Machine (SVM). The overall precision is about 80%. Simulation results demonstrate that the method can help not only in predicting the complete 3D structure of a protein but also in the machine learning system on general im- balanced datasets.
基金The National Natural Science Foundation of China(No.71171048)
文摘The maximal entropy ordered weighted averaging (ME-OWA) operator is used to aggregate metasearch engine results, and its newly analytical solution is also applied. Within the current context of the OWA operator, the methods for aggregating metasearch engine results are divided into two kinds. One has a unique solution, and the other has multiple solutions. The proposed method not only has crisp weights, but also provides multiple aggregation results for decision makers to choose from. In order to prove the application of the ME-OWA operator method, under the context of aggregating metasearch engine results, an example is given, which shows the results obtained by the ME-OWA operator method and the minimax linear programming ( minimax-LP ) method. Comparison between these two methods are also made. The results show that the ME-OWA operator has nearly the same aggregation results as those of the minimax-LP method.
文摘Blindness which is considered as degrading disabling disease is the final stage that occurs when a certain threshold of visual acuity is overlapped. It happens with vision deficiencies that are pathologic states due to many ocular diseases. Among them, diabetic retinopathy is nowadays a chronic disease that attacks most of diabetic patients. Early detection through automatic screening programs reduces considerably expansion of the disease. Exudates are one of the earliest signs. This paper presents an automated method for exudates detection in digital retinal fundus image. The first step consists of image enhancement. It focuses on histogram expansion and median filter. The difference between filtered image and his inverse reduces noise and removes background while preserving features and patterns related to the exudates. The second step refers to blood vessel removal by using morphological operators. In the last step, we compute the result image with an algorithm based on Entropy Maximization Thresholding to obtain two segmented regions (optical disk and exudates) which were highlighted in the second step. Finally, according to size criteria, we eliminate the other regions obtain the regions of interest related to exudates. Evaluations were done with retinal fundus image DIARETDB1 database. DIARETDB1 gathers high-quality medical images which have been verified by experts. It consists of around 89 colour fundus images of which 84 contain at least mild non-proliferative signs of the diabetic retinopathy. This tool provides a unified framework for benchmarking the methods, but also points out clear deficiencies in the current practice in the method development. Comparing to other recent methods available in literature, we found that the proposed algorithm accomplished better result in terms of sensibility (94.27%) and specificity (97.63%).
文摘In this paper, we consider a fuzzy c-means (FCM) clustering algorithm combined with the deterministic annealing method and the Tsallis entropy maximization. The Tsallis entropy is a q-parameter extension of the Shannon entropy. By maximizing the Tsallis entropy within the framework of FCM, membership functions similar to statistical mechanical distribution functions can be derived. One of the major considerations when using this method is how to determine appropriate q values and the highest annealing temperature, Thigh?, for a given data set. Accordingly, in this paper, a method for determining these values simultaneously without introducing any additional parameters is presented. In our approach, the membership function is approximated by a series of expansion methods and the K-means clustering algorithm is utilized as a preprocessing step to estimate a radius of each data distribution. The results of experiments indicate that the proposed method is effective and both q and Thigh can be determined automatically and algebraically from a given data set.
文摘In this paper,complexity analysis and dynamic characteristics of electroencephalogram(EEG) signal based on maximal overlap discrete wavelet transform(MODWT) has been exploited for the identification of seizure onset.Since wavelet-based studies were well suited for classification of normal and epileptic seizure EEG,we have applied MODWT which is an improved version of discrete wavelet transform(DWT).The selection of optimal wavelet sub-band and features plays a crucial role to understand the brain dynamics in epileptic patients.Therefore,we have investigated MODWT using four different wavelets,namely Haar,Coif4,Dmey,and Sym4 sub-bands until seven levels.Further,we have explored the potentials of six entropies,namely sigmoid,Shannon,wavelet,Renyi,Tsallis,and Steins unbiased risk estimator(SURE) entropies in each sub-band.The sigmoid entropy extracted from Haar wavelet in sub-band D4 showed the highest accuracy of 98.44% using support vector machine classifier for the EEG collected from Ramaiah Medical College and Hospitals(RMCH).Further,the highest accuracy of 100% and 94.51% was achieved for the University of Bonn(UBonn) and CHB-MIT databases respectively.The findings of the study showed that Haar and Dmey wavelets were found to be computationally economical and expensive respectively.Besides,in terms of dynamic characteristics,MODWT results revealed that the highest energy present in sub-bands D2,D3,and D4 and entropies in those respective sub-bands outperformed other entropies in terms of classification results for RMCH database.Similarly,using all the entropies,sub-bands D5 and D6 outperformed other sub-bands for UBonn and CHB-MIT databases respectively.In conclusion,the comparison results of MODWT outperformed DWT.
文摘In this paper,we introduce thickr-sensitivity,multi-r-sensitivity and block thick r-sensitivity for r≥2.We first give a characterization of a minimal system which is block thickly r-sensitive.Then we obtain a sufficient condition of a minimal system which is thickly r-sensitive.The maximal pattern entropy of a multi-r-sensitive topological dynamical system is also discussed.
文摘Given a free ergodic action of a discrete abelian group G on a measure space (X, 7), the crossed product LX (X, 7)p G contains two distinguished maximal abelian subalgebras. We discuss what kind of information about the action can be extracted from the positions of these two subalgebras inside the crossed product algebra.
基金supported in part by the National Natural Science Foundation of China(Grant Nos.11871188,12031019).
文摘We introduce the notion of measurable n-sensitivity for measure preserving systems,and study the relation between measurable n-sensitivity and the maximal pattern entropy.We prove that,if(X,ℬ,μ,T)is ergodic,then(X,ℬ,μ,T)is measurable n-sensitive but not measurable(n+1)-sensitive if and only if h_(μ)^(*)(T)=log n,where h_(μ)^(*)(T)is the maximal pattern entropy of T.