The problem of pattern-based subspace clustering, a special type of subspace clustering that uses pattern similarity as a measure of similarity, is studied. Unlike most traditional clustering algorithms that group the...The problem of pattern-based subspace clustering, a special type of subspace clustering that uses pattern similarity as a measure of similarity, is studied. Unlike most traditional clustering algorithms that group the close values of objects in all the dimensions or a set of dimensions, clustering by pattern similarity shows an interesting pattern, where objects exhibit a coherent pattern of rise and fall in subspaces. A novel approach, named EMaPle to mine the maximal pattern-based subspace clusters, is designed. The EMaPle searches clusters only in the attribute enumeration spaces which are relatively few compared to the large number of row combinations in the typical datasets, and it exploits novel pruning techniques. EMaPle can find the clusters satisfying coherent constraints, size constraints and sign constraints neglected in MaPle. Both synthetic data sets and real data sets are used to evaluate EMaPle and demonstrate that it is more effective and scalable than MaPle.展开更多
A configurable ontology mapping approach based on different kinds of concept feature information is introduced in this paper. In this approach, ontology concept feature information is classified as five kinds, which r...A configurable ontology mapping approach based on different kinds of concept feature information is introduced in this paper. In this approach, ontology concept feature information is classified as five kinds, which respectively corresponds to five kinds of concept similarity computation methods. Many existing ontology mapping approaches have adopted the multi-feature reasoning, whereas not all feature information can be com- puted in the real ontology mapping and only fractional feature information needs to be selected in the mapping computation. Consequently a eonfigurable ontology mapping model is introduced, which is composed of CMT model, SMT model and related transformation model. Through the configurable model, users can conveniently select the most suitable features and configure the suitable weights. Simultaneously, a related 3-step ontology mapping approach is also introduced. Associated with the traditional name and instance learner-based ontology mapping approach, this approach is evaluated by an ontology mapping application example.展开更多
Based on Immune Programming(IP), a novel Radial Basis Function (RBF) networkdesigning method is proposed. Through extracting the preliminary knowledge about the widthof the basis function as the vaccine to form the im...Based on Immune Programming(IP), a novel Radial Basis Function (RBF) networkdesigning method is proposed. Through extracting the preliminary knowledge about the widthof the basis function as the vaccine to form the immune operator, the algorithm reduces thesearching space of canonical algorithm and improves the convergence speed. The application ofthe RBF network trained with the algorithm in the modulation-style recognition of radar signalsdemonstrates that the network has a fast convergence speed with good performances.展开更多
Tax payers and tax administrators are the main structural groups in tax system. They interact and have an impact on each other's actions following by tax compliance or tax non-compliance. However, no wider study enco...Tax payers and tax administrators are the main structural groups in tax system. They interact and have an impact on each other's actions following by tax compliance or tax non-compliance. However, no wider study encompassing both tax payers and tax administrators has been conducted in Lithuania. Since a survey of all participants in the tax system would require substantial time, human, and financial resources, during the first study, only one group, tax payers, was surveyed. During the second study, tax administrators were surveyed along with the tax payers. The present study has the following objectives: to describe the problem of tax evasion in the context of attitudes and behaviours of participants in the tax system; to estimate the tax compliance and evasion situation in Lithuania on the basis of attitudes and behaviours of tax payers; to establish how tax administrators estimate the tax compliance (evasion) situation in Lithuania; to assess the relationship between tax administrators and tax payers; to determine similarities and differences of their attitudes; to assess the key aspects of tax evasion; and to identify measures for the solution of this problem.展开更多
The quantitative structure-activity relationship(QSAR) of 2-alkyl-4-(biphenylylmethoxy) pyridine derivatives was studied.Three different alignment methods were used to get the models of the comparative molecular field...The quantitative structure-activity relationship(QSAR) of 2-alkyl-4-(biphenylylmethoxy) pyridine derivatives was studied.Three different alignment methods were used to get the models of the comparative molecular field analysis(CoMFA),the comparative molecular similarity indices analysis(CoMSIA),and the hologram quantitative structure?activity relationship(HQSAR).The statistical results from the established models show believable predictivity based on the cross-validated value(q2>0.5) and the non-validated value(r2>0.9),The analysis on contour maps of CoMFA and CoMSIA models suggests that hydrophobic and hydrogen-bond acceptor fields are important factors that affect the AT1 antagonistic activity of 2-alkyl-4-(biphenylylmethoxy) pyridine derivatives besides the steric and electrostatic fields,The structural modification information from different atom contributions in the HQSAR model is in agreement with that in the 3D-QSAR models.展开更多
A novel method case-based reasoning was proposed for suspicious behavior recognition. The method is composed of three departs: human behavior decomposition, human behavior case representation and case-based reasoning....A novel method case-based reasoning was proposed for suspicious behavior recognition. The method is composed of three departs: human behavior decomposition, human behavior case representation and case-based reasoning. The new approach was proposed to decompose behavior into sub-behaviors that are easier to recognize using a saliency-based visual attention model. New representation of behavior was introduced, in which the sub-behavior and the associated time characteristic of sub-behavior were used to represent behavior case. In the process of case-based reasoning, apart from considering the similarity of basic sub-behaviors,order factor was proposed to measure the similarity of a time order among the sub-behaviors and span factor was used to measure the similarity of duration time of each sub-behavior, which makes the similarity calculations more rational and comprehensive.Experimental results show the effectiveness of the proposed method in comparison with other related works and can run in real-time for the recognition of suspicious behaviors.展开更多
Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing gen...Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments.With the advances of the high-throughput techniques,a large number of protein-protein interactions have been produced.Therefore,to address this issue,several methods based on protein interaction network have been proposed.In this paper,we propose a shortest path-based algorithm,named SPranker,to prioritize disease-causing genes in protein interaction networks.Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes,we further propose an improved algorithm SPGOranker by integrating the semantic similarity of gene ontology(GO)annotations.SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account.The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches,ICN,VS and RWR.The experimental results show that SPranker and SPGOranker outperform ICN,VS,and RWR for the prioritization of orphan disease-causing genes.Importantly,for the case study of severe combined immunodeficiency,SPranker and SPGOranker predict several novel causal genes.展开更多
Graphical representation of DNA sequences is a key component in studying biological problems. In order to gain new insights in DNA sequences, this paper combined the digitized methods of single-base, base pairs and co...Graphical representation of DNA sequences is a key component in studying biological problems. In order to gain new insights in DNA sequences, this paper combined the digitized methods of single-base, base pairs and coding in triplet bases with the times of base appearing, and then a novel 4D graphical representation method of DNA sequences was put forward. It was a one-to-one correspondence of the arbitrary DNA sequence and 4D graphical representation, that avoided causing non-unique 4D graphical representation and overlapping lines. The method could reflect the biological information features of DNA sequence more comprehensively and effectively without any losses. Based on the 4D graphical representation, we used the geometric center of 4D graphical representation as eigenvalue of DNA sequences analyses, which kept the original features of the data, and then established the Euclidean distances and included angles between vectors' ter- minal point for similarity analyses of the first extron of the beta-globulin gene among 11 species. Finally, we established the graph of systematic hierarchical cluster analysis of 11 species to observe more easily the relationship between species. A positive outcome was reached, and the results were in accord with biological taxonomy, which also supported the rationality and effectiveness of the novel 4D graphical representation.展开更多
Self-similarity based model of proton structure function at small x was reported in the literature sometime back. The phenomenological validity of the model is in the kinematical region 6.2 × 10-7 ≤ x ≤ 10-2 an...Self-similarity based model of proton structure function at small x was reported in the literature sometime back. The phenomenological validity of the model is in the kinematical region 6.2 × 10-7 ≤ x ≤ 10-2 and 0.045 ≤ Q2 ≤ 120 GeV2. We use momentum sum rule to pin down the corresponding self-similarity-based gluon distribution function valid in the same kinematical region. The model is then used to compute bound on the longitudinal structure function FL(X, Q2) for A1tarelli-Martinelli equation in QCD and is compared with the recent HERA data.展开更多
Vibration signal is an important prerequisite for mechanical fault detection. However, early stage defect of rotating machiner- ies is difficult to identify because their incipient energy is interfered with background...Vibration signal is an important prerequisite for mechanical fault detection. However, early stage defect of rotating machiner- ies is difficult to identify because their incipient energy is interfered with background noises. Multiwavelet is a powerful tool used to conduct non-stationary fault feature extraction. However, the existing predetermined multiwavelet bases are independ- ent of the dynamic response signals. In this paper, a constructing technique of vibration data-driven maximal-overlap adaptive multiwavelet (MOAMW) is proposed for enhancing the extracting performance of fault symptom. It is able to derive an opti- mal multiwavelet basis that best matches the critical non-stationary and transient fault signatures via genetic algorithm. In this technique, two-scale similarity transform (TST) and symmetric lifting (SymLift) scheme are combined to gain high designing freedom for matching the critical faulty vibration contents in vibration signals based on the maximal fitness objective. TST and SymLift can add modifications to the initial multiwavelet by changing the approximation order and vanishing moment of mul- tiwavelet, respectively. Moreover, the beneficial feature of the MOAWM lies in that the maximal-overlap filterbank structure can enhance the periodic and transient characteristics of the sensor signals and preserve the time and frequency analyzing res- olution during the decomposition process. The effectiveness of the proposed technique is validated via a numerical simulation as well as a rolling element beating with an outer race scrape and a gearbox with rub fault.展开更多
Based on isolobal analogy,the definition of metallafuran was described and its synthetic chemistry was briefly summarized.In the structure of furan,when one of CH groups was replaced by an isolobal metal fragment ML n...Based on isolobal analogy,the definition of metallafuran was described and its synthetic chemistry was briefly summarized.In the structure of furan,when one of CH groups was replaced by an isolobal metal fragment ML n(M=metal;L=ligand),the corresponding organometallic complex was called metallafuran,which should be two possible isomers called a-metallafuran(metal fragment at a-carbon of original furan)and b-metallafuran(metal fragment at b-carbon of original furan).As an organometallic complex,a-metallafuran has two resonant forms:one can be viewed as carbonyl coordinated vinyl metal complex and the other can be viewed as alkoxymetal carbene.Therefore,a-metallafuran was also called chelated vinyl ketone metal complex or oxametallacyclopentadiene in the early literatures.For synthesis of metallafurans,a-metallafurans were very common and easily prepared,for example,from alkynes insertion into acyl metal complexes and so on.While there were rare examples reported for b-metallafurans.In this mini review,the synthetic chemistry of metallafuran was mainly focused on its formation mechanism.展开更多
It is necessary to build turbulence model to study the response of aircraft to atmospheric turbulence for high resolution earth observation. The conventional method is on the basis of Dryden’s model with the assumpti...It is necessary to build turbulence model to study the response of aircraft to atmospheric turbulence for high resolution earth observation. The conventional method is on the basis of Dryden’s model with the assumption that individual patches are Gaussian. In this paper,based on Kraichnan’s refined similarity idea,a new 1D atmospheric turbulence model is set up by introducing the energy transfer rate as an intermittency disturbance to a Gaussian process. Our results show that the turbulent fields generated by our new method exhibit an anomalous scaling described by the She-Leveque (SL) formula,which is now well accepted for homogenous and isotropic turbulence.展开更多
New analogues of green fluorescent protein (GFP) chromophore mGFP-Cn (n = 1, 3, 5, 11) with alkyl chains of different lengths in the imidazolinone rings were synthesized and their crystal structures were determined. T...New analogues of green fluorescent protein (GFP) chromophore mGFP-Cn (n = 1, 3, 5, 11) with alkyl chains of different lengths in the imidazolinone rings were synthesized and their crystal structures were determined. These GFP-like chromophores are all emissive in the solid state. And the solid-state emission quantum yields of increase by extending the lengths of alkyl chains, owing to the fact that the intermolecular pi-pi interactions are significantly weakened based on their crystal structures.展开更多
基金The National Natural Science Foundation of China(No60273075)
文摘The problem of pattern-based subspace clustering, a special type of subspace clustering that uses pattern similarity as a measure of similarity, is studied. Unlike most traditional clustering algorithms that group the close values of objects in all the dimensions or a set of dimensions, clustering by pattern similarity shows an interesting pattern, where objects exhibit a coherent pattern of rise and fall in subspaces. A novel approach, named EMaPle to mine the maximal pattern-based subspace clusters, is designed. The EMaPle searches clusters only in the attribute enumeration spaces which are relatively few compared to the large number of row combinations in the typical datasets, and it exploits novel pruning techniques. EMaPle can find the clusters satisfying coherent constraints, size constraints and sign constraints neglected in MaPle. Both synthetic data sets and real data sets are used to evaluate EMaPle and demonstrate that it is more effective and scalable than MaPle.
基金Sponsored by the 973 Natural Key Basis Research and Development Plan (Grant No.973: 2003CB316905)the National Natural Science Foundationof China (Grant No.60374071)
文摘A configurable ontology mapping approach based on different kinds of concept feature information is introduced in this paper. In this approach, ontology concept feature information is classified as five kinds, which respectively corresponds to five kinds of concept similarity computation methods. Many existing ontology mapping approaches have adopted the multi-feature reasoning, whereas not all feature information can be com- puted in the real ontology mapping and only fractional feature information needs to be selected in the mapping computation. Consequently a eonfigurable ontology mapping model is introduced, which is composed of CMT model, SMT model and related transformation model. Through the configurable model, users can conveniently select the most suitable features and configure the suitable weights. Simultaneously, a related 3-step ontology mapping approach is also introduced. Associated with the traditional name and instance learner-based ontology mapping approach, this approach is evaluated by an ontology mapping application example.
文摘Based on Immune Programming(IP), a novel Radial Basis Function (RBF) networkdesigning method is proposed. Through extracting the preliminary knowledge about the widthof the basis function as the vaccine to form the immune operator, the algorithm reduces thesearching space of canonical algorithm and improves the convergence speed. The application ofthe RBF network trained with the algorithm in the modulation-style recognition of radar signalsdemonstrates that the network has a fast convergence speed with good performances.
文摘Tax payers and tax administrators are the main structural groups in tax system. They interact and have an impact on each other's actions following by tax compliance or tax non-compliance. However, no wider study encompassing both tax payers and tax administrators has been conducted in Lithuania. Since a survey of all participants in the tax system would require substantial time, human, and financial resources, during the first study, only one group, tax payers, was surveyed. During the second study, tax administrators were surveyed along with the tax payers. The present study has the following objectives: to describe the problem of tax evasion in the context of attitudes and behaviours of participants in the tax system; to estimate the tax compliance and evasion situation in Lithuania on the basis of attitudes and behaviours of tax payers; to establish how tax administrators estimate the tax compliance (evasion) situation in Lithuania; to assess the relationship between tax administrators and tax payers; to determine similarities and differences of their attitudes; to assess the key aspects of tax evasion; and to identify measures for the solution of this problem.
基金Project(20876180) supported by the National Natural Science Foundation of China
文摘The quantitative structure-activity relationship(QSAR) of 2-alkyl-4-(biphenylylmethoxy) pyridine derivatives was studied.Three different alignment methods were used to get the models of the comparative molecular field analysis(CoMFA),the comparative molecular similarity indices analysis(CoMSIA),and the hologram quantitative structure?activity relationship(HQSAR).The statistical results from the established models show believable predictivity based on the cross-validated value(q2>0.5) and the non-validated value(r2>0.9),The analysis on contour maps of CoMFA and CoMSIA models suggests that hydrophobic and hydrogen-bond acceptor fields are important factors that affect the AT1 antagonistic activity of 2-alkyl-4-(biphenylylmethoxy) pyridine derivatives besides the steric and electrostatic fields,The structural modification information from different atom contributions in the HQSAR model is in agreement with that in the 3D-QSAR models.
基金Project(50808025)supported by the National Natural Science Foundation of ChinaProject(2013GK3012)supported by the Science and Technology Project of Hunan Province,China
文摘A novel method case-based reasoning was proposed for suspicious behavior recognition. The method is composed of three departs: human behavior decomposition, human behavior case representation and case-based reasoning. The new approach was proposed to decompose behavior into sub-behaviors that are easier to recognize using a saliency-based visual attention model. New representation of behavior was introduced, in which the sub-behavior and the associated time characteristic of sub-behavior were used to represent behavior case. In the process of case-based reasoning, apart from considering the similarity of basic sub-behaviors,order factor was proposed to measure the similarity of a time order among the sub-behaviors and span factor was used to measure the similarity of duration time of each sub-behavior, which makes the similarity calculations more rational and comprehensive.Experimental results show the effectiveness of the proposed method in comparison with other related works and can run in real-time for the recognition of suspicious behaviors.
基金supported in part by the National Natural Science Foundation of China(61370024,61428209,61232001)Program for New Century Excellent Talents in University(NCET-12-0547)
文摘Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies.However,it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments.With the advances of the high-throughput techniques,a large number of protein-protein interactions have been produced.Therefore,to address this issue,several methods based on protein interaction network have been proposed.In this paper,we propose a shortest path-based algorithm,named SPranker,to prioritize disease-causing genes in protein interaction networks.Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes,we further propose an improved algorithm SPGOranker by integrating the semantic similarity of gene ontology(GO)annotations.SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account.The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches,ICN,VS and RWR.The experimental results show that SPranker and SPGOranker outperform ICN,VS,and RWR for the prioritization of orphan disease-causing genes.Importantly,for the case study of severe combined immunodeficiency,SPranker and SPGOranker predict several novel causal genes.
基金The work was supported by the National Natural Science Foundation of China (Grant No. 11271163) and by the Fundamental Research Funds for the Central Universities (JUSRP51317B).
文摘Graphical representation of DNA sequences is a key component in studying biological problems. In order to gain new insights in DNA sequences, this paper combined the digitized methods of single-base, base pairs and coding in triplet bases with the times of base appearing, and then a novel 4D graphical representation method of DNA sequences was put forward. It was a one-to-one correspondence of the arbitrary DNA sequence and 4D graphical representation, that avoided causing non-unique 4D graphical representation and overlapping lines. The method could reflect the biological information features of DNA sequence more comprehensively and effectively without any losses. Based on the 4D graphical representation, we used the geometric center of 4D graphical representation as eigenvalue of DNA sequences analyses, which kept the original features of the data, and then established the Euclidean distances and included angles between vectors' ter- minal point for similarity analyses of the first extron of the beta-globulin gene among 11 species. Finally, we established the graph of systematic hierarchical cluster analysis of 11 species to observe more easily the relationship between species. A positive outcome was reached, and the results were in accord with biological taxonomy, which also supported the rationality and effectiveness of the novel 4D graphical representation.
文摘Self-similarity based model of proton structure function at small x was reported in the literature sometime back. The phenomenological validity of the model is in the kinematical region 6.2 × 10-7 ≤ x ≤ 10-2 and 0.045 ≤ Q2 ≤ 120 GeV2. We use momentum sum rule to pin down the corresponding self-similarity-based gluon distribution function valid in the same kinematical region. The model is then used to compute bound on the longitudinal structure function FL(X, Q2) for A1tarelli-Martinelli equation in QCD and is compared with the recent HERA data.
基金supported by the National Natural Science Foundation of China(Grant No.51275384)the Key Project of National Natural Science Foundation of China(Grant No.51035007)+1 种基金the National Basic Research Program of China(Grant No.2009CB724405)the Specialized Research Fund for the Doctoral Program of Higher Education(Grant No.20110201130001)
文摘Vibration signal is an important prerequisite for mechanical fault detection. However, early stage defect of rotating machiner- ies is difficult to identify because their incipient energy is interfered with background noises. Multiwavelet is a powerful tool used to conduct non-stationary fault feature extraction. However, the existing predetermined multiwavelet bases are independ- ent of the dynamic response signals. In this paper, a constructing technique of vibration data-driven maximal-overlap adaptive multiwavelet (MOAMW) is proposed for enhancing the extracting performance of fault symptom. It is able to derive an opti- mal multiwavelet basis that best matches the critical non-stationary and transient fault signatures via genetic algorithm. In this technique, two-scale similarity transform (TST) and symmetric lifting (SymLift) scheme are combined to gain high designing freedom for matching the critical faulty vibration contents in vibration signals based on the maximal fitness objective. TST and SymLift can add modifications to the initial multiwavelet by changing the approximation order and vanishing moment of mul- tiwavelet, respectively. Moreover, the beneficial feature of the MOAWM lies in that the maximal-overlap filterbank structure can enhance the periodic and transient characteristics of the sensor signals and preserve the time and frequency analyzing res- olution during the decomposition process. The effectiveness of the proposed technique is validated via a numerical simulation as well as a rolling element beating with an outer race scrape and a gearbox with rub fault.
基金supported by the National Natural Science Foundation of China (21302158 and 21472156)the National Basic Research Program of China (2012CB821600)
文摘Based on isolobal analogy,the definition of metallafuran was described and its synthetic chemistry was briefly summarized.In the structure of furan,when one of CH groups was replaced by an isolobal metal fragment ML n(M=metal;L=ligand),the corresponding organometallic complex was called metallafuran,which should be two possible isomers called a-metallafuran(metal fragment at a-carbon of original furan)and b-metallafuran(metal fragment at b-carbon of original furan).As an organometallic complex,a-metallafuran has two resonant forms:one can be viewed as carbonyl coordinated vinyl metal complex and the other can be viewed as alkoxymetal carbene.Therefore,a-metallafuran was also called chelated vinyl ketone metal complex or oxametallacyclopentadiene in the early literatures.For synthesis of metallafurans,a-metallafurans were very common and easily prepared,for example,from alkynes insertion into acyl metal complexes and so on.While there were rare examples reported for b-metallafurans.In this mini review,the synthetic chemistry of metallafuran was mainly focused on its formation mechanism.
基金supported by the National Natural Science Foundation of China (Grant No.10902007)the National Basic Research Program of China ("973" Program) (Grant No.2009CB724001)
文摘It is necessary to build turbulence model to study the response of aircraft to atmospheric turbulence for high resolution earth observation. The conventional method is on the basis of Dryden’s model with the assumption that individual patches are Gaussian. In this paper,based on Kraichnan’s refined similarity idea,a new 1D atmospheric turbulence model is set up by introducing the energy transfer rate as an intermittency disturbance to a Gaussian process. Our results show that the turbulent fields generated by our new method exhibit an anomalous scaling described by the She-Leveque (SL) formula,which is now well accepted for homogenous and isotropic turbulence.
基金the National Natural Science Foundation of China (21021091)National Basic Research Program of China (2013CB834700, 2011CB808400)
文摘New analogues of green fluorescent protein (GFP) chromophore mGFP-Cn (n = 1, 3, 5, 11) with alkyl chains of different lengths in the imidazolinone rings were synthesized and their crystal structures were determined. These GFP-like chromophores are all emissive in the solid state. And the solid-state emission quantum yields of increase by extending the lengths of alkyl chains, owing to the fact that the intermolecular pi-pi interactions are significantly weakened based on their crystal structures.