^(23)Na is a nuclear magnetic resonance(NMR)-active isotope with a nuclear spin quantum number of 3/2.^(23)Na relaxation phenomenon is at the core of ^(23)Na NMR measurement and analysis.Due to the dominance of quadru...^(23)Na is a nuclear magnetic resonance(NMR)-active isotope with a nuclear spin quantum number of 3/2.^(23)Na relaxation phenomenon is at the core of ^(23)Na NMR measurement and analysis.Due to the dominance of quadrupolar interaction,the relaxation behavior of ^(23)Na is physically and mathematically more complex than that of a typical spin-1/2 isotope.In this review,we overview the semi-classical Redfield theory for deriving the formulations of ^(23)Na relaxation.We show that the relaxation behaviors of ^(23)Na can be quantitatively described by constructing the spectral density functions based on the second-order perturbation theory.In addition,we summarize the applications of ^(23)Na relaxometry in different research fields,including biomedicine,sodium ion batteries,and quantum information processing.Because sodium is an essential element in our body,food and industrial materials,the research on sodium by ^(23)Na NMR emerges as important future directions.The theoretical and practical understandings on ^(23)Na relaxation are the step stones for mastering advanced ^(23)Na NMR techniques.展开更多
As the importance of email increases,the amount of malicious email is also increasing,so the need for malicious email filtering is growing.Since it is more economical to combine commodity hardware consisting of a medi...As the importance of email increases,the amount of malicious email is also increasing,so the need for malicious email filtering is growing.Since it is more economical to combine commodity hardware consisting of a medium server or PC with a virtual environment to use as a single server resource and filter malicious email using machine learning techniques,we used a Hadoop MapReduce framework and Naïve Bayes among machine learning methods for malicious email filtering.Naïve Bayes was selected because it is one of the top machine learning methods(Support Vector Machine(SVM),Naïve Bayes,K-Nearest Neighbor(KNN),and Decision Tree)in terms of execution time and accuracy.Malicious email was filtered with MapReduce programming using the Naïve Bayes technique,which is a supervised machine learning method,in a Hadoop framework with optimized performance and also with the Python program technique with the Naïve Bayes technique applied in a bare metal server environment with the Hadoop environment not applied.According to the results of a comparison of the accuracy and predictive error rates of the two methods,the Hadoop MapReduce Naïve Bayes method improved the accuracy of spam and ham email identification 1.11 times and the prediction error rate 14.13 times compared to the non-Hadoop Python Naïve Bayes method.展开更多
The differences of grain-refining effect between Sc and Ti additions in aluminum,which cannot be substantially explained by traditional theories,were carefully studied.The empirical electron theory(EET) of solids and ...The differences of grain-refining effect between Sc and Ti additions in aluminum,which cannot be substantially explained by traditional theories,were carefully studied.The empirical electron theory(EET) of solids and molecules was employed to calculate the valence electron structures(VES) of Al3Ti and Al3Sc.The conclusions can be drawn that,in the two alloys Al-Ti and Al-Sc,the different valence electron structures of Al3Ti and Al3Sc and the consequent differences of growth habit of the two particles,and the different interfacial electron density between particles and matrix fundamentally lead to the differences of grain-refining effect between Sc and Ti additions on aluminum and make Sr the better grain-refiner of aluminum.展开更多
The naïve Bayes classifier is one of the commonly used data mining methods for classification.Despite its simplicity,naïve Bayes is effective and computationally efficient.Although the strong attribute indep...The naïve Bayes classifier is one of the commonly used data mining methods for classification.Despite its simplicity,naïve Bayes is effective and computationally efficient.Although the strong attribute independence assumption in the naïve Bayes classifier makes it a tractable method for learning,this assumption may not hold in real-world applications.Many enhancements to the basic algorithm have been proposed in order to alleviate the violation of attribute independence assumption.While these methods improve the classification performance,they do not necessarily retain the mathematical structure of the naïve Bayes model and some at the expense of computational time.One approach to reduce the naïvetéof the classifier is to incorporate attribute weights in the conditional probability.In this paper,we proposed a method to incorporate attribute weights to naïve Bayes.To evaluate the performance of our method,we used the public benchmark datasets.We compared our method with the standard naïve Bayes and baseline attribute weighting methods.Experimental results show that our method to incorporate attribute weights improves the classification performance compared to both standard naïve Bayes and baseline attribute weighting methods in terms of classification accuracy and F1,especially when the independence assumption is strongly violated,which was validated using the Chi-square test of independence.展开更多
The equilibrium geometries, relative stabilities, and electronic properties of MnAgm(M=Na, Li; n + m ≤ 7) as well as pure Agn, Nan, Lin (n ≤ 7) clusters are systematically investigated by means of the density f...The equilibrium geometries, relative stabilities, and electronic properties of MnAgm(M=Na, Li; n + m ≤ 7) as well as pure Agn, Nan, Lin (n ≤ 7) clusters are systematically investigated by means of the density functional theory. The optimized geometries reveal that for 2 ≤ n ≤ 7, there are significant similarities in geometry among pure Agn, Nan, and Lin clusters, and the transitions from planar to three-dimensional configurations occur at n = 7, 7, and 6, respectively. In contrast, the first three-dimensional (3D) structures are observed at n + m = 5 for both NanAgm and LinAgm clusters. When n + m ≥5, a striking feature is that the trigonal bipyramid becomes the main subunit of LinAgm. Furthermore, dramatic odd-even alternative behaviours are obtained in the fragmentation energies, secondorder difference energies, highest occupied and lowest unoccupied molecular orbital energy gaps, and chemical hardness for both pure and doped clusters. The analytic results exhibit that clusters with an even electronic configuration (2, 4, 6) possess the weakest chemical reactivity and more enhanced stability.展开更多
<span style="font-family:Verdana;">The presence of bearing faults reduces the efficiency of rotating machines and thus increases energy consumption or even the total stoppage of the machine. </span&...<span style="font-family:Verdana;">The presence of bearing faults reduces the efficiency of rotating machines and thus increases energy consumption or even the total stoppage of the machine. </span><span style="font-family:Verdana;">It becomes essential to correctly diagnose the fault caused by the bearing.</span><span style="font-family:Verdana;"> Hence the importance of determining an effective features extraction method that best describes the fault. The vision of this paper is to merge the features selection methods in order to define the most relevant featuresin the texture </span><span style="font-family:Verdana;">of the vibration signal images. In this study, the Gray Level Co-occurrence </span><span style="font-family:Verdana;">Matrix (GLCM) in texture analysis is applied on the vibration signal represented in images. Features</span><span><span><span style="font-family:;" "=""> </span></span></span><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;">selection based on the merge of PCA (Principal component Analysis) method and SFE (Sequential Features Extraction) method is </span><span style="font-family:Verdana;">done to obtain the most relevant features. The multiclass-Na<span style="white-space:nowrap;">?</span>ve Bayesclassifi</span><span style="font-family:Verdana;">er is used to test the proposed approach. The success rate of this classification is 98.27%. The relevant features obtained give promising results and are more efficient than the methods observed in the literature.</span></span></span></span>展开更多
The electronic structures, Born effective charges(BECs), and full phonon dispersions of cubic, tetragonal, orthorhombic, and rhombohedral K0.5Na0.5Nb O3 are investigated by the first principles method based on densi...The electronic structures, Born effective charges(BECs), and full phonon dispersions of cubic, tetragonal, orthorhombic, and rhombohedral K0.5Na0.5Nb O3 are investigated by the first principles method based on density functional theory.The hybridized states of Nb 4d and O 2p states are observed in the valence band, showing the formation of a strong Nb–O covalent bond which should be responsible for the displacement of Nb and O atoms. The abnormally large BECs of Nb and O indicate the possibility of phase instability induced by the off-center displacement of Nb and O atoms. The phonon dispersions reveal that the ferroelectric instability of K0.5Na0.5Nb O3 is dominated by Nb and O displacements with significant Na characteristics. In addition to the ferroelectric instability, there is also rotational instability coming from the oxygen octahedra rotation around one axis. Moreover, the Γ phonon properties of orthorhombic KNb O3, Na Nb O3, and K0.5Na0.5Nb O3 are also studied in detail.展开更多
AIM: To characterize the prevalence of subpopulations of CD4+ cells along with that of major inhibitor or stimulator cell types in therapy-nave childhood Crohn's disease (CD) and to test whether abnormalities of...AIM: To characterize the prevalence of subpopulations of CD4+ cells along with that of major inhibitor or stimulator cell types in therapy-nave childhood Crohn's disease (CD) and to test whether abnormalities of immune phenotype are normalized with the improvement of clinical signs and symptoms of disease. METHODS: We enrolled 26 pediatric patients with CD. 14 therapy-nave CD children; of those, 10 children remitted on conventional therapy and formed the remission group. We also tested another group of 12 chil-dren who relapsed with conventional therapy and were given infliximab; and 15 healthy children who served as controls. The prevalence of Th1 and Th2, nave and memory, activated and regulatory T cells, along with the members of innate immunity such as natural killer (NK), NK-T, myeloid and plasmocytoid dendritic cells (DCs), monocytes and Toll-like receptor (TLR)-2 and TLR-4 expression were determined in peripheral blood samples. RESULTS: Children with therapy-nave CD and those in relapse showed a decrease in Th1 cell prevalence. Simultaneously, an increased prevalence of memory and activated lymphocytes along with that of DCs and monocytes was observed. In addition, the ratio of myeloid /plasmocytoid DCs and the prevalence of TLR-2 or TLR-4 positive DCs and monocytes were also higher in therapy-nave CD than in controls. The majority of alterations diminished in remitted CD irrespective of whether remission was obtained by conventional or biological therapy. CONCLUSION: The finding that immune phenotype is normalized in remission suggests a link between immune phenotype and disease activity in childhood CD. Our observations support the involvement of members of the adaptive and innate immune systems in childhood CD.展开更多
This paper seeks to identify the minimal restrictions that need to be placed on the naive comprehension principle to avoid inconsistency in set theory. Analysis of the logical antinomies shows that at the root of inco...This paper seeks to identify the minimal restrictions that need to be placed on the naive comprehension principle to avoid inconsistency in set theory. Analysis of the logical antinomies shows that at the root of inconsistency in naive set theory are certain "self contradictory" predicate functions in extensional set descriptions containing the matrix "-(x∈y)" (or "-(x∈x)") rather than "size," vicious circularity, or self-reference. A reformed set comprehension system is proposed that excludes extensional set descriptions that conform to the formula, (Vx) (Зy) (x∈y →P (x)) (3u) (u∈y→(u∈y)), from comprehension and otherwise preserves the ontology of na'fve set theory. This reform avoids the paradoxes by scrutiny of a set's description without recourse to type or other constructivist limitations on self-membership and has the most liberal rules for set formation conceivable including self-membership. The intuitive appeal for such an approach is compelling because as a revision of na'fve set theory, it allows all possible set descriptions that do not lead to inconsistency.展开更多
Intrusion detection is the investigation process of information about the system activities or its data to detect any malicious behavior or unauthorized activity.Most of the IDS implement K-means clustering technique ...Intrusion detection is the investigation process of information about the system activities or its data to detect any malicious behavior or unauthorized activity.Most of the IDS implement K-means clustering technique due to its linear complexity and fast computing ability.Nonetheless,it is Naïve use of the mean data value for the cluster core that presents a major drawback.The chances of two circular clusters having different radius and centering at the same mean will occur.This condition cannot be addressed by the K-means algorithm because the mean value of the various clusters is very similar together.However,if the clusters are not spherical,it fails.To overcome this issue,a new integrated hybrid model by integrating expectation maximizing(EM)clustering using a Gaussian mixture model(GMM)and naïve Bays classifier have been proposed.In this model,GMM give more flexibility than K-Means in terms of cluster covariance.Also,they use probabilities function and soft clustering,that’s why they can have multiple cluster for a single data.In GMM,we can define the cluster form in GMM by two parameters:the mean and the standard deviation.This means that by using these two parameters,the cluster can take any kind of elliptical shape.EM-GMM will be used to cluster data based on data activity into the corresponding category.展开更多
The electronic structure of Eu-doped NaTaO3 in Na-rich environment is investigated by the first-principles theory. By simulating the two different models of Eu3+ ions selectively located in Ta and Na sites, respectiv...The electronic structure of Eu-doped NaTaO3 in Na-rich environment is investigated by the first-principles theory. By simulating the two different models of Eu3+ ions selectively located in Ta and Na sites, respectively, the band gaps of two Eu-doped NaTaO3 models were all narrowed, which were assigned to lattice defects and impurity band of the Eu dopent. For the model of Eu3+ ions located in the Na+ sites of NaTaO3, the new impurity band mainly composited of Eu 4f orbital appeared at the top over the valence band, indicating the enhanced oxidative ability. For the model of Eu3+ ions located in the Ta5+ sites of NaTaO3, a midgap state generated was located at the bottom of conduct band and the band potential shifted up, confirming the strong reductive ability in the Na-rich enviornment. The densities of electron states were significantly increased in both the conduction and valence bands in Na-rich model, which resulted in the increased carrier migration rate and thus photocatalytic activity enhancement. It is proposed that Eu3+ ions doping at the Ta sites could enhance the reduced photocatalytic performance via controlling the nonstoichiometric Na/Ta molar ratio in the Eu-doped NaTaO3 system.展开更多
This paper proposed an improved Naïve Bayes Classifier for sentimental analysis from a large-scale dataset such as in YouTube.YouTube contains large unstructured and unorganized comments and reactions,which carry...This paper proposed an improved Naïve Bayes Classifier for sentimental analysis from a large-scale dataset such as in YouTube.YouTube contains large unstructured and unorganized comments and reactions,which carry important information.Organizing large amounts of data and extracting useful information is a challenging task.The extracted information can be considered as new knowledge and can be used for deci sion-making.We extract comments from YouTube on videos and categorized them in domain-specific,and then apply the Naïve Bayes classifier with improved techniques.Our method provided a decent 80%accuracy in classifying those comments.This experiment shows that the proposed method provides excellent adaptability for large-scale text classification.展开更多
Social media networks are becoming essential to our daily activities,and many issues are due to this great involvement in our lives.Cyberbullying is a social media network issue,a global crisis affecting the victims a...Social media networks are becoming essential to our daily activities,and many issues are due to this great involvement in our lives.Cyberbullying is a social media network issue,a global crisis affecting the victims and society as a whole.It results from a misunderstanding regarding freedom of speech.In this work,we proposed a methodology for detecting such behaviors(bullying,harassment,and hate-related texts)using supervised machine learning algo-rithms(SVM,Naïve Bayes,Logistic regression,and random forest)and for predicting a topic associated with these text data using unsupervised natural language processing,such as latent Dirichlet allocation.In addition,we used accuracy,precision,recall,and F1 score to assess prior classifiers.Results show that the use of logistic regression,support vector machine,random forest model,and Naïve Bayes has 95%,94.97%,94.66%,and 93.1%accuracy,respectively.展开更多
基金National Natural Science Foundation of China 22275159 and 22072133.Leading Innovation and Entrepreneurship Team of Zhejiang Province 2020R01003.
文摘^(23)Na is a nuclear magnetic resonance(NMR)-active isotope with a nuclear spin quantum number of 3/2.^(23)Na relaxation phenomenon is at the core of ^(23)Na NMR measurement and analysis.Due to the dominance of quadrupolar interaction,the relaxation behavior of ^(23)Na is physically and mathematically more complex than that of a typical spin-1/2 isotope.In this review,we overview the semi-classical Redfield theory for deriving the formulations of ^(23)Na relaxation.We show that the relaxation behaviors of ^(23)Na can be quantitatively described by constructing the spectral density functions based on the second-order perturbation theory.In addition,we summarize the applications of ^(23)Na relaxometry in different research fields,including biomedicine,sodium ion batteries,and quantum information processing.Because sodium is an essential element in our body,food and industrial materials,the research on sodium by ^(23)Na NMR emerges as important future directions.The theoretical and practical understandings on ^(23)Na relaxation are the step stones for mastering advanced ^(23)Na NMR techniques.
文摘As the importance of email increases,the amount of malicious email is also increasing,so the need for malicious email filtering is growing.Since it is more economical to combine commodity hardware consisting of a medium server or PC with a virtual environment to use as a single server resource and filter malicious email using machine learning techniques,we used a Hadoop MapReduce framework and Naïve Bayes among machine learning methods for malicious email filtering.Naïve Bayes was selected because it is one of the top machine learning methods(Support Vector Machine(SVM),Naïve Bayes,K-Nearest Neighbor(KNN),and Decision Tree)in terms of execution time and accuracy.Malicious email was filtered with MapReduce programming using the Naïve Bayes technique,which is a supervised machine learning method,in a Hadoop framework with optimized performance and also with the Python program technique with the Naïve Bayes technique applied in a bare metal server environment with the Hadoop environment not applied.According to the results of a comparison of the accuracy and predictive error rates of the two methods,the Hadoop MapReduce Naïve Bayes method improved the accuracy of spam and ham email identification 1.11 times and the prediction error rate 14.13 times compared to the non-Hadoop Python Naïve Bayes method.
基金Project(20050003042) supported by Research Fund for the Doctoral Program of Higher Education of China
文摘The differences of grain-refining effect between Sc and Ti additions in aluminum,which cannot be substantially explained by traditional theories,were carefully studied.The empirical electron theory(EET) of solids and molecules was employed to calculate the valence electron structures(VES) of Al3Ti and Al3Sc.The conclusions can be drawn that,in the two alloys Al-Ti and Al-Sc,the different valence electron structures of Al3Ti and Al3Sc and the consequent differences of growth habit of the two particles,and the different interfacial electron density between particles and matrix fundamentally lead to the differences of grain-refining effect between Sc and Ti additions on aluminum and make Sr the better grain-refiner of aluminum.
文摘The naïve Bayes classifier is one of the commonly used data mining methods for classification.Despite its simplicity,naïve Bayes is effective and computationally efficient.Although the strong attribute independence assumption in the naïve Bayes classifier makes it a tractable method for learning,this assumption may not hold in real-world applications.Many enhancements to the basic algorithm have been proposed in order to alleviate the violation of attribute independence assumption.While these methods improve the classification performance,they do not necessarily retain the mathematical structure of the naïve Bayes model and some at the expense of computational time.One approach to reduce the naïvetéof the classifier is to incorporate attribute weights in the conditional probability.In this paper,we proposed a method to incorporate attribute weights to naïve Bayes.To evaluate the performance of our method,we used the public benchmark datasets.We compared our method with the standard naïve Bayes and baseline attribute weighting methods.Experimental results show that our method to incorporate attribute weights improves the classification performance compared to both standard naïve Bayes and baseline attribute weighting methods in terms of classification accuracy and F1,especially when the independence assumption is strongly violated,which was validated using the Chi-square test of independence.
基金Project supported by the Doctoral Education Fund of the Education Ministry of Chain (Grant No. 20100181110086) and the National Natural Science Foundation of China (Grant Nos. 11104190 and 10974138).
文摘The equilibrium geometries, relative stabilities, and electronic properties of MnAgm(M=Na, Li; n + m ≤ 7) as well as pure Agn, Nan, Lin (n ≤ 7) clusters are systematically investigated by means of the density functional theory. The optimized geometries reveal that for 2 ≤ n ≤ 7, there are significant similarities in geometry among pure Agn, Nan, and Lin clusters, and the transitions from planar to three-dimensional configurations occur at n = 7, 7, and 6, respectively. In contrast, the first three-dimensional (3D) structures are observed at n + m = 5 for both NanAgm and LinAgm clusters. When n + m ≥5, a striking feature is that the trigonal bipyramid becomes the main subunit of LinAgm. Furthermore, dramatic odd-even alternative behaviours are obtained in the fragmentation energies, secondorder difference energies, highest occupied and lowest unoccupied molecular orbital energy gaps, and chemical hardness for both pure and doped clusters. The analytic results exhibit that clusters with an even electronic configuration (2, 4, 6) possess the weakest chemical reactivity and more enhanced stability.
文摘<span style="font-family:Verdana;">The presence of bearing faults reduces the efficiency of rotating machines and thus increases energy consumption or even the total stoppage of the machine. </span><span style="font-family:Verdana;">It becomes essential to correctly diagnose the fault caused by the bearing.</span><span style="font-family:Verdana;"> Hence the importance of determining an effective features extraction method that best describes the fault. The vision of this paper is to merge the features selection methods in order to define the most relevant featuresin the texture </span><span style="font-family:Verdana;">of the vibration signal images. In this study, the Gray Level Co-occurrence </span><span style="font-family:Verdana;">Matrix (GLCM) in texture analysis is applied on the vibration signal represented in images. Features</span><span><span><span style="font-family:;" "=""> </span></span></span><span><span><span style="font-family:;" "=""><span style="font-family:Verdana;">selection based on the merge of PCA (Principal component Analysis) method and SFE (Sequential Features Extraction) method is </span><span style="font-family:Verdana;">done to obtain the most relevant features. The multiclass-Na<span style="white-space:nowrap;">?</span>ve Bayesclassifi</span><span style="font-family:Verdana;">er is used to test the proposed approach. The success rate of this classification is 98.27%. The relevant features obtained give promising results and are more efficient than the methods observed in the literature.</span></span></span></span>
基金Project supported by the Jiangxi Provincial Natural Science Foundation,China(Grant No.20122BAB216007)the Foundation of Jiangxi Provincial Educational Committee,China(Grant No.GJJ14648)
文摘The electronic structures, Born effective charges(BECs), and full phonon dispersions of cubic, tetragonal, orthorhombic, and rhombohedral K0.5Na0.5Nb O3 are investigated by the first principles method based on density functional theory.The hybridized states of Nb 4d and O 2p states are observed in the valence band, showing the formation of a strong Nb–O covalent bond which should be responsible for the displacement of Nb and O atoms. The abnormally large BECs of Nb and O indicate the possibility of phase instability induced by the off-center displacement of Nb and O atoms. The phonon dispersions reveal that the ferroelectric instability of K0.5Na0.5Nb O3 is dominated by Nb and O displacements with significant Na characteristics. In addition to the ferroelectric instability, there is also rotational instability coming from the oxygen octahedra rotation around one axis. Moreover, the Γ phonon properties of orthorhombic KNb O3, Na Nb O3, and K0.5Na0.5Nb O3 are also studied in detail.
文摘AIM: To characterize the prevalence of subpopulations of CD4+ cells along with that of major inhibitor or stimulator cell types in therapy-nave childhood Crohn's disease (CD) and to test whether abnormalities of immune phenotype are normalized with the improvement of clinical signs and symptoms of disease. METHODS: We enrolled 26 pediatric patients with CD. 14 therapy-nave CD children; of those, 10 children remitted on conventional therapy and formed the remission group. We also tested another group of 12 chil-dren who relapsed with conventional therapy and were given infliximab; and 15 healthy children who served as controls. The prevalence of Th1 and Th2, nave and memory, activated and regulatory T cells, along with the members of innate immunity such as natural killer (NK), NK-T, myeloid and plasmocytoid dendritic cells (DCs), monocytes and Toll-like receptor (TLR)-2 and TLR-4 expression were determined in peripheral blood samples. RESULTS: Children with therapy-nave CD and those in relapse showed a decrease in Th1 cell prevalence. Simultaneously, an increased prevalence of memory and activated lymphocytes along with that of DCs and monocytes was observed. In addition, the ratio of myeloid /plasmocytoid DCs and the prevalence of TLR-2 or TLR-4 positive DCs and monocytes were also higher in therapy-nave CD than in controls. The majority of alterations diminished in remitted CD irrespective of whether remission was obtained by conventional or biological therapy. CONCLUSION: The finding that immune phenotype is normalized in remission suggests a link between immune phenotype and disease activity in childhood CD. Our observations support the involvement of members of the adaptive and innate immune systems in childhood CD.
文摘This paper seeks to identify the minimal restrictions that need to be placed on the naive comprehension principle to avoid inconsistency in set theory. Analysis of the logical antinomies shows that at the root of inconsistency in naive set theory are certain "self contradictory" predicate functions in extensional set descriptions containing the matrix "-(x∈y)" (or "-(x∈x)") rather than "size," vicious circularity, or self-reference. A reformed set comprehension system is proposed that excludes extensional set descriptions that conform to the formula, (Vx) (Зy) (x∈y →P (x)) (3u) (u∈y→(u∈y)), from comprehension and otherwise preserves the ontology of na'fve set theory. This reform avoids the paradoxes by scrutiny of a set's description without recourse to type or other constructivist limitations on self-membership and has the most liberal rules for set formation conceivable including self-membership. The intuitive appeal for such an approach is compelling because as a revision of na'fve set theory, it allows all possible set descriptions that do not lead to inconsistency.
文摘Intrusion detection is the investigation process of information about the system activities or its data to detect any malicious behavior or unauthorized activity.Most of the IDS implement K-means clustering technique due to its linear complexity and fast computing ability.Nonetheless,it is Naïve use of the mean data value for the cluster core that presents a major drawback.The chances of two circular clusters having different radius and centering at the same mean will occur.This condition cannot be addressed by the K-means algorithm because the mean value of the various clusters is very similar together.However,if the clusters are not spherical,it fails.To overcome this issue,a new integrated hybrid model by integrating expectation maximizing(EM)clustering using a Gaussian mixture model(GMM)and naïve Bays classifier have been proposed.In this model,GMM give more flexibility than K-Means in terms of cluster covariance.Also,they use probabilities function and soft clustering,that’s why they can have multiple cluster for a single data.In GMM,we can define the cluster form in GMM by two parameters:the mean and the standard deviation.This means that by using these two parameters,the cluster can take any kind of elliptical shape.EM-GMM will be used to cluster data based on data activity into the corresponding category.
基金Financially supported by the National Natural Science Foundation of China(No.21267014 and 21567017)
文摘The electronic structure of Eu-doped NaTaO3 in Na-rich environment is investigated by the first-principles theory. By simulating the two different models of Eu3+ ions selectively located in Ta and Na sites, respectively, the band gaps of two Eu-doped NaTaO3 models were all narrowed, which were assigned to lattice defects and impurity band of the Eu dopent. For the model of Eu3+ ions located in the Na+ sites of NaTaO3, the new impurity band mainly composited of Eu 4f orbital appeared at the top over the valence band, indicating the enhanced oxidative ability. For the model of Eu3+ ions located in the Ta5+ sites of NaTaO3, a midgap state generated was located at the bottom of conduct band and the band potential shifted up, confirming the strong reductive ability in the Na-rich enviornment. The densities of electron states were significantly increased in both the conduction and valence bands in Na-rich model, which resulted in the increased carrier migration rate and thus photocatalytic activity enhancement. It is proposed that Eu3+ ions doping at the Ta sites could enhance the reduced photocatalytic performance via controlling the nonstoichiometric Na/Ta molar ratio in the Eu-doped NaTaO3 system.
文摘This paper proposed an improved Naïve Bayes Classifier for sentimental analysis from a large-scale dataset such as in YouTube.YouTube contains large unstructured and unorganized comments and reactions,which carry important information.Organizing large amounts of data and extracting useful information is a challenging task.The extracted information can be considered as new knowledge and can be used for deci sion-making.We extract comments from YouTube on videos and categorized them in domain-specific,and then apply the Naïve Bayes classifier with improved techniques.Our method provided a decent 80%accuracy in classifying those comments.This experiment shows that the proposed method provides excellent adaptability for large-scale text classification.
文摘Social media networks are becoming essential to our daily activities,and many issues are due to this great involvement in our lives.Cyberbullying is a social media network issue,a global crisis affecting the victims and society as a whole.It results from a misunderstanding regarding freedom of speech.In this work,we proposed a methodology for detecting such behaviors(bullying,harassment,and hate-related texts)using supervised machine learning algo-rithms(SVM,Naïve Bayes,Logistic regression,and random forest)and for predicting a topic associated with these text data using unsupervised natural language processing,such as latent Dirichlet allocation.In addition,we used accuracy,precision,recall,and F1 score to assess prior classifiers.Results show that the use of logistic regression,support vector machine,random forest model,and Naïve Bayes has 95%,94.97%,94.66%,and 93.1%accuracy,respectively.