In this paper, based on hourly precipitation observations in 1977e2013 in the Beijing area, China, hourly precipitation in summer (June?August) is classified into three categories: light (below the 50th percentile val...In this paper, based on hourly precipitation observations in 1977e2013 in the Beijing area, China, hourly precipitation in summer (June?August) is classified into three categories: light (below the 50th percentile values), moderate (the 50th to 95th percentile values), and heavy (above the 95th percentile values). Results reveal that both light and moderate precipitation decreased significantly during the research period and thereby caused the decrease in summer totals. By contrast, pronounced trends failed to be detected in the heavy category. Since 2004, the contribution of heavy rainfall to the summer total precipitation in the urban area increased as compared to the suburban area, which is opposite to light rainfall. There are obvious differences in the diurnal variations of classified precipitation. Light precipitation shows a double peak structure in the early morning and at night, while moderate and heavy rainfall show a single peak at night. Light precipitation at the early morning peak time decreased significantly in the whole Beijing area. Compared with the suburban area, light precipitation in the urban area occurred less frequently whereas heavy precipitation occurred more frequently at evening peak time after 2004. The asymmetry of the rainfall is obvious, especially, for heavy precipitation. The asymmetry of heavy precipitation events in the urban area exhibits a significant increasing trend.展开更多
A new remote sensing image coding scheme based on the wavelet transform and classified vector quantization (CVQ) is proposed. The original image is first decomposed into a hierarchy of 3 layers including 10 subimages ...A new remote sensing image coding scheme based on the wavelet transform and classified vector quantization (CVQ) is proposed. The original image is first decomposed into a hierarchy of 3 layers including 10 subimages by DWT. The lowest frequency subimage is compressed by scalar quantization and ADPCM. The high frequency subimages are compressed by CVQ to utilize the similarity among different resolutions while improving the edge quality and reducing computational complexity. The experimental results show that the proposed scheme has a better performance than JPEG, and a PSNR of reconstructed image is 31~33 dB with a rate of 0.2 bpp.展开更多
Feature extraction and selection from signals is a key issue for metal magnetic memory testing technique. In order to realize the classification of metal magnetic memory signals of welding defects, four fractal analys...Feature extraction and selection from signals is a key issue for metal magnetic memory testing technique. In order to realize the classification of metal magnetic memory signals of welding defects, four fractal analysis methods, such as box- counting, detrended fluctuation, minimal cover and rescaled-range analysis, were used to extract the feature signal after the original metal magnet memory signal was de-noising and differential processing, then the Karhunen-Lo^e transformation was adopted as classification tool to identify the defect signals. The result shows that this study can provide an efficient classification method for metal magnetic memory signal of welding defects.展开更多
In order to disclose present situation and problem of classified collection of municipal solid waste in Wanghua District of Fushun and ana- lyze its practicability, questionnaire was designed in this paper, random res...In order to disclose present situation and problem of classified collection of municipal solid waste in Wanghua District of Fushun and ana- lyze its practicability, questionnaire was designed in this paper, random research was adopted in Wanghua District, and statistic analysis of investi- gation result was conducted. This investigation could provide basis for popularizing classified collection of municipal solid waste in the whole nation.展开更多
As the risks associated with air turbulence are intensified by climate change and the growth of the aviation industry,it has become imperative to monitor and mitigate these threats to ensure civil aviation safety.The ...As the risks associated with air turbulence are intensified by climate change and the growth of the aviation industry,it has become imperative to monitor and mitigate these threats to ensure civil aviation safety.The eddy dissipation rate(EDR)has been established as the standard metric for quantifying turbulence in civil aviation.This study aims to explore a universally applicable symbolic classification approach based on genetic programming to detect turbulence anomalies using quick access recorder(QAR)data.The detection of atmospheric turbulence is approached as an anomaly detection problem.Comparative evaluations demonstrate that this approach performs on par with direct EDR calculation methods in identifying turbulence events.Moreover,comparisons with alternative machine learning techniques indicate that the proposed technique is the optimal methodology currently available.In summary,the use of symbolic classification via genetic programming enables accurate turbulence detection from QAR data,comparable to that with established EDR approaches and surpassing that achieved with machine learning algorithms.This finding highlights the potential of integrating symbolic classifiers into turbulence monitoring systems to enhance civil aviation safety amidst rising environmental and operational hazards.展开更多
Protected areas contain most of Burkina Faso’s plant biodiversity which confer different benefits for the communities. However, the composition of some of them remains unknown. In a context of overexploitation and cl...Protected areas contain most of Burkina Faso’s plant biodiversity which confer different benefits for the communities. However, the composition of some of them remains unknown. In a context of overexploitation and climate change, it is important to have a detailed knowledge of the vegetation of forests that have not been studied, such as Péni Classified Forest (PCF) to develop better preservation protocols. The aim of this study is to contribute to the knowledge of the flora of Burkina Faso. Phytosociological surveys were carried out in 213 plots, have identified 475 species distributed in 321 genera and 87 families. We identified during this study 201 woody species representing 38% of the woody flora of Burkina Faso. 64% of this flora is confined to shrub savannahs and 61% to tree savannahs. Among the vegetation units, shrub savannahs and tree savannahs have respectively 56.21% and 44.67% of very rare species. Poaceae (11.90%), Fabaceae-Faboideae (11.27%) and Rubiaceae (6.26%) are the most dominant families. The dominant biological types of the flora are phanerophytes (42.32%) and therophytes (30.32%), and Sudanian species (20.63%) are the best represented. Logging is the most frequent disturbance factor (100%) in the PCF. The PCF is a particular ecosystem with a great diversity but subject to many disturbances. Actions to strengthen its protection are necessary.展开更多
Over the past years,with the increasing enrollment of high school,vocational schools are facing great challenge for their existence and development,concerning the low proficiency of the students and great gap among th...Over the past years,with the increasing enrollment of high school,vocational schools are facing great challenge for their existence and development,concerning the low proficiency of the students and great gap among them.The traditional English teaching mode which employs the same teaching contents,same teaching methods and teaching aims cannot satisfy students with different English levels.Therefore,in order to change the present situation,this paper proposes a new English teaching mode:classified English teaching.In the new mode,different students will be taught by different materials,different methods and with different aims.It can stimulate students'enthusiasm in English learning,and make every student develop appropriately.展开更多
Based on the data of the cases of severe convection weather such as hail,thunderstorm(thunderstorm gale)and short-time heavy precipitation in recent 10 years,the spatial and temporal distribution characteristics of di...Based on the data of the cases of severe convection weather such as hail,thunderstorm(thunderstorm gale)and short-time heavy precipitation in recent 10 years,the spatial and temporal distribution characteristics of different types of severe convection weather were analyzed.The results show that the frequency of severe convection weather tended to increase,of which short-time heavy precipitation and thunderstorm weather rose,and hail and thunderstorm gale weather decreased.Severe convection weather began to extend in late spring and early autumn.Typical cases were selected to analyze the evolution mechanism,and the conceptual models of severe convective weather caused by cold advection forcing,warm advection forcing and baroclinic frontogenesis were obtained.The key predictors for the potential prediction of severe convection weather were proposed,such as CAPE(convective available potential energy)for hail weather,UH index(maximum ascending helicity)for thunderstorm gale and PWV(precipitable water vapor)for short-time heavy precipitation.ERA5 data were used to get the forecast threshold of the key factor of classified severe convection weather,and it was verified that the threshold was available.Meanwhile,the causes of the error of failure cases were analyzed.For instance,the larger deviation of CAPE was caused by the 2 m deviation of temperature.Supplementary correction method and threshold were given to provide a reference for the objective forecast and early warning of severe convection weather.展开更多
The key objective of intrusion detection systems(IDS)is to protect the particular host or network by investigating and predicting the network traffic as an attack or normal.These IDS uses many methods of machine learn...The key objective of intrusion detection systems(IDS)is to protect the particular host or network by investigating and predicting the network traffic as an attack or normal.These IDS uses many methods of machine learning(ML)to learn from pastexperience attack i.e.signatures based and identify the new ones.Even though these methods are effective,but they have to suffer from large computational costs due to considering all the traffic features,together.Moreover,emerging technologies like the Internet of Things(Io T),big data,etc.are getting advanced day by day;as a result,network traffics are also increasing rapidly.Therefore,the issue of computational cost needs to be addressed properly.Thus,in this research,firstly,the ML methods have been used with the feature selection technique(FST)to reduce the number of features by picking out only the important ones from NSL-KDD,CICIDS2017,and CIC-DDo S2019datasets later that helped to build IDSs with lower cost but with the higher performance which would be appropriate for vast scale network.The experimental result demonstrated that the proposed model i.e.Decision tree(DT)with Recursive feature elimination(RFE)performs better than other classifiers with RFE in terms of accuracy,specificity,precision,sensitivity,F1-score,and G-means on the investigated datasets.展开更多
Cross entropy is a measure in machine learning and deep learning that assesses the difference between predicted and actual probability distributions. In this study, we propose cross entropy as a performance evaluation...Cross entropy is a measure in machine learning and deep learning that assesses the difference between predicted and actual probability distributions. In this study, we propose cross entropy as a performance evaluation metric for image classifier models and apply it to the CT image classification of lung cancer. A convolutional neural network is employed as the deep neural network (DNN) image classifier, with the residual network (ResNet) 50 chosen as the DNN archi-tecture. The image data used comprise a lung CT image set. Two classification models are built from datasets with varying amounts of data, and lung cancer is categorized into four classes using 10-fold cross-validation. Furthermore, we employ t-distributed stochastic neighbor embedding to visually explain the data distribution after classification. Experimental results demonstrate that cross en-tropy is a highly useful metric for evaluating the reliability of image classifier models. It is noted that for a more comprehensive evaluation of model perfor-mance, combining with other evaluation metrics is considered essential. .展开更多
The Internet of Things(IoT)is a growing technology that allows the sharing of data with other devices across wireless networks.Specifically,IoT systems are vulnerable to cyberattacks due to its opennes The proposed wo...The Internet of Things(IoT)is a growing technology that allows the sharing of data with other devices across wireless networks.Specifically,IoT systems are vulnerable to cyberattacks due to its opennes The proposed work intends to implement a new security framework for detecting the most specific and harmful intrusions in IoT networks.In this framework,a Covariance Linear Learning Embedding Selection(CL2ES)methodology is used at first to extract the features highly associated with the IoT intrusions.Then,the Kernel Distributed Bayes Classifier(KDBC)is created to forecast attacks based on the probability distribution value precisely.In addition,a unique Mongolian Gazellas Optimization(MGO)algorithm is used to optimize the weight value for the learning of the classifier.The effectiveness of the proposed CL2ES-KDBC framework has been assessed using several IoT cyber-attack datasets,The obtained results are then compared with current classification methods regarding accuracy(97%),precision(96.5%),and other factors.Computational analysis of the CL2ES-KDBC system on IoT intrusion datasets is performed,which provides valuable insight into its performance,efficiency,and suitability for securing IoT networks.展开更多
In order to handle the semi-supervised problem quickly and efficiently in the twin support vector machine (TWSVM) field, a semi-supervised twin support vector machine (S2TSVM) is proposed by adding the original unlabe...In order to handle the semi-supervised problem quickly and efficiently in the twin support vector machine (TWSVM) field, a semi-supervised twin support vector machine (S2TSVM) is proposed by adding the original unlabeled samples. In S2TSVM, the addition of unlabeled samples can easily cause the classification hyper plane to deviate from the sample points. Then a centerdistance principle is proposed to pre-classify unlabeled samples, and a pre-classified S2TSVM (PS2TSVM) is proposed. Compared with S2TSVM, PS2TSVM not only improves the problem of the samples deviating from the classification hyper plane, but also improves the training speed. Then PS2TSVM is smoothed. After smoothing the model, the pre-classified smooth S2TSVM (PS3TSVM) is obtained, and its convergence is deduced. Finally, nine datasets are selected in the UCI machine learning database for comparison with other types of semi-supervised models. The experimental results show that the proposed PS3TSVM model has better classification results.展开更多
In the process of Higher Vocational classified examination enrollment reform,Jilin Province has adopted a diversified examination enrollment model and“cultural quality test+vocational skill test”evaluation method,an...In the process of Higher Vocational classified examination enrollment reform,Jilin Province has adopted a diversified examination enrollment model and“cultural quality test+vocational skill test”evaluation method,and established the“vocational education college entrance examination”system.This paper analyzes the important role and practical difficulties of“vocational skill test”in Higher Vocational classified examination,studies the existing problems,and puts forward to reasonably divide the proportion of“cultural quality test”and“vocational skill test”,sets diversified admission standards,scientifically sets up the assessment methods and contents of“vocational skill test”,further improves the“cultural quality test+vocational skill test”evaluation method and builds a classified examination and enrollment system more in line with the characteristics of vocational education.展开更多
The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orient...The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orientation detection.Political articles(especially in the Arab world)are different from other articles due to their subjectivity,in which the author’s beliefs and political affiliation might have a significant influence on a political article.With categories representing the main political ideologies,this problem may be thought of as a subset of the text categorization(classification).In general,the performance of machine learning models for text classification is sensitive to hyperparameter settings.Furthermore,the feature vector used to represent a document must capture,to some extent,the complex semantics of natural language.To this end,this paper presents an intelligent system to detect political Arabic article orientation that adapts the categorical boosting(CatBoost)method combined with a multi-level feature concept.Extracting features at multiple levels can enhance the model’s ability to discriminate between different classes or patterns.Each level may capture different aspects of the input data,contributing to a more comprehensive representation.CatBoost,a robust and efficient gradient-boosting algorithm,is utilized to effectively learn and predict the complex relationships between these features and the political orientation labels associated with the articles.A dataset of political Arabic texts collected from diverse sources,including postings and articles,is used to assess the suggested technique.Conservative,reform,and revolutionary are the three subcategories of these opinions.The results of this study demonstrate that compared to other frequently used machine learning models for text classification,the CatBoost method using multi-level features performs better with an accuracy of 98.14%.展开更多
Breast cancer is a deadly disease and radiologists recommend mammography to detect it at the early stages. This paper presents two types of HanmanNets using the information set concept for the derivation of deep infor...Breast cancer is a deadly disease and radiologists recommend mammography to detect it at the early stages. This paper presents two types of HanmanNets using the information set concept for the derivation of deep information set features from ResNet by modifying its kernel functions to yield Type-1 HanmanNets and then AlexNet, GoogLeNet and VGG-16 by changing their feature maps to yield Type-2 HanmanNets. The two types of HanmanNets exploit the final feature maps of these architectures in the generation of deep information set features from mammograms for their classification using the Hanman Transform Classifier. In this work, the characteristics of the abnormality present in the mammograms are captured using the above network architectures that help derive the features of HanmanNets based on information set concept and their performance is compared via the classification accuracies. The highest accuracy of 100% is achieved for the multi-class classifications on the mini-MIAS database thus surpassing the results in the literature. Validation of the results is done by the expert radiologists to show their clinical relevance.展开更多
基金This work was supported by the National Natural Science Foundation of China (41575094) and Special Scientific Research Fund of Meteorological Public Welfare Profession of China (GYHY201506014).
文摘In this paper, based on hourly precipitation observations in 1977e2013 in the Beijing area, China, hourly precipitation in summer (June?August) is classified into three categories: light (below the 50th percentile values), moderate (the 50th to 95th percentile values), and heavy (above the 95th percentile values). Results reveal that both light and moderate precipitation decreased significantly during the research period and thereby caused the decrease in summer totals. By contrast, pronounced trends failed to be detected in the heavy category. Since 2004, the contribution of heavy rainfall to the summer total precipitation in the urban area increased as compared to the suburban area, which is opposite to light rainfall. There are obvious differences in the diurnal variations of classified precipitation. Light precipitation shows a double peak structure in the early morning and at night, while moderate and heavy rainfall show a single peak at night. Light precipitation at the early morning peak time decreased significantly in the whole Beijing area. Compared with the suburban area, light precipitation in the urban area occurred less frequently whereas heavy precipitation occurred more frequently at evening peak time after 2004. The asymmetry of the rainfall is obvious, especially, for heavy precipitation. The asymmetry of heavy precipitation events in the urban area exhibits a significant increasing trend.
文摘A new remote sensing image coding scheme based on the wavelet transform and classified vector quantization (CVQ) is proposed. The original image is first decomposed into a hierarchy of 3 layers including 10 subimages by DWT. The lowest frequency subimage is compressed by scalar quantization and ADPCM. The high frequency subimages are compressed by CVQ to utilize the similarity among different resolutions while improving the edge quality and reducing computational complexity. The experimental results show that the proposed scheme has a better performance than JPEG, and a PSNR of reconstructed image is 31~33 dB with a rate of 0.2 bpp.
基金This work was supported by Tianjin Natural Science Foundation (No. 11JCYBJC06000) and Specialized Research Fund for the Doctoral Program of Higher Education of China (No. 20100032120019).
文摘Feature extraction and selection from signals is a key issue for metal magnetic memory testing technique. In order to realize the classification of metal magnetic memory signals of welding defects, four fractal analysis methods, such as box- counting, detrended fluctuation, minimal cover and rescaled-range analysis, were used to extract the feature signal after the original metal magnet memory signal was de-noising and differential processing, then the Karhunen-Lo^e transformation was adopted as classification tool to identify the defect signals. The result shows that this study can provide an efficient classification method for metal magnetic memory signal of welding defects.
文摘In order to disclose present situation and problem of classified collection of municipal solid waste in Wanghua District of Fushun and ana- lyze its practicability, questionnaire was designed in this paper, random research was adopted in Wanghua District, and statistic analysis of investi- gation result was conducted. This investigation could provide basis for popularizing classified collection of municipal solid waste in the whole nation.
基金supported by the Meteorological Soft Science Project(Grant No.2023ZZXM29)the Natural Science Fund Project of Tianjin,China(Grant No.21JCYBJC00740)the Key Research and Development-Social Development Program of Jiangsu Province,China(Grant No.BE2021685).
文摘As the risks associated with air turbulence are intensified by climate change and the growth of the aviation industry,it has become imperative to monitor and mitigate these threats to ensure civil aviation safety.The eddy dissipation rate(EDR)has been established as the standard metric for quantifying turbulence in civil aviation.This study aims to explore a universally applicable symbolic classification approach based on genetic programming to detect turbulence anomalies using quick access recorder(QAR)data.The detection of atmospheric turbulence is approached as an anomaly detection problem.Comparative evaluations demonstrate that this approach performs on par with direct EDR calculation methods in identifying turbulence events.Moreover,comparisons with alternative machine learning techniques indicate that the proposed technique is the optimal methodology currently available.In summary,the use of symbolic classification via genetic programming enables accurate turbulence detection from QAR data,comparable to that with established EDR approaches and surpassing that achieved with machine learning algorithms.This finding highlights the potential of integrating symbolic classifiers into turbulence monitoring systems to enhance civil aviation safety amidst rising environmental and operational hazards.
文摘Protected areas contain most of Burkina Faso’s plant biodiversity which confer different benefits for the communities. However, the composition of some of them remains unknown. In a context of overexploitation and climate change, it is important to have a detailed knowledge of the vegetation of forests that have not been studied, such as Péni Classified Forest (PCF) to develop better preservation protocols. The aim of this study is to contribute to the knowledge of the flora of Burkina Faso. Phytosociological surveys were carried out in 213 plots, have identified 475 species distributed in 321 genera and 87 families. We identified during this study 201 woody species representing 38% of the woody flora of Burkina Faso. 64% of this flora is confined to shrub savannahs and 61% to tree savannahs. Among the vegetation units, shrub savannahs and tree savannahs have respectively 56.21% and 44.67% of very rare species. Poaceae (11.90%), Fabaceae-Faboideae (11.27%) and Rubiaceae (6.26%) are the most dominant families. The dominant biological types of the flora are phanerophytes (42.32%) and therophytes (30.32%), and Sudanian species (20.63%) are the best represented. Logging is the most frequent disturbance factor (100%) in the PCF. The PCF is a particular ecosystem with a great diversity but subject to many disturbances. Actions to strengthen its protection are necessary.
文摘Over the past years,with the increasing enrollment of high school,vocational schools are facing great challenge for their existence and development,concerning the low proficiency of the students and great gap among them.The traditional English teaching mode which employs the same teaching contents,same teaching methods and teaching aims cannot satisfy students with different English levels.Therefore,in order to change the present situation,this paper proposes a new English teaching mode:classified English teaching.In the new mode,different students will be taught by different materials,different methods and with different aims.It can stimulate students'enthusiasm in English learning,and make every student develop appropriately.
基金Supported by the Open-end Funds of Key Laboratory for Disaster Prevention and Mitigation of Qinghai Province(QFZ-2021-Z04)。
文摘Based on the data of the cases of severe convection weather such as hail,thunderstorm(thunderstorm gale)and short-time heavy precipitation in recent 10 years,the spatial and temporal distribution characteristics of different types of severe convection weather were analyzed.The results show that the frequency of severe convection weather tended to increase,of which short-time heavy precipitation and thunderstorm weather rose,and hail and thunderstorm gale weather decreased.Severe convection weather began to extend in late spring and early autumn.Typical cases were selected to analyze the evolution mechanism,and the conceptual models of severe convective weather caused by cold advection forcing,warm advection forcing and baroclinic frontogenesis were obtained.The key predictors for the potential prediction of severe convection weather were proposed,such as CAPE(convective available potential energy)for hail weather,UH index(maximum ascending helicity)for thunderstorm gale and PWV(precipitable water vapor)for short-time heavy precipitation.ERA5 data were used to get the forecast threshold of the key factor of classified severe convection weather,and it was verified that the threshold was available.Meanwhile,the causes of the error of failure cases were analyzed.For instance,the larger deviation of CAPE was caused by the 2 m deviation of temperature.Supplementary correction method and threshold were given to provide a reference for the objective forecast and early warning of severe convection weather.
文摘The key objective of intrusion detection systems(IDS)is to protect the particular host or network by investigating and predicting the network traffic as an attack or normal.These IDS uses many methods of machine learning(ML)to learn from pastexperience attack i.e.signatures based and identify the new ones.Even though these methods are effective,but they have to suffer from large computational costs due to considering all the traffic features,together.Moreover,emerging technologies like the Internet of Things(Io T),big data,etc.are getting advanced day by day;as a result,network traffics are also increasing rapidly.Therefore,the issue of computational cost needs to be addressed properly.Thus,in this research,firstly,the ML methods have been used with the feature selection technique(FST)to reduce the number of features by picking out only the important ones from NSL-KDD,CICIDS2017,and CIC-DDo S2019datasets later that helped to build IDSs with lower cost but with the higher performance which would be appropriate for vast scale network.The experimental result demonstrated that the proposed model i.e.Decision tree(DT)with Recursive feature elimination(RFE)performs better than other classifiers with RFE in terms of accuracy,specificity,precision,sensitivity,F1-score,and G-means on the investigated datasets.
文摘Cross entropy is a measure in machine learning and deep learning that assesses the difference between predicted and actual probability distributions. In this study, we propose cross entropy as a performance evaluation metric for image classifier models and apply it to the CT image classification of lung cancer. A convolutional neural network is employed as the deep neural network (DNN) image classifier, with the residual network (ResNet) 50 chosen as the DNN archi-tecture. The image data used comprise a lung CT image set. Two classification models are built from datasets with varying amounts of data, and lung cancer is categorized into four classes using 10-fold cross-validation. Furthermore, we employ t-distributed stochastic neighbor embedding to visually explain the data distribution after classification. Experimental results demonstrate that cross en-tropy is a highly useful metric for evaluating the reliability of image classifier models. It is noted that for a more comprehensive evaluation of model perfor-mance, combining with other evaluation metrics is considered essential. .
文摘The Internet of Things(IoT)is a growing technology that allows the sharing of data with other devices across wireless networks.Specifically,IoT systems are vulnerable to cyberattacks due to its opennes The proposed work intends to implement a new security framework for detecting the most specific and harmful intrusions in IoT networks.In this framework,a Covariance Linear Learning Embedding Selection(CL2ES)methodology is used at first to extract the features highly associated with the IoT intrusions.Then,the Kernel Distributed Bayes Classifier(KDBC)is created to forecast attacks based on the probability distribution value precisely.In addition,a unique Mongolian Gazellas Optimization(MGO)algorithm is used to optimize the weight value for the learning of the classifier.The effectiveness of the proposed CL2ES-KDBC framework has been assessed using several IoT cyber-attack datasets,The obtained results are then compared with current classification methods regarding accuracy(97%),precision(96.5%),and other factors.Computational analysis of the CL2ES-KDBC system on IoT intrusion datasets is performed,which provides valuable insight into its performance,efficiency,and suitability for securing IoT networks.
基金supported by the Fundamental Research Funds for University of Science and Technology Beijing(FRF-BR-12-021)
文摘In order to handle the semi-supervised problem quickly and efficiently in the twin support vector machine (TWSVM) field, a semi-supervised twin support vector machine (S2TSVM) is proposed by adding the original unlabeled samples. In S2TSVM, the addition of unlabeled samples can easily cause the classification hyper plane to deviate from the sample points. Then a centerdistance principle is proposed to pre-classify unlabeled samples, and a pre-classified S2TSVM (PS2TSVM) is proposed. Compared with S2TSVM, PS2TSVM not only improves the problem of the samples deviating from the classification hyper plane, but also improves the training speed. Then PS2TSVM is smoothed. After smoothing the model, the pre-classified smooth S2TSVM (PS3TSVM) is obtained, and its convergence is deduced. Finally, nine datasets are selected in the UCI machine learning database for comparison with other types of semi-supervised models. The experimental results show that the proposed PS3TSVM model has better classification results.
基金This work was supported by the Social Science Project of the 13th Five-Year Plan of Jilin Provincial Department of Education under Grant no.JJKH20200635SKthe 2019 Vocational Education and Adult Education Teaching Reform Research Project of Jilin Provincial Department of Education under Grant nos.2019ZCZ067,2019ZCY413 and 2019ZCY414.
文摘In the process of Higher Vocational classified examination enrollment reform,Jilin Province has adopted a diversified examination enrollment model and“cultural quality test+vocational skill test”evaluation method,and established the“vocational education college entrance examination”system.This paper analyzes the important role and practical difficulties of“vocational skill test”in Higher Vocational classified examination,studies the existing problems,and puts forward to reasonably divide the proportion of“cultural quality test”and“vocational skill test”,sets diversified admission standards,scientifically sets up the assessment methods and contents of“vocational skill test”,further improves the“cultural quality test+vocational skill test”evaluation method and builds a classified examination and enrollment system more in line with the characteristics of vocational education.
文摘The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orientation detection.Political articles(especially in the Arab world)are different from other articles due to their subjectivity,in which the author’s beliefs and political affiliation might have a significant influence on a political article.With categories representing the main political ideologies,this problem may be thought of as a subset of the text categorization(classification).In general,the performance of machine learning models for text classification is sensitive to hyperparameter settings.Furthermore,the feature vector used to represent a document must capture,to some extent,the complex semantics of natural language.To this end,this paper presents an intelligent system to detect political Arabic article orientation that adapts the categorical boosting(CatBoost)method combined with a multi-level feature concept.Extracting features at multiple levels can enhance the model’s ability to discriminate between different classes or patterns.Each level may capture different aspects of the input data,contributing to a more comprehensive representation.CatBoost,a robust and efficient gradient-boosting algorithm,is utilized to effectively learn and predict the complex relationships between these features and the political orientation labels associated with the articles.A dataset of political Arabic texts collected from diverse sources,including postings and articles,is used to assess the suggested technique.Conservative,reform,and revolutionary are the three subcategories of these opinions.The results of this study demonstrate that compared to other frequently used machine learning models for text classification,the CatBoost method using multi-level features performs better with an accuracy of 98.14%.
文摘Breast cancer is a deadly disease and radiologists recommend mammography to detect it at the early stages. This paper presents two types of HanmanNets using the information set concept for the derivation of deep information set features from ResNet by modifying its kernel functions to yield Type-1 HanmanNets and then AlexNet, GoogLeNet and VGG-16 by changing their feature maps to yield Type-2 HanmanNets. The two types of HanmanNets exploit the final feature maps of these architectures in the generation of deep information set features from mammograms for their classification using the Hanman Transform Classifier. In this work, the characteristics of the abnormality present in the mammograms are captured using the above network architectures that help derive the features of HanmanNets based on information set concept and their performance is compared via the classification accuracies. The highest accuracy of 100% is achieved for the multi-class classifications on the mini-MIAS database thus surpassing the results in the literature. Validation of the results is done by the expert radiologists to show their clinical relevance.