12 articles found
An Efficient Smooth Quantile Boost Algorithm for Binary Classification
1
Authors: Zhefeng Wang, Wanzhou Ye. Advances in Pure Mathematics, 2016, Issue 9, pp. 615-624 (10 pages)
In this paper, we propose a Smooth Quantile Boost Classification (SQBC) algorithm for binary classification problems. The SQBC algorithm directly uses a smooth function to approximate the “check function” of quantile regression. Compared to other boost-based classification algorithms, the proposed algorithm is more accurate, more flexible, and more robust to noisy predictors. Furthermore, the SQBC algorithm can also work well in high-dimensional spaces. Extensive numerical experiments show that the proposed method performs better on both randomly simulated data and real data.
Keywords: Boosting; Quantile Regression; Smooth Check Function; binary classification
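The abstract does not state which smooth function SQBC uses. As a minimal sketch, assuming one common surrogate, S(u) = τu + α·log(1 + exp(−u/α)) (an assumption, not necessarily the authors' choice), the code below compares it with the check function ρ_τ(u) = u·(τ − 1[u < 0]). The names `check_loss`, `smooth_check_loss`, and `alpha` are illustrative only.

```python
import math


def check_loss(u: float, tau: float) -> float:
    """Quantile-regression check function: u * (tau - 1[u < 0])."""
    return u * (tau - (1.0 if u < 0 else 0.0))


def smooth_check_loss(u: float, tau: float, alpha: float = 0.1) -> float:
    """Assumed smooth surrogate: tau*u + alpha*log(1 + exp(-u/alpha)).

    The softplus term is evaluated in a numerically stable form so that
    large |u| / alpha does not overflow.
    """
    x = -u / alpha
    softplus = max(x, 0.0) + math.log1p(math.exp(-abs(x)))
    return tau * u + alpha * softplus
```

Under this assumption the surrogate is differentiable everywhere, never falls below the check function, and exceeds it by at most `alpha * log(2)` (attained at `u = 0`), so a smaller `alpha` gives a tighter but less smooth approximation.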
Quadratic Kernel-Free Least Square Twin Support Vector Machine for Binary Classification Problems (cited by: 2)
2
Authors: Qian-Qian Gao, Yan-Qin Bai, Ya-Ru Zhan. Journal of the Operations Research Society of China (EI, CSCD), 2019, Issue 4, pp. 539-559 (21 pages)
In this paper, a new quadratic kernel-free least square twin support vector machine (QLSTSVM) is proposed for binary classification problems. The advantage of QLSTSVM is that there is no need to select a kernel function and related parameters for nonlinear classification problems. After applying a consensus technique, we adopt the alternating direction method of multipliers to solve the reformulated consensus QLSTSVM directly. To reduce CPU time, the Karush-Kuhn-Tucker (KKT) conditions are also used to solve the QLSTSVM. The performance of QLSTSVM is tested on two artificial datasets and several University of California Irvine (UCI) benchmark datasets. Numerical results indicate that the QLSTSVM may outperform several existing methods for solving twin support vector machines with a Gaussian kernel in terms of classification accuracy and operation time.
Keywords: Twin support vector machine; Quadratic kernel-free; Least square; binary classification
Detection of Angioectasias and Haemorrhages Incorporated into a Multi-Class Classification Tool for the GI Tract Anomalies by Using Binary CNNs
3
Authors: Christos Barbagiannis, Alexios Polydorou, Michail Zervakis, Andreas Polydorou, Eleftheria Sergaki. Journal of Biomedical Science and Engineering, 2021, Issue 12, pp. 402-414 (13 pages)
The proposed deep learning algorithm will be integrated as a binary classifier under the umbrella of a multi-class classification tool to facilitate the automated detection of non-healthy deformities, anatomical landmarks, pathological findings, other anomalies and normal cases by examining medical endoscopic images of the GI tract. Each binary classifier is trained to detect one specific non-healthy condition. The algorithm analyzed in the present work expands the detection ability of this tool by classifying GI tract image snapshots into two classes, depicting haemorrhage and non-haemorrhage states. The proposed algorithm is the result of a collaboration between interdisciplinary specialists in AI and data analysis, computer vision, and gastroenterologists from four university gastroenterology departments of Greek medical schools. The data used are 195 videos (177 from non-healthy cases and 18 from healthy cases) captured with the PillCam® Medtronic device, originating from 195 patients, all diagnosed with different forms of angioectasia, haemorrhages and other diseases at different sites of the gastrointestinal (GI) tract, mainly including cases that are difficult to diagnose. Our AI algorithm is based on a convolutional neural network (CNN) trained on images annotated at image level, using a semantic tag indicating whether or not the image contains angioectasia and haemorrhage traces. At least 22 CNN architectures were created and evaluated, some of which were pre-trained by applying transfer learning on ImageNet data. All the CNN variations were trained on a dataset with a prevalence of 50% and evaluated on unseen data. On test data, the best results were obtained from our CNN architectures that do not use a transfer learning backbone.
On a balanced dataset of non-healthy and healthy images from 39 videos from different patients, the algorithm identified the correct diagnosis with sensitivity 90%, specificity 92%, precision 91.8%, FPR 8%, and FNR 10%. In addition, we compared the performance of our best CNN algorithm with that of our algorithm for the same goal based on HSV colorimetric lesion features extracted from pixel-level annotations, with both algorithms trained and tested on the same data. The evaluation shows that the CNN trained on image-level annotated images is 9% less sensitive and achieves 2.6% less precision, 1.2% less FPR, and 7% less FNR than the algorithm based on HSV filters extracted from pixel-level annotated training data.
Keywords: Capsule Endoscopy (CE); Small Bowel Bleeding (SBB); Angioectasia; Haemorrhage; Gastrointestinal (GI); Small Bowel Capsule Endoscopy (SBCE); Convolutional Neural Network (CNN); Computer Aided Diagnosis (CAD); Image Level Annotation; Pixel Level Annotation; binary classification
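The reported rates (sensitivity 90%, specificity 92%, precision 91.8%, FPR 8%, FNR 10%) are mutually consistent on a balanced test set. A minimal sketch follows, assuming hypothetical counts of 100 non-healthy and 100 healthy images; the abstract gives only the rates, so the counts and the helper name `binary_rates` are illustrative.

```python
def binary_rates(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard confusion-matrix rates for a binary classifier."""
    return {
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "precision": tp / (tp + fp),    # positive predictive value
        "fpr": fp / (fp + tn),          # 1 - specificity
        "fnr": fn / (fn + tp),          # 1 - sensitivity
    }


# Hypothetical balanced test set: 100 non-healthy and 100 healthy images.
rates = binary_rates(tp=90, fp=8, tn=92, fn=10)
```

With these assumed counts, precision is 90 / 98, which rounds to the 91.8% quoted in the abstract.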
A comparative study on machine learning-based classification to find photothrombotic lesion in histological rabbit brain images
4
Authors: Sang Hee Jo, Yoonhee Kim, Yoon Bum Lee, Sung Suk Oh, Jong-ryul Choi. Journal of Innovative Optical Health Sciences (SCIE, EI, CAS), 2021, Issue 6, pp. 81-89 (9 pages)
Recently, research has been conducted to assist in the processing and analysis of histopathological images using machine learning algorithms. In this study, we established machine learning-based algorithms to detect photothrombotic lesions in histological images of photothrombosis-induced rabbit brains. Six machine learning-based algorithms for binary classification were applied, and their accuracies in classifying normal tissues and photothrombotic lesions were compared. The lesion classification model consisting of a 3-layered neural network with a rectified linear unit (ReLU) activation function, Xavier initialization, and Adam optimization, using datasets with a unit size of 128×128 pixels, yielded the highest accuracy (0.975). In the validation using the tested histological images, it was confirmed that the model could identify regions where brain damage occurred due to photochemical ischemic stroke. Through the development of machine learning-based photothrombotic lesion classification models and performance comparisons, we confirmed that machine learning algorithms have the potential to be utilized in histopathology and various medical diagnostic techniques.
Keywords: Machine learning; histopathological images; photothrombotic lesion; rabbit brain; binary classification; logistic regression; multi-layer neural networks
Hybrid Models for Breast Cancer Detection via Transfer Learning Technique
5
Authors: Sukhendra Singh, Sur Singh Rawat, Manoj Gupta, B. K. Tripathi, Faisal Alanzi, Arnab Majumdar, Pattaraporn Khuwuthyakorn, Orawit Thinnukool. Computers, Materials & Continua (SCIE, EI), 2023, Issue 2, pp. 3063-3083 (21 pages)
Breast cancer is currently a major cause of death in women worldwide, as the World Health Organization (WHO) has confirmed. The severity of this disease can be reduced to a large extent if it is diagnosed properly at an early stage. Therefore, a patient with cancer can be treated more effectively if the disease is diagnosed as early as possible using better algorithms. Moreover, deep neural networks have recently delivered remarkable performance in detecting cancer in histopathological images of breast tissue. To address these issues, this paper presents a hybrid model that uses transfer learning to study histopathological images, which helps in the detection and rectification of the disease at a low cost. Extensive dataset experiments were carried out to validate the suggested hybrid model. The experimental results show that the proposed model outperformed the baseline methods, with F-scores of 0.81 for the DenseNet + Logistic Regression hybrid model, 0.73 for the Visual Geometry Group (VGG) + Logistic Regression hybrid model, 0.74 for VGG + Random Forest, 0.79 for DenseNet + Random Forest, and 0.79 for the VGG + DenseNet + Logistic Regression hybrid model on the dataset of histopathological images.
Keywords: Histopathological; deep neural network; machine learning; breast cancer; binary classification; transfer learning
Enhanced history matching process by incorporation of saturation logs as model selection criteria
6
Authors: APONTE Jesus Manuel, WEBBER Robert, CENTENO Maria Astrid, DHAKAL Hom Nath, SAYED Mohamed Hassan, MALAKOOTI Reza. Petroleum Exploration and Development (SCIE), 2023, Issue 2, pp. 450-463 (14 pages)
This paper proposes a methodology for an alternative history matching process enhanced by incorporating a simplified binary interpretation of reservoir saturation logs (RST) as the objective function. Incorporating fluid saturation logs during the history matching phase makes it possible to adjust or select models that better represent the near-wellbore waterfront movement, which is particularly important for uncertainty mitigation during future well interference assessments in water-driven reservoirs. For the purposes of this study, a semi-synthetic open-source reservoir model was used as the base case to evaluate the proposed methodology. The reservoir model represents a water-driven, highly heterogeneous sandstone reservoir from the Namorado field in Brazil. To compare the proposed methodology effectively against conventional methods, a commercial reservoir simulator was used in combination with a state-of-the-art benchmarking workflow based on the Big Loop™ approach. A well-known group of binary metrics was evaluated for use as the objective function, and the Matthews correlation coefficient (MCC) proved to offer the best results when using binary data from water saturation logs. History matching results obtained with the proposed methodology allowed the selection of a more reliable group of reservoir models, especially for cases with high heterogeneity. The methodology also offers additional information on and understanding of sweep behaviour behind the well casing at specific production zones, thus revealing the full potential of the model to define new wells and reservoir development opportunities.
Keywords: geological modeling; reservoir model; objective function; binary classification; history matching; saturation logs
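For reference, the Matthews correlation coefficient on binary data is MCC = (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). The sketch below is a minimal illustration, assuming each depth interval of a saturation log has been reduced to a 0/1 flag (for example, 1 = water-swept); that encoding, the function name `mcc`, and the convention of returning 0 when the denominator vanishes are assumptions, not details taken from the paper.

```python
import math
from typing import Sequence


def mcc(observed: Sequence[int], simulated: Sequence[int]) -> float:
    """Matthews correlation coefficient between two 0/1 sequences."""
    pairs = list(zip(observed, simulated))
    tp = sum(1 for o, s in pairs if o == 1 and s == 1)
    tn = sum(1 for o, s in pairs if o == 0 and s == 0)
    fp = sum(1 for o, s in pairs if o == 0 and s == 1)
    fn = sum(1 for o, s in pairs if o == 1 and s == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0 when any marginal count is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom
```

A score of 1 means the simulated flags reproduce the observed ones exactly, 0 means no better than chance, and −1 means complete disagreement, so a history-matching loop would prefer models with higher MCC.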
The Establishment of Mathematical Models for the Composition Analysis and Identification of Ancient Glass Products
7
Authors: Jenny Zhang, Ding Li, Yu Xie, Junfeng Xiang. Open Journal of Applied Sciences, 2023, Issue 11, pp. 2149-2171 (23 pages)
Glass is precious material evidence of trade along the early Silk Road. Ancient glass was easily affected by environmental conditions and weathering, and the resulting changes in composition ratios hindered the correct judgment of its category. In this paper, mathematical models and methods such as the Chi-square test, the weighted average method, principal component analysis, cluster analysis, a binary classification model and grey correlation analysis were used together to analyze the data of sample glass products in combination with their categories. The results showed that the weathered high-potassium glass could be divided into 12, 9, 10 and 27, 7, 22 and so on.
Keywords: Principal Component Analysis; System Clustering; Sensitivity Analysis; binary classification Model; Logistic Regression Analysis; Grey Correlation Analysis
Prediction of COVID-19 Cases Using Machine Learning for Effective Public Health Management (cited by: 2)
8
Authors: Fahad Ahmad, Saleh N. Almuayqil, Mamoona Humayun, Shahid Naseem, Wasim Ahmad Khan, Kashaf Junaid. Computers, Materials & Continua (SCIE, EI), 2021, Issue 3, pp. 2265-2282 (18 pages)
COVID-19 is a pandemic that has affected nearly every country in the world. At present, sustainable development in the area of public health is considered vital to securing a promising and prosperous future for humans. However, widespread diseases such as COVID-19 create numerous challenges to this goal, and some of those challenges are not yet defined. In this study, a Shallow Single-Layer Perceptron Neural Network (SSLPNN) and a Gaussian Process Regression (GPR) model were used for the classification and prediction of confirmed COVID-19 cases in five geographically distributed regions of Asia with diverse settings and environmental conditions: China, South Korea, Japan, Saudi Arabia, and Pakistan. Significant environmental and non-environmental features were taken as the input dataset, and confirmed COVID-19 cases were taken as the output dataset. A correlation analysis was done to identify patterns in the cases related to fluctuations in the associated variables. The results of this study established that the population and air quality index of a region had a statistically significant influence on the cases. However, age and the human development index had a negative influence on the cases. The proposed SSLPNN-based classification model performed well when predicting the classes of confirmed cases. During training, the binary classification model was highly accurate, with a Root Mean Square Error (RMSE) of 0.91. Likewise, the results of the regression analysis using the GPR technique with Matern 5/2 were highly accurate (RMSE = 0.95239) when predicting the number of confirmed COVID-19 cases in an area. Dynamic management has occupied a core place in studies on the sustainable development of public health, but it depends on proactive strategies based on statistically verified approaches, like Artificial Intelligence (AI). In this study, an SSLPNN model has been trained to fit public health associated data into an appropriate class, allowing GPR to predict the number of confirmed COVID-19 cases in an area based on the given values of selected parameters. Therefore, this tool can help authorities in different ecological settings effectively manage COVID-19.
Keywords: Public health; sustainable development; artificial intelligence; SARS-CoV-2; shallow single-layer perceptron neural network; binary classification; Gaussian process regression
Credit Card Fraud Detection Using Weighted Support Vector Machine (cited by: 2)
9
Authors: Dongfang Zhang, Basu Bhandari, Dennis Black. Applied Mathematics, 2020, Issue 12, pp. 1275-1291 (17 pages)
Credit card fraud data is highly imbalanced, with an overwhelmingly large portion of nonfraudulent transactions and a small portion of fraudulent transactions. The measures used to judge the veracity of detection algorithms become critical to deploying a model that accurately scores fraudulent transactions, taking into account class imbalance and the cost of identifying a case as genuine when it is in fact a fraudulent transaction. In this paper, a new criterion for judging classification algorithms, which considers the cost of misclassification, is proposed, and several undersampling techniques are compared using this new criterion. In addition, a weighted support vector machine (SVM) algorithm that considers the financial cost of misclassification is introduced and proves to be more practical for credit card fraud detection than traditional methodologies. This weighted SVM uses transaction balances as weights for fraudulent transactions and a uniform weight for nonfraudulent transactions. The results show that this strategy greatly improves the performance of credit card fraud detection.
Keywords: Support Vector Machine; binary classification; Imbalanced Data; Undersampling; Credit Card Fraud
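The weighting scheme described above can be illustrated with a per-sample weighted hinge loss. This is a minimal sketch under stated assumptions, not the authors' formulation: a linear SVM objective, labels +1 for fraud and −1 for nonfraud, and the hypothetical names `weighted_hinge_loss`, `amounts`, `lam`, and `nonfraud_weight`.

```python
from typing import Sequence


def weighted_hinge_loss(
    w: Sequence[float],
    b: float,
    X: Sequence[Sequence[float]],
    y: Sequence[int],
    amounts: Sequence[float],
    lam: float = 0.01,
    nonfraud_weight: float = 1.0,
) -> float:
    """L2-regularised hinge loss with per-sample misclassification costs.

    Fraudulent samples (y == +1) are weighted by their transaction amount;
    nonfraudulent samples (y == -1) share one uniform weight.
    """
    total = 0.0
    for x_i, y_i, amount in zip(X, y, amounts):
        score = sum(w_j * x_j for w_j, x_j in zip(w, x_i)) + b
        cost = amount if y_i == 1 else nonfraud_weight
        total += cost * max(0.0, 1.0 - y_i * score)
    regulariser = 0.5 * lam * sum(w_j * w_j for w_j in w)
    return regulariser + total / len(X)
```

Under this assumed objective, missing a large fraudulent transaction is penalised in proportion to its amount, while every nonfraudulent error costs the same.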
Valuable Data Extraction for Resistivity Imaging Logging Interpretation (cited by: 7)
10
Authors: Yili Ren, Renbin Gong, Zhou Feng, Meichao Li. Tsinghua Science and Technology (SCIE, EI, CAS, CSCD), 2020, Issue 2, pp. 281-293 (13 pages)
Imaging logging has become a popular means of well logging because it can visually represent the lithologic and structural characteristics of strata. The manual interpretation of imaging logging is affected by the limitations of the naked eye and by experiential factors. As a result, manual interpretation accuracy is low. Therefore, it is highly useful to develop effective automatic imaging logging interpretation by machine learning. Resistivity imaging logging is the most widely used technology for imaging logging. In this paper, we propose an automatic extraction procedure for the geological features in resistivity imaging logging images. This procedure is based on machine learning and achieves good results in practical applications. Acknowledging that the existence of valueless data significantly affects the recognition effect, we propose three strategies for the identification of valueless data based on binary classification. We compare the effect of the three strategies both on an experimental dataset and in a production environment, and find that the merging method performs best of the three. It effectively identifies the valueless data in the well logging images, thus significantly improving the automatic recognition of geological features in resistivity logging images.
Keywords: machine learning; binary classification; multiclass classification; outlier detection; imaging logging
The balance property in neural network modelling
11
Author: Mario V. Wüthrich. Statistical Theory and Related Fields, 2022, Issue 1, pp. 1-9 (9 pages)
In estimation and prediction theory, considerable attention is paid to the question of having unbiased estimators on a global population level. Recent developments in neural network modelling have mainly focused on accuracy on a granular sample level, and the question of unbiasedness on the population level has almost completely been neglected by that community. We discuss this question within neural network regression models, and we provide methods for obtaining unbiased estimators for these models on the global population level.
Keywords: Balance property; unbiased; binary classification; logistic regression; neural network; classification tree
Fast Maximum Entropy Machine for Big Imbalanced Datasets
12
Authors: Feng Yin, Shuqing Lin, Chuxin Piao, Shuguang (Robert) Cui. Journal of Communications and Information Networks, 2018, Issue 3, pp. 20-30 (11 pages)
Driven by the needs of a plethora of machine learning applications, several attempts have been made at improving the performance of classifiers applied to imbalanced datasets. In this paper, we present a fast maximum entropy machine (MEM) combined with a synthetic minority over-sampling technique for handling binary classification problems with high imbalance ratios, large numbers of data samples, and medium to large numbers of features. A random Fourier feature representation of kernel functions and the primal estimated sub-gradient solver for support vector machine (PEGASOS) are applied to speed up the classic MEM. Experiments have been conducted using various real datasets (including two China Mobile datasets and several other standard test datasets) with various configurations. The obtained results demonstrate that the proposed algorithm has extremely low complexity yet excellent overall classification performance (in terms of several widely used evaluation metrics) compared to the classic MEM and some other state-of-the-art methods. The proposed algorithm is particularly valuable in big data applications owing to its significantly low computational complexity.
Keywords: binary classification; imbalanced datasets; maximum entropy machine; PEGASOS; random Fourier feature; SMOTE