Diabetes mellitus is a long-term condition characterized by hyperglycemia.It could lead to plenty of difficulties.According to rising morbidity in recent years,the world’s diabetic patients will exceed 642 million by...Diabetes mellitus is a long-term condition characterized by hyperglycemia.It could lead to plenty of difficulties.According to rising morbidity in recent years,the world’s diabetic patients will exceed 642 million by 2040,implying that one out of every ten persons will be diabetic.There is no doubt that this startling figure requires immediate attention from industry and academia to promote innovation and growth in diabetes risk prediction to save individuals’lives.Due to its rapid development,deep learning(DL)was used to predict numerous diseases.However,DLmethods still suffer from their limited prediction performance due to the hyperparameters selection and parameters optimization.Therefore,the selection of hyper-parameters is critical in improving classification performance.This study presents Convolutional Neural Network(CNN)that has achieved remarkable results in many medical domains where the Bayesian optimization algorithm(BOA)has been employed for hyperparameters selection and parameters optimization.Two issues have been investigated and solved during the experiment to enhance the results.The first is the dataset class imbalance,which is solved using Synthetic Minority Oversampling Technique(SMOTE)technique.The second issue is the model’s poor performance,which has been solved using the Bayesian optimization algorithm.The findings indicate that the Bayesian based-CNN model superbases all the state-of-the-art models in the literature with an accuracy of 89.36%,F1-score of 0.88.6,andMatthews Correlation Coefficient(MCC)of 0.88.6.展开更多
In this paper,a Bayesian sea ice detection algorithm is first used based on the HY-2A/SCAT data,and a backpropagation(BP)neural network is used to classify the Arctic sea ice type.During the implementation of the Baye...In this paper,a Bayesian sea ice detection algorithm is first used based on the HY-2A/SCAT data,and a backpropagation(BP)neural network is used to classify the Arctic sea ice type.During the implementation of the Bayesian sea ice detection algorithm,linear sea ice model parameters and the backscatter variance suitable for HY-2A/SCAT were proposed.The sea ice extent obtained by the Bayesian sea ice detection algorithm was projected on a 12.5 km grid sea ice map and validated by the Advanced Microwave Scanning Radiometer 2(AMSR2)15%sea ice concentration data.The sea ice extent obtained by the Bayesian sea ice detection al-gorithm was found to be in good agreement with that of the AMSR2 during the ice growth season.Meanwhile,the Bayesian sea ice detection algorithm gave a wider ice edge than the AMSR2 during the ice melting season.For the sea ice type classification,the BP neural network was used to classify the Arctic sea ice type(multi-year and first-year ice)from January to May and October to De-cember in 2014.Comparison results between the HY-2A/SCAT sea ice type and Equal-Area Scalable Earth Grid(EASE-Grid)sea ice age data showed that the HY-2A/SCAT multi-year ice extent variation had the same trend as the EASE-Grid data.Classification errors,defined as the ratio of the mismatched sea ice type points between HY-2A/SCAT and EASE-Grid to the total sea ice points,were less than 12%,and the average classification error was 8.6%for the study period,which indicated that the BP neural network classification was a feasible algorithm for HY-2A/SCAT sea ice type classification.展开更多
Purpose-This study aims to differential diagnosis of some diseases using classification methods to support effective medical treatment.For this purpose,different classification methods based on data,experts’knowledge...Purpose-This study aims to differential diagnosis of some diseases using classification methods to support effective medical treatment.For this purpose,different classification methods based on data,experts’knowledge and both are considered in some cases.Besides,feature reduction and some clustering methods are used to improve their performance.Design/methodology/approach-First,the performances of classification methods are evaluated for differential diagnosis of different diseases.Then,experts’knowledge is utilized to modify the Bayesian networks’structures.Analyses of the results show that using experts’knowledge is more effective than other algorithms for increasing the accuracy of Bayesian network classification.A total of ten different diseases are used for testing,taken from the Machine Learning Repository datasets of the University of California at Irvine(UCI).Findings-The proposed method improves both the computation time and accuracy of the classification methods used in this paper.Bayesian networks based on experts’knowledge achieve a maximum average accuracy of 87 percent,with a minimum standard deviation average of 0.04 over the sample datasets among all classification methods.Practical implications-The proposed methodology can be applied to perform disease differential diagnosis analysis.Originality/value-This study presents the usefulness of experts’knowledge in the diagnosis while proposing an adopted improvement method for classifications.Besides,the Bayesian network based on experts’knowledge is useful for different diseases neglected by previous papers.展开更多
In the privacy preservation of association rules, sensitivity analysis should be reported after the quantification of items in terms of their occurrence. The traditional methodologies, used for preserving confidential...In the privacy preservation of association rules, sensitivity analysis should be reported after the quantification of items in terms of their occurrence. The traditional methodologies, used for preserving confidentiality of association rules, are based on the assumptions while safeguarding susceptible information rather than recognition of insightful items. Therefore, it is time to go one step ahead in order to remove such assumptions in the protection of responsive information especially in XML association rule mining. Thus, we focus on this central and highly researched area in terms of generating XML association rule mining without arguing on the disclosure risks involvement in such mining process. Hence, we described the identification of susceptible items in order to hide the confidential information through a supervised learning technique. These susceptible items show the high dependency on other items that are measured in terms of statistical significance with Bayesian Network. Thus, we proposed two methodologies based on items probabilistic occurrence and mode of items. Additionally, all this information is modeled and named PPDM (Privacy Preservation in Data Mining) model for XARs. Furthermore, the PPDM model is helpful for sharing markets information among competitors with a lower chance of generating monopoly. Finally, PPDM model introduces great accuracy in computing sensitivity of items and opens new dimensions to the academia for the standardization of such NP-hard problems.展开更多
基金This research/paper was fully supported by Universiti Teknologi PETRONAS,under the Yayasan Universiti Teknologi PETRONAS(YUTP)Fundamental Research Grant Scheme(015LC0-311).
文摘Diabetes mellitus is a long-term condition characterized by hyperglycemia.It could lead to plenty of difficulties.According to rising morbidity in recent years,the world’s diabetic patients will exceed 642 million by 2040,implying that one out of every ten persons will be diabetic.There is no doubt that this startling figure requires immediate attention from industry and academia to promote innovation and growth in diabetes risk prediction to save individuals’lives.Due to its rapid development,deep learning(DL)was used to predict numerous diseases.However,DLmethods still suffer from their limited prediction performance due to the hyperparameters selection and parameters optimization.Therefore,the selection of hyper-parameters is critical in improving classification performance.This study presents Convolutional Neural Network(CNN)that has achieved remarkable results in many medical domains where the Bayesian optimization algorithm(BOA)has been employed for hyperparameters selection and parameters optimization.Two issues have been investigated and solved during the experiment to enhance the results.The first is the dataset class imbalance,which is solved using Synthetic Minority Oversampling Technique(SMOTE)technique.The second issue is the model’s poor performance,which has been solved using the Bayesian optimization algorithm.The findings indicate that the Bayesian based-CNN model superbases all the state-of-the-art models in the literature with an accuracy of 89.36%,F1-score of 0.88.6,andMatthews Correlation Coefficient(MCC)of 0.88.6.
基金supported by the National Natural Science Foundation of China(No.42030406)。
文摘In this paper,a Bayesian sea ice detection algorithm is first used based on the HY-2A/SCAT data,and a backpropagation(BP)neural network is used to classify the Arctic sea ice type.During the implementation of the Bayesian sea ice detection algorithm,linear sea ice model parameters and the backscatter variance suitable for HY-2A/SCAT were proposed.The sea ice extent obtained by the Bayesian sea ice detection algorithm was projected on a 12.5 km grid sea ice map and validated by the Advanced Microwave Scanning Radiometer 2(AMSR2)15%sea ice concentration data.The sea ice extent obtained by the Bayesian sea ice detection al-gorithm was found to be in good agreement with that of the AMSR2 during the ice growth season.Meanwhile,the Bayesian sea ice detection algorithm gave a wider ice edge than the AMSR2 during the ice melting season.For the sea ice type classification,the BP neural network was used to classify the Arctic sea ice type(multi-year and first-year ice)from January to May and October to De-cember in 2014.Comparison results between the HY-2A/SCAT sea ice type and Equal-Area Scalable Earth Grid(EASE-Grid)sea ice age data showed that the HY-2A/SCAT multi-year ice extent variation had the same trend as the EASE-Grid data.Classification errors,defined as the ratio of the mismatched sea ice type points between HY-2A/SCAT and EASE-Grid to the total sea ice points,were less than 12%,and the average classification error was 8.6%for the study period,which indicated that the BP neural network classification was a feasible algorithm for HY-2A/SCAT sea ice type classification.
文摘Purpose-This study aims to differential diagnosis of some diseases using classification methods to support effective medical treatment.For this purpose,different classification methods based on data,experts’knowledge and both are considered in some cases.Besides,feature reduction and some clustering methods are used to improve their performance.Design/methodology/approach-First,the performances of classification methods are evaluated for differential diagnosis of different diseases.Then,experts’knowledge is utilized to modify the Bayesian networks’structures.Analyses of the results show that using experts’knowledge is more effective than other algorithms for increasing the accuracy of Bayesian network classification.A total of ten different diseases are used for testing,taken from the Machine Learning Repository datasets of the University of California at Irvine(UCI).Findings-The proposed method improves both the computation time and accuracy of the classification methods used in this paper.Bayesian networks based on experts’knowledge achieve a maximum average accuracy of 87 percent,with a minimum standard deviation average of 0.04 over the sample datasets among all classification methods.Practical implications-The proposed methodology can be applied to perform disease differential diagnosis analysis.Originality/value-This study presents the usefulness of experts’knowledge in the diagnosis while proposing an adopted improvement method for classifications.Besides,the Bayesian network based on experts’knowledge is useful for different diseases neglected by previous papers.
文摘In the privacy preservation of association rules, sensitivity analysis should be reported after the quantification of items in terms of their occurrence. The traditional methodologies, used for preserving confidentiality of association rules, are based on the assumptions while safeguarding susceptible information rather than recognition of insightful items. Therefore, it is time to go one step ahead in order to remove such assumptions in the protection of responsive information especially in XML association rule mining. Thus, we focus on this central and highly researched area in terms of generating XML association rule mining without arguing on the disclosure risks involvement in such mining process. Hence, we described the identification of susceptible items in order to hide the confidential information through a supervised learning technique. These susceptible items show the high dependency on other items that are measured in terms of statistical significance with Bayesian Network. Thus, we proposed two methodologies based on items probabilistic occurrence and mode of items. Additionally, all this information is modeled and named PPDM (Privacy Preservation in Data Mining) model for XARs. Furthermore, the PPDM model is helpful for sharing markets information among competitors with a lower chance of generating monopoly. Finally, PPDM model introduces great accuracy in computing sensitivity of items and opens new dimensions to the academia for the standardization of such NP-hard problems.