The use of Explainable Artificial Intelligence(XAI)models becomes increasingly important for making decisions in smart healthcare environments.It is to make sure that decisions are based on trustworthy algorithms and ...The use of Explainable Artificial Intelligence(XAI)models becomes increasingly important for making decisions in smart healthcare environments.It is to make sure that decisions are based on trustworthy algorithms and that healthcare workers understand the decisions made by these algorithms.These models can potentially enhance interpretability and explainability in decision-making processes that rely on artificial intelligence.Nevertheless,the intricate nature of the healthcare field necessitates the utilization of sophisticated models to classify cancer images.This research presents an advanced investigation of XAI models to classify cancer images.It describes the different levels of explainability and interpretability associated with XAI models and the challenges faced in deploying them in healthcare applications.In addition,this study proposes a novel framework for cancer image classification that incorporates XAI models with deep learning and advanced medical imaging techniques.The proposed model integrates several techniques,including end-to-end explainable evaluation,rule-based explanation,and useradaptive explanation.The proposed XAI reaches 97.72%accuracy,90.72%precision,93.72%recall,96.72%F1-score,9.55%FDR,9.66%FOR,and 91.18%DOR.It will discuss the potential applications of the proposed XAI models in the smart healthcare environment.It will help ensure trust and accountability in AI-based decisions,which is essential for achieving a safe and reliable smart healthcare environment.展开更多
In the era of advanced machine learning techniques,the development of accurate predictive models for complex medical conditions,such as thyroid cancer,has shown remarkable progress.Accurate predictivemodels for thyroi...In the era of advanced machine learning techniques,the development of accurate predictive models for complex medical conditions,such as thyroid cancer,has shown remarkable progress.Accurate predictivemodels for thyroid cancer enhance early detection,improve resource allocation,and reduce overtreatment.However,the widespread adoption of these models in clinical practice demands predictive performance along with interpretability and transparency.This paper proposes a novel association-rule based feature-integratedmachine learning model which shows better classification and prediction accuracy than present state-of-the-artmodels.Our study also focuses on the application of SHapley Additive exPlanations(SHAP)values as a powerful tool for explaining thyroid cancer prediction models.In the proposed method,the association-rule based feature integration framework identifies frequently occurring attribute combinations in the dataset.The original dataset is used in trainingmachine learning models,and further used in generating SHAP values fromthesemodels.In the next phase,the dataset is integrated with the dominant feature sets identified through association-rule based analysis.This new integrated dataset is used in re-training the machine learning models.The new SHAP values generated from these models help in validating the contributions of feature sets in predicting malignancy.The conventional machine learning models lack interpretability,which can hinder their integration into clinical decision-making systems.In this study,the SHAP values are introduced along with association-rule based feature integration as a comprehensive framework for understanding the contributions of feature sets inmodelling the predictions.The study discusses the importance of reliable predictive models for early diagnosis of thyroid cancer,and a validation framework of explainability.The proposed model shows an accuracy of 93.48%.Performance metrics such as precision,recall,F1-score,and the area under the receiver operating characteristic(AUROC)are also higher than the baseline models.The results of the proposed model help us identify the dominant feature sets that impact thyroid cancer classification and prediction.The features{calcification}and{shape}consistently emerged as the top-ranked features associated with thyroid malignancy,in both association-rule based interestingnessmetric values and SHAPmethods.The paper highlights the potential of the rule-based integrated models with SHAP in bridging the gap between the machine learning predictions and the interpretability of this prediction which is required for real-world medical applications.展开更多
Corpus-based studies show that near-synonyms differ in their collocational behavior and semantic prosody which causes deviation in Chinese EFL learners’English using.This paper attempts to investigate the effectivene...Corpus-based studies show that near-synonyms differ in their collocational behavior and semantic prosody which causes deviation in Chinese EFL learners’English using.This paper attempts to investigate the effectiveness of a corpus-based approach to the teaching and learning of near synonyms and their collocations.By using internet accessible computers to log on to websites of BNC and COCA,learners can view real language data and study the usage of targeted near synonyms.This empirical study tends to show that a corpus based approach is positively effective to the learning of near synonyms.It also provides some pedagogical implications that learners can play a major and active role in DDL and teachers only act as assistants.展开更多
Word similarity(WS)is a fundamental and critical task in natural language processing.Existing approaches to WS are mainly to calculate the similarity or relatedness of word pairs based on word embedding obtained by ma...Word similarity(WS)is a fundamental and critical task in natural language processing.Existing approaches to WS are mainly to calculate the similarity or relatedness of word pairs based on word embedding obtained by massive and high-quality corpus.However,it may suffer from poor performance for insufficient corpus in some specific fields,and cannot capture rich semantic and sentimental information.To address these above problems,we propose an enhancing embedding-based word similarity evaluation with character-word concepts and synonyms knowledge,namely EWS-CS model,which can provide extra semantic information to enhance word similarity evaluation.The core of our approach contains knowledge encoder and word encoder.In knowledge encoder,we incorporate the semantic knowledge extracted from knowledge resources,including character-word concepts,synonyms and sentiment lexicons,to obtain knowledge representation.Word encoder is to learn enhancing embedding-based word representation from pre-trained model and knowledge representation based on similarity task.Finally,compared with baseline models,the experiments on four similarity evaluation datasets validate the effectiveness of our EWS-CS model in WS task.展开更多
In the Cambrian-Ordoviclan strata in the lower reaches of the Qingjiang River (western Hubei province), many well-consolidated paleokarst breccia bodies of different sizes and occurrences have been observed. By studyi...In the Cambrian-Ordoviclan strata in the lower reaches of the Qingjiang River (western Hubei province), many well-consolidated paleokarst breccia bodies of different sizes and occurrences have been observed. By studying them in detail and by introducing new ideas of modern sedimentology into the study, the authors described and classified these paleokarst breccias. Their forming conditions and mechanisms are also explained on the basis of our research展开更多
Languages embody a lot of words that are considered as synonyms, and people just take it for granted that such words are identical in meaning without any discrimination. However, a corpus-based approach to the study o...Languages embody a lot of words that are considered as synonyms, and people just take it for granted that such words are identical in meaning without any discrimination. However, a corpus-based approach to the study of the collocational behavior of the two frequently-used pairs of synonyms (selection and option, ill and sick) reveals significant discrepancies in the use of these two pairs of synonyms by Chinese English learners and native speakers. According to the analysis, the major problems lie in the current ways of vocabulary teaching and learning. This paper aims to highlight the important role of the corpus-based collocational research in English vocabulary teaching and learning. In the end, some suggestions concerning vocabulary teaching and leaming are put forward on the basis of corpus-based research.展开更多
Aim: This study aimed to investigate the effect of non-synonymous SNPs (nsSNPs) of the Glucagon-like peptide-1 Receptor (GLP-1R) gene in protein function and structure using different computational software. Introduct...Aim: This study aimed to investigate the effect of non-synonymous SNPs (nsSNPs) of the Glucagon-like peptide-1 Receptor (GLP-1R) gene in protein function and structure using different computational software. Introduction: The GLP1R gene provides the necessary instruction for the synthesis of the insulin hormones which is needed for glucose catabolism. Polymorphisms in this gene are associated with diabetes. The protein is an important drug target for the treatment of type-2 diabetes and stroke. Material and Methods: Different nsSNPs and protein-related sequences were obtained from NCBI and ExPASY database. Gene associations and interactions were predicted using GeneMANIA software. Deleterious and damaging effects of nsSNPs were analyzed using SIFT, Provean, and Polyphen-2. The association of the nsSNPs with the disease was predicted using SNPs & GO software. Protein stability was investigated using I-Mutant and MUpro software. The structural and functional impact of point mutations was predicted using Project Hope software. Project Hope analyzes the mutations according to their size, charge, hydrophobicity, and conservancy. Results: The GLP1R gene was found to have an association with 20 other different genes. Among the most important ones is the GCG (glucagon) gene which is also a trans membrane protein. Overall 7229 variants were seen, and the missense variants or nsSNPs (146) were selected for further analysis. The total number of nsSNPs obtained in this study was 146. After being subjected to SIFT software (27 Deleterious and 119 Tolerated) were predicted. Analysis with Provean showed that (20 deleterious and 7 neutral). Analysis using Polyphen-2 revealed 17 probably damaging, 2 possibly damaging and 1 benign nsSNPs. Using two additional software SNPs & GO and PHD-SNPs showed that 14 and 17 nsSNPs had a disease effect, respectively. Project Hope software predicts the effect of the 14 nsSNPs on the protein function due to differences in charge, size, hydrophobicity, and conservancy between the wild and mutant types. Conclusion: In this study, the 14 nsSNPs which were highly affected the protein function. This protein is providing the necessary instruction for the synthesis of the insulin hormones which is needed for glucose catabolism. Polymorphisms in this gene are associated with diabetes and also affect the treatment of diabetic patients due to the fact that the protein acts as an important drug target.展开更多
The discrimination of synonyms has always been one of the great challenges for English learners.Taking assessment and evaluation as examples,this study analyses the similarities and differences of the two words,as wel...The discrimination of synonyms has always been one of the great challenges for English learners.Taking assessment and evaluation as examples,this study analyses the similarities and differences of the two words,as well as their usage from the perspectives of frequency,stylistics,collocation and semantic prosody with the help of British National Corpus,and demonstrates the importance of corpus retrieval tools in synonyms discrimination.Furthermore,this paper will give some suggestions for English learners and teachers in English vocabulary teaching.展开更多
E. Stone in the article “18 Mysteries and Unanswered Questions About Our Solar System. Little Astronomy” wrote: One of the great things about astronomy is that there is still so much out there for us to discover. Th...E. Stone in the article “18 Mysteries and Unanswered Questions About Our Solar System. Little Astronomy” wrote: One of the great things about astronomy is that there is still so much out there for us to discover. There are so many unanswered questions and mysteries about the universe. There is always a puzzle to solve and that is part of beauty. Even in our own neighborhood, the Solar System, there are many questions we still have not been able to answer [1]. In the present paper, we explain the majority of these Mysteries and some other unexplained phenomena in the Solar System (SS) in frames of the developed Hypersphere World-Universe Model (WUM) [2].展开更多
基金supported by theCONAHCYT(Consejo Nacional deHumanidades,Ciencias y Tecnologias).
文摘The use of Explainable Artificial Intelligence(XAI)models becomes increasingly important for making decisions in smart healthcare environments.It is to make sure that decisions are based on trustworthy algorithms and that healthcare workers understand the decisions made by these algorithms.These models can potentially enhance interpretability and explainability in decision-making processes that rely on artificial intelligence.Nevertheless,the intricate nature of the healthcare field necessitates the utilization of sophisticated models to classify cancer images.This research presents an advanced investigation of XAI models to classify cancer images.It describes the different levels of explainability and interpretability associated with XAI models and the challenges faced in deploying them in healthcare applications.In addition,this study proposes a novel framework for cancer image classification that incorporates XAI models with deep learning and advanced medical imaging techniques.The proposed model integrates several techniques,including end-to-end explainable evaluation,rule-based explanation,and useradaptive explanation.The proposed XAI reaches 97.72%accuracy,90.72%precision,93.72%recall,96.72%F1-score,9.55%FDR,9.66%FOR,and 91.18%DOR.It will discuss the potential applications of the proposed XAI models in the smart healthcare environment.It will help ensure trust and accountability in AI-based decisions,which is essential for achieving a safe and reliable smart healthcare environment.
文摘In the era of advanced machine learning techniques,the development of accurate predictive models for complex medical conditions,such as thyroid cancer,has shown remarkable progress.Accurate predictivemodels for thyroid cancer enhance early detection,improve resource allocation,and reduce overtreatment.However,the widespread adoption of these models in clinical practice demands predictive performance along with interpretability and transparency.This paper proposes a novel association-rule based feature-integratedmachine learning model which shows better classification and prediction accuracy than present state-of-the-artmodels.Our study also focuses on the application of SHapley Additive exPlanations(SHAP)values as a powerful tool for explaining thyroid cancer prediction models.In the proposed method,the association-rule based feature integration framework identifies frequently occurring attribute combinations in the dataset.The original dataset is used in trainingmachine learning models,and further used in generating SHAP values fromthesemodels.In the next phase,the dataset is integrated with the dominant feature sets identified through association-rule based analysis.This new integrated dataset is used in re-training the machine learning models.The new SHAP values generated from these models help in validating the contributions of feature sets in predicting malignancy.The conventional machine learning models lack interpretability,which can hinder their integration into clinical decision-making systems.In this study,the SHAP values are introduced along with association-rule based feature integration as a comprehensive framework for understanding the contributions of feature sets inmodelling the predictions.The study discusses the importance of reliable predictive models for early diagnosis of thyroid cancer,and a validation framework of explainability.The proposed model shows an accuracy of 93.48%.Performance metrics such as precision,recall,F1-score,and the area under the receiver operating characteristic(AUROC)are also higher than the baseline models.The results of the proposed model help us identify the dominant feature sets that impact thyroid cancer classification and prediction.The features{calcification}and{shape}consistently emerged as the top-ranked features associated with thyroid malignancy,in both association-rule based interestingnessmetric values and SHAPmethods.The paper highlights the potential of the rule-based integrated models with SHAP in bridging the gap between the machine learning predictions and the interpretability of this prediction which is required for real-world medical applications.
文摘Corpus-based studies show that near-synonyms differ in their collocational behavior and semantic prosody which causes deviation in Chinese EFL learners’English using.This paper attempts to investigate the effectiveness of a corpus-based approach to the teaching and learning of near synonyms and their collocations.By using internet accessible computers to log on to websites of BNC and COCA,learners can view real language data and study the usage of targeted near synonyms.This empirical study tends to show that a corpus based approach is positively effective to the learning of near synonyms.It also provides some pedagogical implications that learners can play a major and active role in DDL and teachers only act as assistants.
基金This work is supported by the National Natural Science Foundation of China(No.61801440),the High-quality and Cutting-edge Disciplines Construction Project for Universities in Beijing(Internet Information,Communication University of China),State Key Laboratory of Media Convergence and Communication(Communication University of China),and the Fundamental Research Funds for the Central Universities.
文摘Word similarity(WS)is a fundamental and critical task in natural language processing.Existing approaches to WS are mainly to calculate the similarity or relatedness of word pairs based on word embedding obtained by massive and high-quality corpus.However,it may suffer from poor performance for insufficient corpus in some specific fields,and cannot capture rich semantic and sentimental information.To address these above problems,we propose an enhancing embedding-based word similarity evaluation with character-word concepts and synonyms knowledge,namely EWS-CS model,which can provide extra semantic information to enhance word similarity evaluation.The core of our approach contains knowledge encoder and word encoder.In knowledge encoder,we incorporate the semantic knowledge extracted from knowledge resources,including character-word concepts,synonyms and sentiment lexicons,to obtain knowledge representation.Word encoder is to learn enhancing embedding-based word representation from pre-trained model and knowledge representation based on similarity task.Finally,compared with baseline models,the experiments on four similarity evaluation datasets validate the effectiveness of our EWS-CS model in WS task.
文摘In the Cambrian-Ordoviclan strata in the lower reaches of the Qingjiang River (western Hubei province), many well-consolidated paleokarst breccia bodies of different sizes and occurrences have been observed. By studying them in detail and by introducing new ideas of modern sedimentology into the study, the authors described and classified these paleokarst breccias. Their forming conditions and mechanisms are also explained on the basis of our research
文摘Languages embody a lot of words that are considered as synonyms, and people just take it for granted that such words are identical in meaning without any discrimination. However, a corpus-based approach to the study of the collocational behavior of the two frequently-used pairs of synonyms (selection and option, ill and sick) reveals significant discrepancies in the use of these two pairs of synonyms by Chinese English learners and native speakers. According to the analysis, the major problems lie in the current ways of vocabulary teaching and learning. This paper aims to highlight the important role of the corpus-based collocational research in English vocabulary teaching and learning. In the end, some suggestions concerning vocabulary teaching and leaming are put forward on the basis of corpus-based research.
文摘Aim: This study aimed to investigate the effect of non-synonymous SNPs (nsSNPs) of the Glucagon-like peptide-1 Receptor (GLP-1R) gene in protein function and structure using different computational software. Introduction: The GLP1R gene provides the necessary instruction for the synthesis of the insulin hormones which is needed for glucose catabolism. Polymorphisms in this gene are associated with diabetes. The protein is an important drug target for the treatment of type-2 diabetes and stroke. Material and Methods: Different nsSNPs and protein-related sequences were obtained from NCBI and ExPASY database. Gene associations and interactions were predicted using GeneMANIA software. Deleterious and damaging effects of nsSNPs were analyzed using SIFT, Provean, and Polyphen-2. The association of the nsSNPs with the disease was predicted using SNPs & GO software. Protein stability was investigated using I-Mutant and MUpro software. The structural and functional impact of point mutations was predicted using Project Hope software. Project Hope analyzes the mutations according to their size, charge, hydrophobicity, and conservancy. Results: The GLP1R gene was found to have an association with 20 other different genes. Among the most important ones is the GCG (glucagon) gene which is also a trans membrane protein. Overall 7229 variants were seen, and the missense variants or nsSNPs (146) were selected for further analysis. The total number of nsSNPs obtained in this study was 146. After being subjected to SIFT software (27 Deleterious and 119 Tolerated) were predicted. Analysis with Provean showed that (20 deleterious and 7 neutral). Analysis using Polyphen-2 revealed 17 probably damaging, 2 possibly damaging and 1 benign nsSNPs. Using two additional software SNPs & GO and PHD-SNPs showed that 14 and 17 nsSNPs had a disease effect, respectively. Project Hope software predicts the effect of the 14 nsSNPs on the protein function due to differences in charge, size, hydrophobicity, and conservancy between the wild and mutant types. Conclusion: In this study, the 14 nsSNPs which were highly affected the protein function. This protein is providing the necessary instruction for the synthesis of the insulin hormones which is needed for glucose catabolism. Polymorphisms in this gene are associated with diabetes and also affect the treatment of diabetic patients due to the fact that the protein acts as an important drug target.
文摘The discrimination of synonyms has always been one of the great challenges for English learners.Taking assessment and evaluation as examples,this study analyses the similarities and differences of the two words,as well as their usage from the perspectives of frequency,stylistics,collocation and semantic prosody with the help of British National Corpus,and demonstrates the importance of corpus retrieval tools in synonyms discrimination.Furthermore,this paper will give some suggestions for English learners and teachers in English vocabulary teaching.
文摘E. Stone in the article “18 Mysteries and Unanswered Questions About Our Solar System. Little Astronomy” wrote: One of the great things about astronomy is that there is still so much out there for us to discover. There are so many unanswered questions and mysteries about the universe. There is always a puzzle to solve and that is part of beauty. Even in our own neighborhood, the Solar System, there are many questions we still have not been able to answer [1]. In the present paper, we explain the majority of these Mysteries and some other unexplained phenomena in the Solar System (SS) in frames of the developed Hypersphere World-Universe Model (WUM) [2].
文摘纵向联邦学习(vertical federated learning,VFL)常用于高风险场景中的跨领域数据共享,用户需要理解并信任模型决策以推动模型应用。现有研究主要关注VFL中可解释性与隐私之间的权衡,未充分满足用户对模型建立信任及调优的需求。为此,提出了一种基于人在回路(human-in-the-loop,HITL)的纵向联邦学习解释方法(explainable vertical federated learning based on human-in-the-loop,XVFL-HITL),通过构建分布式HITL结构将用户反馈纳入VFL的基于Shapley值的解释方法中,利用各参与方的知识校正训练数据来提高模型性能。进一步,考虑到隐私问题,基于Shapley值的可加性原理,将非当前参与方的特征贡献值整合为一个整体展示,从而有效保护了各参与方的特征隐私。实验结果表明,在基准数据上,XVFL-HITL的解释结果具有有效性,并保护了用户的特征隐私;同时,XVFL-HITL对比VFL-Random和直接使用SHAP的VFL-Shapley进行特征选择的方法,模型准确率分别提高了约14%和11%。