Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innova...Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.展开更多
Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used...Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used in to full-stack clinical applications,including image synthesis/reconstruction,registration,segmentation,detection,and diagnosis.This paper aimed to promote awareness of the applications of transformers in medical image analysis.Specifically,we first provided an overview of the core concepts of the attention mechanism built into transformers and other basic components.Second,we reviewed various transformer architectures tailored for medical image applications and discuss their limitations.Within this review,we investigated key challenges including the use of transformers in different learning paradigms,improving model efficiency,and coupling with other techniques.We hope this review would provide a comprehensive picture of transformers to readers with an interest in medical image analysis.展开更多
Computer-aided diagnosis based on image color rendering promotes medical image analysis and doctor-patient communication by highlighting important information of medical diagnosis.To overcome the limitations of the co...Computer-aided diagnosis based on image color rendering promotes medical image analysis and doctor-patient communication by highlighting important information of medical diagnosis.To overcome the limitations of the color rendering method based on deep learning,such as poor model stability,poor rendering quality,fuzzy boundaries and crossed color boundaries,we propose a novel hinge-cross-entropy generative adversarial network(HCEGAN).The self-attention mechanism was added and improved to focus on the important information of the image.And the hinge-cross-entropy loss function was used to stabilize the training process of GAN models.In this study,we implement the HCEGAN model for image color rendering based on DIV2K and COCO datasets,and evaluate the results using SSIM and PSNR.The experimental results show that the proposed HCEGAN automatically re-renders images,significantly improves the quality of color rendering and greatly improves the stability of prior GAN models.展开更多
Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR ...Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR detection methods have mainly relied on manual feature extraction and classification,leading to errors.This paper proposes a novel VTDR detection and classification model that combines different models through majority voting.Our proposed methodology involves preprocessing,data augmentation,feature extraction,and classification stages.We use a hybrid convolutional neural network-singular value decomposition(CNN-SVD)model for feature extraction and selection and an improved SVM-RBF with a Decision Tree(DT)and K-Nearest Neighbor(KNN)for classification.We tested our model on the IDRiD dataset and achieved an accuracy of 98.06%,a sensitivity of 83.67%,and a specificity of 100%for DR detection and evaluation tests,respectively.Our proposed approach outperforms baseline techniques and provides a more robust and accurate method for VTDR detection.展开更多
Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importan...Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importance of each component, describing the specificity and correlations of these elements involved in achieving the precision of interpretation of medical images using DL. The major contribution of this work is primarily to the updated characterisation of the characteristics of the constituent elements of the deep learning process, scientific data, methods of knowledge incorporation, DL models according to the objectives for which they were designed and the presentation of medical applications in accordance with these tasks. Secondly, it describes the specific correlations between the quality, type and volume of data, the deep learning patterns used in the interpretation of diagnostic medical images and their applications in medicine. Finally presents problems and directions of future research. Data quality and volume, annotations and labels, identification and automatic extraction of specific medical terms can help deep learning models perform image analysis tasks. Moreover, the development of models capable of extracting unattended features and easily incorporated into the architecture of DL networks and the development of techniques to search for a certain network architecture according to the objectives set lead to performance in the interpretation of medical images.展开更多
Covid-19 is a deadly virus that is rapidly spread around the world towards the end of the 2020.The consequences of this virus are quite frightening,especially when accompanied by an underlying disease.The novelty of t...Covid-19 is a deadly virus that is rapidly spread around the world towards the end of the 2020.The consequences of this virus are quite frightening,especially when accompanied by an underlying disease.The novelty of the virus,the constant emergence of different variants and its rapid spread have a negative impact on the control and treatment process.Although the new test kits provide almost certain results,chest X-rays are extremely important to detect the progression and degree of the disease.In addition to the Covid-19 virus,pneumonia and harmless opacity of the lungs also complicate the diagnosis.Considering the negative results caused by the virus and the treatment costs,the importance of fast and accurate diagnosis is clearly seen.In this context,deep learning methods appear as an extremely popular approach.In this study,a hybrid model design with superior properties of convolutional neural networks is presented to correctly classify the Covid-19 disease.In addition,in order to contribute to the literature,a suitable dataset with balanced case numbers that can be used in all artificial intelligence classification studies is presented.With this ensemble model design,quite remarkable results are obtained for the diagnosis of three and four-class Covid-19.The proposed model can classify normal,pneumonia,and Covid-19 with 92.6%accuracy and 82.6%for normal,pneumonia,Covid-19,and lung opacity.展开更多
Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on...Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on deep learning for the localization and classification of thyroid nodules.First,we propose the multitask TransUnet,which combines the TransUnet encoder and decoder with multitask learning.Second,we propose the DualLoss function,tailored to the thyroid nodule localization and classification tasks.It balances the learning of the localization and classification tasks to help improve the model’s generalization ability.Third,we introduce strategies for augmenting the data.Finally,we submit a novel deep learning model,ThyroidNet,to accurately detect thyroid nodules.Results:ThyroidNet was evaluated on private datasets and was comparable to other existing methods,including U-Net and TransUnet.Experimental results show that ThyroidNet outperformed these methods in localizing and classifying thyroid nodules.It achieved improved accuracy of 3.9%and 1.5%,respectively.Conclusion:ThyroidNet significantly improves the clinical diagnosis of thyroid nodules and supports medical image analysis tasks.Future research directions include optimization of the model structure,expansion of the dataset size,reduction of computational complexity and memory requirements,and exploration of additional applications of ThyroidNet in medical image analysis.展开更多
Pneumonia ranks as a leading cause of mortality, particularly in children aged five and under. Detecting this disease typically requires radiologists to examine chest X-rays and report their findings to physicians, a ...Pneumonia ranks as a leading cause of mortality, particularly in children aged five and under. Detecting this disease typically requires radiologists to examine chest X-rays and report their findings to physicians, a task susceptible to human error. The application of Deep Transfer Learning (DTL) for the identification of pneumonia through chest X-rays is hindered by a shortage of available images, which has led to less than optimal DTL performance and issues with overfitting. Overfitting is characterized by a model’s learning that is too closely fitted to the training data, reducing its effectiveness on unseen data. The problem of overfitting is especially prevalent in medical image processing due to the high costs and extensive time required for image annotation, as well as the challenge of collecting substantial datasets that also respect patient privacy concerning infectious diseases such as pneumonia. To mitigate these challenges, this paper introduces the use of conditional generative adversarial networks (CGAN) to enrich the pneumonia dataset with 2690 synthesized X-ray images of the minority class, aiming to even out the dataset distribution for improved diagnostic performance. Subsequently, we applied four modified lightweight deep transfer learning models such as Xception, MobileNetV2, MobileNet, and EfficientNetB0. These models have been fine-tuned and evaluated, demonstrating remarkable detection accuracies of 99.26%, 98.23%, 97.06%, and 94.55%, respectively, across fifty epochs. The experimental results validate that the models we have proposed achieve high detection accuracy rates, with the best model reaching up to 99.26% effectiveness, outperforming other models in the diagnosis of pneumonia from X-ray images.展开更多
Acral melanoma(AM)is a rare and lethal type of skin cancer.It can be diagnosed by expert dermatologists,using dermoscopic imaging.It is challenging for dermatologists to diagnose melanoma because of the very minor dif...Acral melanoma(AM)is a rare and lethal type of skin cancer.It can be diagnosed by expert dermatologists,using dermoscopic imaging.It is challenging for dermatologists to diagnose melanoma because of the very minor differences between melanoma and non-melanoma cancers.Most of the research on skin cancer diagnosis is related to the binary classification of lesions into melanoma and non-melanoma.However,to date,limited research has been conducted on the classification of melanoma subtypes.The current study investigated the effectiveness of dermoscopy and deep learning in classifying melanoma subtypes,such as,AM.In this study,we present a novel deep learning model,developed to classify skin cancer.We utilized a dermoscopic image dataset from the Yonsei University Health System South Korea for the classification of skin lesions.Various image processing and data augmentation techniques have been applied to develop a robust automated system for AM detection.Our custombuilt model is a seven-layered deep convolutional network that was trained from scratch.Additionally,transfer learning was utilized to compare the performance of our model,where AlexNet and ResNet-18 were modified,fine-tuned,and trained on the same dataset.We achieved improved results from our proposed model with an accuracy of more than 90%for AM and benign nevus,respectively.Additionally,using the transfer learning approach,we achieved an average accuracy of nearly 97%,which is comparable to that of state-of-the-art methods.From our analysis and results,we found that our model performed well and was able to effectively classify skin cancer.Our results show that the proposed system can be used by dermatologists in the clinical decision-making process for the early diagnosis of AM.展开更多
Low contrast of Magnetic Resonance(MR)images limits the visibility of subtle structures and adversely affects the outcome of both subjective and automated diagnosis.State-of-the-art contrast boosting techniques intole...Low contrast of Magnetic Resonance(MR)images limits the visibility of subtle structures and adversely affects the outcome of both subjective and automated diagnosis.State-of-the-art contrast boosting techniques intolerably alter inherent features of MR images.Drastic changes in brightness features,induced by post-processing are not appreciated in medical imaging as the grey level values have certain diagnostic meanings.To overcome these issues this paper proposes an algorithm that enhance the contrast of MR images while preserving the underlying features as well.This method termed as Power-law and Logarithmic Modification-based Histogram Equalization(PLMHE)partitions the histogram of the image into two sub histograms after a power-law transformation and a log compression.After a modification intended for improving the dispersion of the sub-histograms and subsequent normalization,cumulative histograms are computed.Enhanced grey level values are computed from the resultant cumulative histograms.The performance of the PLMHE algorithm is comparedwith traditional histogram equalization based algorithms and it has been observed from the results that PLMHE can boost the image contrast without causing dynamic range compression,a significant change in mean brightness,and contrast-overshoot.展开更多
Over the last decade, supervised deep learning on manually annotated big data has been progressing significantly on computer vision tasks. But, the application of deep learning in medical image analysis is limited by ...Over the last decade, supervised deep learning on manually annotated big data has been progressing significantly on computer vision tasks. But, the application of deep learning in medical image analysis is limited by the scarcity of high-quality annotated medical imaging data. An emerging solution is self-supervised learning (SSL), among which contrastive SSL is the most successful approach to rivalling or outperforming supervised learning. This review investigates several state-of-the-art contrastive SSL algorithms originally on natural images as well as their adaptations for medical images, and concludes by discussing recent advances, current limitations, and future directions in applying contrastive SSL in the medical domain.展开更多
This research work develops new and better prognostic markers for predicting Childhood MedulloBlastoma(CMB)using a well-defined deep learning architecture.A deep learning architecture could be designed using ideas fro...This research work develops new and better prognostic markers for predicting Childhood MedulloBlastoma(CMB)using a well-defined deep learning architecture.A deep learning architecture could be designed using ideas from image processing and neural networks to predict CMB using histopathological images.First,a convolution process transforms the histopathological image into deep features that uniquely describe it using different two-dimensional filters of various sizes.A 10-layer deep learning architecture is designed to extract deep features.The introduction of pooling layers in the architecture reduces the feature dimension.The extracted and dimension-reduced deep features from the arrangement of convolution layers and pooling layers are used to classify histopathological images using a neural network classifier.The performance of the CMB classification system is evaluated using 1414(10×magnification)and 1071(100×magnification)augmented histopathological images with five classes of CMB such as desmoplastic,nodular,large cell,classic,and normal.Experimental results show that the average classification accuracy of 99.38%(10×)and 99.07%(100×)is attained by the proposed CNB classification system.展开更多
The earliest and most accurate detection of the pathological manifestations of hepatic diseases ensures effective treatments and thus positive prognostic outcomes.In clinical settings,screening and determining the ext...The earliest and most accurate detection of the pathological manifestations of hepatic diseases ensures effective treatments and thus positive prognostic outcomes.In clinical settings,screening and determining the extent of a pathology are prominent factors in preparing remedial agents and administering approp-riate therapeutic procedures.Moreover,in a patient undergoing liver resection,a realistic preoperative simulation of the subject-specific anatomy and physiology also plays a vital part in conducting initial assessments,making surgical decisions during the procedure,and anticipating postoperative results.Conventionally,various medical imaging modalities,e.g.,computed tomography,magnetic resonance imaging,and positron emission tomography,have been employed to assist in these tasks.In fact,several standardized procedures,such as lesion detection and liver segmentation,are also incorporated into prominent commercial software packages.Thus far,most integrated software as a medical device typically involves tedious interactions from the physician,such as manual delineation and empirical adjustments,as per a given patient.With the rapid progress in digital health approaches,especially medical image analysis,a wide range of computer algorithms have been proposed to facilitate those procedures.They include pattern recognition of a liver,its periphery,and lesion,as well as pre-and postoperative simulations.Prior to clinical adoption,however,software must conform to regulatory requirements set by the governing agency,for instance,valid clinical association and analytical and clinical validation.Therefore,this paper provides a detailed account and discussion of the state-of-the-art methods for liver image analyses,visualization,and simulation in the literature.Emphasis is placed upon their concepts,algorithmic classifications,merits,limitations,clinical considerations,and future research trends.展开更多
(Aim)COVID-19 is an ongoing infectious disease.It has caused more than 107.45 m confirmed cases and 2.35 m deaths till 11/Feb/2021.Traditional computer vision methods have achieved promising results on the automatic s...(Aim)COVID-19 is an ongoing infectious disease.It has caused more than 107.45 m confirmed cases and 2.35 m deaths till 11/Feb/2021.Traditional computer vision methods have achieved promising results on the automatic smart diagnosis.(Method)This study aims to propose a novel deep learning method that can obtain better performance.We use the pseudo-Zernike moment(PZM),derived from Zernike moment,as the extracted features.Two settings are introducing:(i)image plane over unit circle;and(ii)image plane inside the unit circle.Afterward,we use a deep-stacked sparse autoencoder(DSSAE)as the classifier.Besides,multiple-way data augmentation is chosen to overcome overfitting.The multiple-way data augmentation is based on Gaussian noise,salt-and-pepper noise,speckle noise,horizontal and vertical shear,rotation,Gamma correction,random translation and scaling.(Results)10 runs of 10-fold cross validation shows that our PZM-DSSAE method achieves a sensitivity of 92.06%±1.54%,a specificity of 92.56%±1.06%,a precision of 92.53%±1.03%,and an accuracy of 92.31%±1.08%.Its F1 score,MCC,and FMI arrive at 92.29%±1.10%,84.64%±2.15%,and 92.29%±1.10%,respectively.The AUC of our model is 0.9576.(Conclusion)We demonstrate“image plane over unit circle”can get better results than“image plane inside a unit circle.”Besides,this proposed PZM-DSSAE model is better than eight state-of-the-art approaches.展开更多
In March 2020,the World Health Organization declared the coronavirus disease(COVID-19)outbreak as a pandemic due to its uncontrolled global spread.Reverse transcription polymerase chain reaction is a laboratory test t...In March 2020,the World Health Organization declared the coronavirus disease(COVID-19)outbreak as a pandemic due to its uncontrolled global spread.Reverse transcription polymerase chain reaction is a laboratory test that is widely used for the diagnosis of this deadly disease.However,the limited availability of testing kits and qualified staff and the drastically increasing number of cases have hampered massive testing.To handle COVID19 testing problems,we apply the Internet of Things and artificial intelligence to achieve self-adaptive,secure,and fast resource allocation,real-time tracking,remote screening,and patient monitoring.In addition,we implement a cloud platform for efficient spectrum utilization.Thus,we propose a cloudbased intelligent system for remote COVID-19 screening using cognitiveradio-based Internet of Things and deep learning.Specifically,a deep learning technique recognizes radiographic patterns in chest computed tomography(CT)scans.To this end,contrast-limited adaptive histogram equalization is applied to an input CT scan followed by bilateral filtering to enhance the spatial quality.The image quality assessment of the CT scan is performed using the blind/referenceless image spatial quality evaluator.Then,a deep transfer learning model,VGG-16,is trained to diagnose a suspected CT scan as either COVID-19 positive or negative.Experimental results demonstrate that the proposed VGG-16 model outperforms existing COVID-19 screening models regarding accuracy,sensitivity,and specificity.The results obtained from the proposed system can be verified by doctors and sent to remote places through the Internet.展开更多
Information engineering mainly focus on application,uncertainty and information for its utility.This lecture discussed several aspects of information engineering research in Oxford,included the areas of mobile robotic...Information engineering mainly focus on application,uncertainty and information for its utility.This lecture discussed several aspects of information engineering research in Oxford,included the areas of mobile robotics,signal processing,real-time computer vision for object tracking,3D reconstruction of space,medical image analysis and artificial intelligence.Then what information engineering really means was discussed and the possibilities for the future of this field was prospected finally.展开更多
A lump growing in the breast may be referred to as a breast mass related to the tumor.However,not all tumors are cancerous or malignant.Breast masses can cause discomfort and pain,depending on the size and texture of ...A lump growing in the breast may be referred to as a breast mass related to the tumor.However,not all tumors are cancerous or malignant.Breast masses can cause discomfort and pain,depending on the size and texture of the breast.With an appropriate diagnosis,non-cancerous breast masses can be diagnosed earlier to prevent their cultivation from being malignant.With the development of the artificial neural network,the deep discriminative model,such as a convolutional neural network,may evaluate the breast lesion to distinguish benign and malignant cancers frommammogram breast masses images.This work accomplished breastmasses classification relative to benign and malignant cancers using a digital database for screening mammography image datasets.A residual neural network 50(ResNet50)model along with an adaptive gradient algorithm,adaptive moment estimation,and stochastic gradient descent optimizers,as well as data augmentations and fine-tuning methods,were implemented.In addition,a learning rate scheduler and 5-fold cross-validation were applied with 60 training procedures to determine the best models.The results of training accuracy,p-value,test accuracy,area under the curve,sensitivity,precision,F1-score,specificity,and kappa for adaptive gradient algorithm 25%,75%,100%,and stochastic gradient descent 100%fine-tunings indicate that the classifier is feasible for categorizing breast cancer into benign and malignant from the mammographic breast masses images.展开更多
A neural network is one of the current trends in deep learning,which is increasingly gaining attention owing to its contribution in transforming the different facets of human life.It also paves a way to approach the c...A neural network is one of the current trends in deep learning,which is increasingly gaining attention owing to its contribution in transforming the different facets of human life.It also paves a way to approach the current crisis caused by the coronavirus disease(COVID-19)from all scientific directions.Convolutional neural network(CNN),a type of neural network,is extensively applied in the medical field,and is particularly useful in the current COVID-19 pandemic.In this article,we present the application of CNNs for the diagnosis and prognosis of COVID-19 using X-ray and computed tomography(CT)images of COVID-19 patients.The CNN models discussed in this review were mainly developed for the detection,classification,and segmentation of COVID-19 images.The base models used for detection and classification were AlexNet,Visual Geometry Group Network with 16 layers,residual network,DensNet,GoogLeNet,MobileNet,Inception,and extreme Inception.U-Net and voxel-based broad learning network were used for segmentation.Even with limited datasets,these methods proved to be beneficial for efficiently identifying the occurrence of COVID-19.To further validate these observations,we conducted an experimental study using a simple CNN framework for the binary classification of COVID-19 CT images.We achieved an accuracy of 93%with an F1-score of 0.93.Thus,with the availability of improved medical image datasets,it is evident that CNNs are very useful for the efficient diagnosis and prognosis of COVID-19.展开更多
This research implements a novel segmentation of mammographic mass.Three methods are proposed,namely,segmentation of mass based on iterative active contour,automatic region growing,and fully automatic mask selectionba...This research implements a novel segmentation of mammographic mass.Three methods are proposed,namely,segmentation of mass based on iterative active contour,automatic region growing,and fully automatic mask selectionbased active contour techniques.In the first method,iterative threshold is performed for manual cropped preprocessed image,and active contour is applied thereafter.To overcome manual cropping in the second method,an automatic seed selection followed by region growing is performed.Given that the result is only a few images owing to over segmentation,the third method uses a fully automatic active contour.Results of the segmentation techniques are compared with the manual markup by experts,specifically by taking the difference in their mean values.Accordingly,the difference in the mean value of the third method is 1.0853,which indicates the closeness of the segmentation.Moreover,the proposed method is compared with the existing fuzzy C means and level set methods.The automatic mass segmentation based on active contour technique results in segmentation with high accuracy.By using adaptive neuro fuzzy inference system,classification is done and results in a sensitivity of 94.73%,accuracy of 93.93%,and Mathew’s correlation coefficient(MCC)of 0.876.展开更多
Colorectal cancer is one of the biggest health threats to humans and takes thousands of lives every year.Colonoscopy is the gold standard in clinical practice to inspect the intestinal wall,detect polyps and remove po...Colorectal cancer is one of the biggest health threats to humans and takes thousands of lives every year.Colonoscopy is the gold standard in clinical practice to inspect the intestinal wall,detect polyps and remove polypsin early stages,preventing polyps from becoming malignant and forming colorectal cancer instances.In recentyears,computer-aided polyp detection systems have been widely used in colonoscopies to improve the qualityof colonoscopy examination and increase the polyp detection rate.Currently,the most efficient computer-aidedsystems are built with machine learning methods.However,developing such a computer-aided detection systemrequires experienced doctors to label a large number of image data from colonoscopy videos,which is extremelytime-consuming,laborious and expensive.One possible solution is to adopt a semi-supervised learning,which canbuild a detection system on a dataset where part of its data is not necessary to be labeled.In this paper,on thebasis of state-of-the-art object detection method and semi-supervised learning technique,we design and implementa semi-supervised colonoscopy polyp detection system containing four main steps:running standard supervisedtraining with all labeled data;running inference on unlabeled data to obtain pseudo labels;applying a set ofstrong augmentation to both unlabeled data and pseudo label;combining labeled data,and unlabeled data withits pseudo labels to retrain the detector.The semi-supervised learning system is evaluated both on public datasetand our original private dataset and proves its effectiveness.Also,the inference speed of the semi-supervisedlearning system can meet the requirement of real-time operation.展开更多
基金support for this work from the Deanship of Scientific Research (DSR),University of Tabuk,Tabuk,Saudi Arabia,under grant number S-1440-0262.
文摘Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.
基金the National Natural Science Foundation of China(Grant No.62106101)the Natural Science Foundation of Jiangsu Province(Grant No.BK20210180).
文摘Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used in to full-stack clinical applications,including image synthesis/reconstruction,registration,segmentation,detection,and diagnosis.This paper aimed to promote awareness of the applications of transformers in medical image analysis.Specifically,we first provided an overview of the core concepts of the attention mechanism built into transformers and other basic components.Second,we reviewed various transformer architectures tailored for medical image applications and discuss their limitations.Within this review,we investigated key challenges including the use of transformers in different learning paradigms,improving model efficiency,and coupling with other techniques.We hope this review would provide a comprehensive picture of transformers to readers with an interest in medical image analysis.
基金Foundation of China(No.61902311)funding for this studysupported in part by the Natural Science Foundation of Shaanxi Province in China under Grants 2022JM-508,2022JM-317 and 2019JM-162.
文摘Computer-aided diagnosis based on image color rendering promotes medical image analysis and doctor-patient communication by highlighting important information of medical diagnosis.To overcome the limitations of the color rendering method based on deep learning,such as poor model stability,poor rendering quality,fuzzy boundaries and crossed color boundaries,we propose a novel hinge-cross-entropy generative adversarial network(HCEGAN).The self-attention mechanism was added and improved to focus on the important information of the image.And the hinge-cross-entropy loss function was used to stabilize the training process of GAN models.In this study,we implement the HCEGAN model for image color rendering based on DIV2K and COCO datasets,and evaluate the results using SSIM and PSNR.The experimental results show that the proposed HCEGAN automatically re-renders images,significantly improves the quality of color rendering and greatly improves the stability of prior GAN models.
基金This research was funded by the National Natural Science Foundation of China(Nos.71762010,62262019,62162025,61966013,12162012)the Hainan Provincial Natural Science Foundation of China(Nos.823RC488,623RC481,620RC603,621QN241,620RC602,121RC536)+1 种基金the Haikou Science and Technology Plan Project of China(No.2022-016)the Project supported by the Education Department of Hainan Province,No.Hnky2021-23.
文摘Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR detection methods have mainly relied on manual feature extraction and classification,leading to errors.This paper proposes a novel VTDR detection and classification model that combines different models through majority voting.Our proposed methodology involves preprocessing,data augmentation,feature extraction,and classification stages.We use a hybrid convolutional neural network-singular value decomposition(CNN-SVD)model for feature extraction and selection and an improved SVM-RBF with a Decision Tree(DT)and K-Nearest Neighbor(KNN)for classification.We tested our model on the IDRiD dataset and achieved an accuracy of 98.06%,a sensitivity of 83.67%,and a specificity of 100%for DR detection and evaluation tests,respectively.Our proposed approach outperforms baseline techniques and provides a more robust and accurate method for VTDR detection.
文摘Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importance of each component, describing the specificity and correlations of these elements involved in achieving the precision of interpretation of medical images using DL. The major contribution of this work is primarily to the updated characterisation of the characteristics of the constituent elements of the deep learning process, scientific data, methods of knowledge incorporation, DL models according to the objectives for which they were designed and the presentation of medical applications in accordance with these tasks. Secondly, it describes the specific correlations between the quality, type and volume of data, the deep learning patterns used in the interpretation of diagnostic medical images and their applications in medicine. Finally presents problems and directions of future research. Data quality and volume, annotations and labels, identification and automatic extraction of specific medical terms can help deep learning models perform image analysis tasks. Moreover, the development of models capable of extracting unattended features and easily incorporated into the architecture of DL networks and the development of techniques to search for a certain network architecture according to the objectives set lead to performance in the interpretation of medical images.
文摘Covid-19 is a deadly virus that is rapidly spread around the world towards the end of the 2020.The consequences of this virus are quite frightening,especially when accompanied by an underlying disease.The novelty of the virus,the constant emergence of different variants and its rapid spread have a negative impact on the control and treatment process.Although the new test kits provide almost certain results,chest X-rays are extremely important to detect the progression and degree of the disease.In addition to the Covid-19 virus,pneumonia and harmless opacity of the lungs also complicate the diagnosis.Considering the negative results caused by the virus and the treatment costs,the importance of fast and accurate diagnosis is clearly seen.In this context,deep learning methods appear as an extremely popular approach.In this study,a hybrid model design with superior properties of convolutional neural networks is presented to correctly classify the Covid-19 disease.In addition,in order to contribute to the literature,a suitable dataset with balanced case numbers that can be used in all artificial intelligence classification studies is presented.With this ensemble model design,quite remarkable results are obtained for the diagnosis of three and four-class Covid-19.The proposed model can classify normal,pneumonia,and Covid-19 with 92.6%accuracy and 82.6%for normal,pneumonia,Covid-19,and lung opacity.
基金supported by MRC,UK (MC_PC_17171)Royal Society,UK (RP202G0230)+8 种基金BHF,UK (AA/18/3/34220)Hope Foundation for Cancer Research,UK (RM60G0680)GCRF,UK (P202PF11)Sino-UK Industrial Fund,UK (RP202G0289)LIAS,UK (P202ED10,P202RE969)Data Science Enhancement Fund,UK (P202RE237)Fight for Sight,UK (24NN201)Sino-UK Education Fund,UK (OP202006)BBSRC,UK (RM32G0178B8).
文摘Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on deep learning for the localization and classification of thyroid nodules.First,we propose the multitask TransUnet,which combines the TransUnet encoder and decoder with multitask learning.Second,we propose the DualLoss function,tailored to the thyroid nodule localization and classification tasks.It balances the learning of the localization and classification tasks to help improve the model’s generalization ability.Third,we introduce strategies for augmenting the data.Finally,we submit a novel deep learning model,ThyroidNet,to accurately detect thyroid nodules.Results:ThyroidNet was evaluated on private datasets and was comparable to other existing methods,including U-Net and TransUnet.Experimental results show that ThyroidNet outperformed these methods in localizing and classifying thyroid nodules.It achieved improved accuracy of 3.9%and 1.5%,respectively.Conclusion:ThyroidNet significantly improves the clinical diagnosis of thyroid nodules and supports medical image analysis tasks.Future research directions include optimization of the model structure,expansion of the dataset size,reduction of computational complexity and memory requirements,and exploration of additional applications of ThyroidNet in medical image analysis.
文摘Pneumonia ranks as a leading cause of mortality, particularly in children aged five and under. Detecting this disease typically requires radiologists to examine chest X-rays and report their findings to physicians, a task susceptible to human error. The application of Deep Transfer Learning (DTL) for the identification of pneumonia through chest X-rays is hindered by a shortage of available images, which has led to less than optimal DTL performance and issues with overfitting. Overfitting is characterized by a model’s learning that is too closely fitted to the training data, reducing its effectiveness on unseen data. The problem of overfitting is especially prevalent in medical image processing due to the high costs and extensive time required for image annotation, as well as the challenge of collecting substantial datasets that also respect patient privacy concerning infectious diseases such as pneumonia. To mitigate these challenges, this paper introduces the use of conditional generative adversarial networks (CGAN) to enrich the pneumonia dataset with 2690 synthesized X-ray images of the minority class, aiming to even out the dataset distribution for improved diagnostic performance. Subsequently, we applied four modified lightweight deep transfer learning models such as Xception, MobileNetV2, MobileNet, and EfficientNetB0. These models have been fine-tuned and evaluated, demonstrating remarkable detection accuracies of 99.26%, 98.23%, 97.06%, and 94.55%, respectively, across fifty epochs. The experimental results validate that the models we have proposed achieve high detection accuracy rates, with the best model reaching up to 99.26% effectiveness, outperforming other models in the diagnosis of pneumonia from X-ray images.
文摘Acral melanoma(AM)is a rare and lethal type of skin cancer.It can be diagnosed by expert dermatologists,using dermoscopic imaging.It is challenging for dermatologists to diagnose melanoma because of the very minor differences between melanoma and non-melanoma cancers.Most of the research on skin cancer diagnosis is related to the binary classification of lesions into melanoma and non-melanoma.However,to date,limited research has been conducted on the classification of melanoma subtypes.The current study investigated the effectiveness of dermoscopy and deep learning in classifying melanoma subtypes,such as,AM.In this study,we present a novel deep learning model,developed to classify skin cancer.We utilized a dermoscopic image dataset from the Yonsei University Health System South Korea for the classification of skin lesions.Various image processing and data augmentation techniques have been applied to develop a robust automated system for AM detection.Our custombuilt model is a seven-layered deep convolutional network that was trained from scratch.Additionally,transfer learning was utilized to compare the performance of our model,where AlexNet and ResNet-18 were modified,fine-tuned,and trained on the same dataset.We achieved improved results from our proposed model with an accuracy of more than 90%for AM and benign nevus,respectively.Additionally,using the transfer learning approach,we achieved an average accuracy of nearly 97%,which is comparable to that of state-of-the-art methods.From our analysis and results,we found that our model performed well and was able to effectively classify skin cancer.Our results show that the proposed system can be used by dermatologists in the clinical decision-making process for the early diagnosis of AM.
基金This work was supported by Taif university Researchers Supporting Project Number(TURSP-2020/114),Taif University,Taif,Saudi Arabia.
文摘Low contrast of Magnetic Resonance(MR)images limits the visibility of subtle structures and adversely affects the outcome of both subjective and automated diagnosis.State-of-the-art contrast boosting techniques intolerably alter inherent features of MR images.Drastic changes in brightness features,induced by post-processing are not appreciated in medical imaging as the grey level values have certain diagnostic meanings.To overcome these issues this paper proposes an algorithm that enhance the contrast of MR images while preserving the underlying features as well.This method termed as Power-law and Logarithmic Modification-based Histogram Equalization(PLMHE)partitions the histogram of the image into two sub histograms after a power-law transformation and a log compression.After a modification intended for improving the dispersion of the sub-histograms and subsequent normalization,cumulative histograms are computed.Enhanced grey level values are computed from the resultant cumulative histograms.The performance of the PLMHE algorithm is comparedwith traditional histogram equalization based algorithms and it has been observed from the results that PLMHE can boost the image contrast without causing dynamic range compression,a significant change in mean brightness,and contrast-overshoot.
文摘Over the last decade, supervised deep learning on manually annotated big data has been progressing significantly on computer vision tasks. But, the application of deep learning in medical image analysis is limited by the scarcity of high-quality annotated medical imaging data. An emerging solution is self-supervised learning (SSL), among which contrastive SSL is the most successful approach to rivalling or outperforming supervised learning. This review investigates several state-of-the-art contrastive SSL algorithms originally on natural images as well as their adaptations for medical images, and concludes by discussing recent advances, current limitations, and future directions in applying contrastive SSL in the medical domain.
文摘This research work develops new and better prognostic markers for predicting Childhood MedulloBlastoma(CMB)using a well-defined deep learning architecture.A deep learning architecture could be designed using ideas from image processing and neural networks to predict CMB using histopathological images.First,a convolution process transforms the histopathological image into deep features that uniquely describe it using different two-dimensional filters of various sizes.A 10-layer deep learning architecture is designed to extract deep features.The introduction of pooling layers in the architecture reduces the feature dimension.The extracted and dimension-reduced deep features from the arrangement of convolution layers and pooling layers are used to classify histopathological images using a neural network classifier.The performance of the CMB classification system is evaluated using 1414(10×magnification)and 1071(100×magnification)augmented histopathological images with five classes of CMB such as desmoplastic,nodular,large cell,classic,and normal.Experimental results show that the average classification accuracy of 99.38%(10×)and 99.07%(100×)is attained by the proposed CNB classification system.
文摘The earliest and most accurate detection of the pathological manifestations of hepatic diseases ensures effective treatments and thus positive prognostic outcomes.In clinical settings,screening and determining the extent of a pathology are prominent factors in preparing remedial agents and administering approp-riate therapeutic procedures.Moreover,in a patient undergoing liver resection,a realistic preoperative simulation of the subject-specific anatomy and physiology also plays a vital part in conducting initial assessments,making surgical decisions during the procedure,and anticipating postoperative results.Conventionally,various medical imaging modalities,e.g.,computed tomography,magnetic resonance imaging,and positron emission tomography,have been employed to assist in these tasks.In fact,several standardized procedures,such as lesion detection and liver segmentation,are also incorporated into prominent commercial software packages.Thus far,most integrated software as a medical device typically involves tedious interactions from the physician,such as manual delineation and empirical adjustments,as per a given patient.With the rapid progress in digital health approaches,especially medical image analysis,a wide range of computer algorithms have been proposed to facilitate those procedures.They include pattern recognition of a liver,its periphery,and lesion,as well as pre-and postoperative simulations.Prior to clinical adoption,however,software must conform to regulatory requirements set by the governing agency,for instance,valid clinical association and analytical and clinical validation.Therefore,this paper provides a detailed account and discussion of the state-of-the-art methods for liver image analyses,visualization,and simulation in the literature.Emphasis is placed upon their concepts,algorithmic classifications,merits,limitations,clinical considerations,and future research trends.
基金This study was supported by Royal Society International Exchanges Cost Share Award,UK(RP202G0230)Medical Research Council Confidence in Concept Award,UK(MC_PC_17171)+1 种基金Hope Foundation for Cancer Research,UK(RM60G0680)Global Challenges Research Fund(GCRF),UK(P202PF11)。
文摘(Aim)COVID-19 is an ongoing infectious disease.It has caused more than 107.45 m confirmed cases and 2.35 m deaths till 11/Feb/2021.Traditional computer vision methods have achieved promising results on the automatic smart diagnosis.(Method)This study aims to propose a novel deep learning method that can obtain better performance.We use the pseudo-Zernike moment(PZM),derived from Zernike moment,as the extracted features.Two settings are introducing:(i)image plane over unit circle;and(ii)image plane inside the unit circle.Afterward,we use a deep-stacked sparse autoencoder(DSSAE)as the classifier.Besides,multiple-way data augmentation is chosen to overcome overfitting.The multiple-way data augmentation is based on Gaussian noise,salt-and-pepper noise,speckle noise,horizontal and vertical shear,rotation,Gamma correction,random translation and scaling.(Results)10 runs of 10-fold cross validation shows that our PZM-DSSAE method achieves a sensitivity of 92.06%±1.54%,a specificity of 92.56%±1.06%,a precision of 92.53%±1.03%,and an accuracy of 92.31%±1.08%.Its F1 score,MCC,and FMI arrive at 92.29%±1.10%,84.64%±2.15%,and 92.29%±1.10%,respectively.The AUC of our model is 0.9576.(Conclusion)We demonstrate“image plane over unit circle”can get better results than“image plane inside a unit circle.”Besides,this proposed PZM-DSSAE model is better than eight state-of-the-art approaches.
基金This study was supported by the grant of the National Research Foundation of Korea(NRF 2016M3A9E9942010)the grants of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute(KHIDI)+1 种基金funded by the Ministry of Health&Welfare(HI18C1216)the Soonchunhyang University Research Fund.
文摘In March 2020,the World Health Organization declared the coronavirus disease(COVID-19)outbreak as a pandemic due to its uncontrolled global spread.Reverse transcription polymerase chain reaction is a laboratory test that is widely used for the diagnosis of this deadly disease.However,the limited availability of testing kits and qualified staff and the drastically increasing number of cases have hampered massive testing.To handle COVID19 testing problems,we apply the Internet of Things and artificial intelligence to achieve self-adaptive,secure,and fast resource allocation,real-time tracking,remote screening,and patient monitoring.In addition,we implement a cloud platform for efficient spectrum utilization.Thus,we propose a cloudbased intelligent system for remote COVID-19 screening using cognitiveradio-based Internet of Things and deep learning.Specifically,a deep learning technique recognizes radiographic patterns in chest computed tomography(CT)scans.To this end,contrast-limited adaptive histogram equalization is applied to an input CT scan followed by bilateral filtering to enhance the spatial quality.The image quality assessment of the CT scan is performed using the blind/referenceless image spatial quality evaluator.Then,a deep transfer learning model,VGG-16,is trained to diagnose a suspected CT scan as either COVID-19 positive or negative.Experimental results demonstrate that the proposed VGG-16 model outperforms existing COVID-19 screening models regarding accuracy,sensitivity,and specificity.The results obtained from the proposed system can be verified by doctors and sent to remote places through the Internet.
文摘Information engineering mainly focus on application,uncertainty and information for its utility.This lecture discussed several aspects of information engineering research in Oxford,included the areas of mobile robotics,signal processing,real-time computer vision for object tracking,3D reconstruction of space,medical image analysis and artificial intelligence.Then what information engineering really means was discussed and the possibilities for the future of this field was prospected finally.
基金This research was supported by the National Research Foundation of Korea(NRF)grant funded by the Korean government(MSIT)[NRF-2019R1F1A1062397,NRF-2021R1F1A1059665]Brain Korea 21 FOUR Project(Dept.of IT Convergence Engineering,Kumoh National Institute of Technology)This paper was supported by Korea Institute for Advancement of Technology(KIAT)grant funded by the Korea Government(MOTIE)[P0017123,The Competency Development Program for Industry Specialist].
文摘A lump growing in the breast may be referred to as a breast mass related to the tumor.However,not all tumors are cancerous or malignant.Breast masses can cause discomfort and pain,depending on the size and texture of the breast.With an appropriate diagnosis,non-cancerous breast masses can be diagnosed earlier to prevent their cultivation from being malignant.With the development of the artificial neural network,the deep discriminative model,such as a convolutional neural network,may evaluate the breast lesion to distinguish benign and malignant cancers frommammogram breast masses images.This work accomplished breastmasses classification relative to benign and malignant cancers using a digital database for screening mammography image datasets.A residual neural network 50(ResNet50)model along with an adaptive gradient algorithm,adaptive moment estimation,and stochastic gradient descent optimizers,as well as data augmentations and fine-tuning methods,were implemented.In addition,a learning rate scheduler and 5-fold cross-validation were applied with 60 training procedures to determine the best models.The results of training accuracy,p-value,test accuracy,area under the curve,sensitivity,precision,F1-score,specificity,and kappa for adaptive gradient algorithm 25%,75%,100%,and stochastic gradient descent 100%fine-tunings indicate that the classifier is feasible for categorizing breast cancer into benign and malignant from the mammographic breast masses images.
文摘A neural network is one of the current trends in deep learning,which is increasingly gaining attention owing to its contribution in transforming the different facets of human life.It also paves a way to approach the current crisis caused by the coronavirus disease(COVID-19)from all scientific directions.Convolutional neural network(CNN),a type of neural network,is extensively applied in the medical field,and is particularly useful in the current COVID-19 pandemic.In this article,we present the application of CNNs for the diagnosis and prognosis of COVID-19 using X-ray and computed tomography(CT)images of COVID-19 patients.The CNN models discussed in this review were mainly developed for the detection,classification,and segmentation of COVID-19 images.The base models used for detection and classification were AlexNet,Visual Geometry Group Network with 16 layers,residual network,DensNet,GoogLeNet,MobileNet,Inception,and extreme Inception.U-Net and voxel-based broad learning network were used for segmentation.Even with limited datasets,these methods proved to be beneficial for efficiently identifying the occurrence of COVID-19.To further validate these observations,we conducted an experimental study using a simple CNN framework for the binary classification of COVID-19 CT images.We achieved an accuracy of 93%with an F1-score of 0.93.Thus,with the availability of improved medical image datasets,it is evident that CNNs are very useful for the efficient diagnosis and prognosis of COVID-19.
文摘This research implements a novel segmentation of mammographic mass.Three methods are proposed,namely,segmentation of mass based on iterative active contour,automatic region growing,and fully automatic mask selectionbased active contour techniques.In the first method,iterative threshold is performed for manual cropped preprocessed image,and active contour is applied thereafter.To overcome manual cropping in the second method,an automatic seed selection followed by region growing is performed.Given that the result is only a few images owing to over segmentation,the third method uses a fully automatic active contour.Results of the segmentation techniques are compared with the manual markup by experts,specifically by taking the difference in their mean values.Accordingly,the difference in the mean value of the third method is 1.0853,which indicates the closeness of the segmentation.Moreover,the proposed method is compared with the existing fuzzy C means and level set methods.The automatic mass segmentation based on active contour technique results in segmentation with high accuracy.By using adaptive neuro fuzzy inference system,classification is done and results in a sensitivity of 94.73%,accuracy of 93.93%,and Mathew’s correlation coefficient(MCC)of 0.876.
基金the National Natural Science Foundation of China(No.61977046)the Science and Interdisciplinary Program of Shanghai Jiao Tong University(No.19X190020109)the Interdisciplinary Program of Shanghai Jiao Tong University(No.YG2022ZD031)。
文摘Colorectal cancer is one of the biggest health threats to humans and takes thousands of lives every year.Colonoscopy is the gold standard in clinical practice to inspect the intestinal wall,detect polyps and remove polypsin early stages,preventing polyps from becoming malignant and forming colorectal cancer instances.In recentyears,computer-aided polyp detection systems have been widely used in colonoscopies to improve the qualityof colonoscopy examination and increase the polyp detection rate.Currently,the most efficient computer-aidedsystems are built with machine learning methods.However,developing such a computer-aided detection systemrequires experienced doctors to label a large number of image data from colonoscopy videos,which is extremelytime-consuming,laborious and expensive.One possible solution is to adopt a semi-supervised learning,which canbuild a detection system on a dataset where part of its data is not necessary to be labeled.In this paper,on thebasis of state-of-the-art object detection method and semi-supervised learning technique,we design and implementa semi-supervised colonoscopy polyp detection system containing four main steps:running standard supervisedtraining with all labeled data;running inference on unlabeled data to obtain pseudo labels;applying a set ofstrong augmentation to both unlabeled data and pseudo label;combining labeled data,and unlabeled data withits pseudo labels to retrain the detector.The semi-supervised learning system is evaluated both on public datasetand our original private dataset and proves its effectiveness.Also,the inference speed of the semi-supervisedlearning system can meet the requirement of real-time operation.