Lung cancer remains a significant global health challenge and identifying lung cancer at an early stage is essential for enhancing patient outcomes. The study focuses on developing and optimizing gene expression-based...Lung cancer remains a significant global health challenge and identifying lung cancer at an early stage is essential for enhancing patient outcomes. The study focuses on developing and optimizing gene expression-based models for classifying cancer types using machine learning techniques. By applying Log2 normalization to gene expression data and conducting Wilcoxon rank sum tests, the researchers employed various classifiers and Incremental Feature Selection (IFS) strategies. The study culminated in two optimized models using the XGBoost classifier, comprising 10 and 74 genes respectively. The 10-gene model, due to its simplicity, is proposed for easier clinical implementation, whereas the 74-gene model exhibited superior performance in terms of Specificity, AUC (Area Under the Curve), and Precision. These models were evaluated based on their sensitivity, AUC, and specificity, aiming to achieve high sensitivity and AUC while maintaining reasonable specificity.展开更多
Objective: To discuss the clinical and imaging diagnostic rules of peripheral lung cancer by data mining technique, and to explore new ideas in the diagnosis of peripheral lung cancer, and to obtain early-stage techn...Objective: To discuss the clinical and imaging diagnostic rules of peripheral lung cancer by data mining technique, and to explore new ideas in the diagnosis of peripheral lung cancer, and to obtain early-stage technology and knowledge support of computer-aided detecting (CAD). Methods: 58 cases of peripheral lung cancer confirmed by clinical pathology were collected. The data were imported into the database after the standardization of the clinical and CT findings attributes were identified. The data was studied comparatively based on Association Rules (AR) of the knowledge discovery process and the Rough Set (RS) reduction algorithm and Genetic Algorithm(GA) of the generic data analysis tool (ROSETTA), respectively. Results: The genetic classification algorithm of ROSETTA generates 5 000 or so diagnosis rules. The RS reduction algorithm of Johnson's Algorithm generates 51 diagnosis rules and the AR algorithm generates 123 diagnosis rules. Three data mining methods basically consider gender, age, cough, location, lobulation sign, shape, ground-glass density attributes as the main basis for the diagnosis of peripheral lung cancer. Conclusion: These diagnosis rules for peripheral lung cancer with three data mining technology is same as clinical diagnostic rules, and these rules also can be used to build the knowledge base of expert system. This study demonstrated the potential values of data mining technology in clinical imaging diagnosis and differential diagnosis.展开更多
Lung cancer is the most common cancer type worldwide and has the highest and second highest mortality rate for men and women respectively in Germany.Yet,the role of comorbid illnesses in lung cancer patient prognosis ...Lung cancer is the most common cancer type worldwide and has the highest and second highest mortality rate for men and women respectively in Germany.Yet,the role of comorbid illnesses in lung cancer patient prognosis is still debated.We analyzed administrative claims data from one of the largest statutory health insurance(SHI)funds in Germany,covering close to 9 million people(11%of the national population);observation period was from 2005 to 2019.Lung cancer patients and their concomitant diseases were identified by ICD-10-GM codes.Comorbidities were classified according to the Charlson Comorbidity Index(CCI).Incidence,comorbidity prevalence and survival are estimated considering sex,age at diagnosis,and place of residence.Kaplan Meier curves with 95%confidence intervals were built in relation to common comorbidities.We identified 70,698 lung cancer incident cases in the sample.Incidence and survival figures are comparable to official statistics in Germany.Most prevalent comorbidities are chronic obstructive pulmonary disease(COPD)(36.7%),followed by peripheral vascular disease(PVD)(18.7%),diabetes without chronic complications(17.4%),congestive heart failure(CHF)(16.5%)and renal disease(14.7%).Relative to overall survival,lung cancer patients with CHF,cerebrovascular disease(CEVD)and renal disease are associated with largest drops in survival probabilities(9%or higher),while those with PVD and diabetes without chronic complications with moderate drops(7%or lower).The study showed a negative association between survival and most common comorbidities among lung cancer patients,based on a large sample for Germany.Further research needs to explore the individual effect of comorbidities disentangled from that of other patient characteristics such as cancer stage and histology.展开更多
Objective: To analyze the experience of chief physician Xiong Lu in treating metaphase and advanced lung cancer through using TCM inheritance support system (V2.5). Methods: Collecting the prescriptions used for m...Objective: To analyze the experience of chief physician Xiong Lu in treating metaphase and advanced lung cancer through using TCM inheritance support system (V2.5). Methods: Collecting the prescriptions used for metaphase and advanced lung cancer from November 1, 2014 to February 1, 2015, then the data were entered into the TCM inheritance support system. Based on principle analysis, revised mutual information, complex system entropy cluster and unsupervised hierarchical clustering composing principles were analyzed. Results: Based on the analysis of 228 cases of prescriptions, the frequency of each Chinese medicinal herb and association rules among herbs included in the database were computed. 15 core combinations and 2 new prescriptions were explored from the database. Conclusion: In treating metaphase and advanced lung cancer, chief physician Xiong Lu pay attention to Fuzheng Peiben (Therapy for support Zheng-qi to propup root), according to the different situation cooperate with Tong Luo (dredging collaterals), San Jie (Dissipating a mass), Huo Xue (Activating blood), Gong Du (Counteracting toxic substance) and so on. Xiong Lu is also good at using toxic drugs and incompatible medicaments.展开更多
The extent of the peril associated with cancer can be perceivedfrom the lack of treatment, ineffective early diagnosis techniques, and mostimportantly its fatality rate. Globally, cancer is the second leading cause of...The extent of the peril associated with cancer can be perceivedfrom the lack of treatment, ineffective early diagnosis techniques, and mostimportantly its fatality rate. Globally, cancer is the second leading cause ofdeath and among over a hundred types of cancer;lung cancer is the secondmost common type of cancer as well as the leading cause of cancer-relateddeaths. Anyhow, an accurate lung cancer diagnosis in a timely manner canelevate the likelihood of survival by a noticeable margin and medical imagingis a prevalent manner of cancer diagnosis since it is easily accessible to peoplearound the globe. Nonetheless, this is not eminently efficacious consideringhuman inspection of medical images can yield a high false positive rate. Ineffectiveand inefficient diagnosis is a crucial reason for such a high mortalityrate for this malady. However, the conspicuous advancements in deep learningand artificial intelligence have stimulated the development of exceedinglyprecise diagnosis systems. The development and performance of these systemsrely prominently on the data that is used to train these systems. A standardproblem witnessed in publicly available medical image datasets is the severeimbalance of data between different classes. This grave imbalance of data canmake a deep learning model biased towards the dominant class and unableto generalize. This study aims to present an end-to-end convolutional neuralnetwork that can accurately differentiate lung nodules from non-nodules andreduce the false positive rate to a bare minimum. To tackle the problem ofdata imbalance, we oversampled the data by transforming available images inthe minority class. The average false positive rate in the proposed method isa mere 1.5 percent. However, the average false negative rate is 31.76 percent.The proposed neural network has 68.66 percent sensitivity and 98.42 percentspecificity.展开更多
BACKGROUND The objectives of this study were to identify hub genes and biological pathways involved in lung adenocarcinoma(LUAD)via bioinformatics analysis,and investigate potential therapeutic targets.AIM To determin...BACKGROUND The objectives of this study were to identify hub genes and biological pathways involved in lung adenocarcinoma(LUAD)via bioinformatics analysis,and investigate potential therapeutic targets.AIM To determine reliable prognostic biomarkers for early diagnosis and treatment of LUAD.METHODS To identify potential therapeutic targets for LUAD,two microarray datasets derived from the Gene Expression Omnibus(GEO)database were analyzed,GSE3116959 and GSE118370.Differentially expressed genes(DEGs)in LUAD and normal tissues were identified using the GEO2R tool.The Hiplot database was then used to generate a volcanic map of the DEGs.Weighted gene co-expression network analysis was conducted to cluster the genes in GSE116959 and GSE-118370 into different modules,and identify immune genes shared between them.A protein-protein interaction network was established using the Search Tool for the Retrieval of Interacting Genes database,then the CytoNCA and CytoHubba components of Cytoscape software were used to visualize the genes.Hub genes with high scores and co-expression were identified,and the Database for Annotation,Visualization and Integrated Discovery was used to perform enrichment analysis of these genes.The diagnostic and prognostic values of the hub genes were calculated using receiver operating characteristic curves and Kaplan-Meier survival analysis,and gene-set enrichment analysis was conducted.The University of Alabama at Birmingham Cancer data analysis portal was used to analyze relationships between the hub genes and normal specimens,as well as their expression during tumor progression.Lastly,validation of protein expression was conducted on the identified hub genes via the Human Protein Atlas database.RESULTS Three hub genes with high connectivity were identified;cellular retinoic acid binding protein 2(CRABP2),matrix metallopeptidase 12(MMP12),and DNA topoisomerase II alpha(TOP2A).High expression of these genes was associated with a poor LUAD prognosis,and the genes exhibited high diagnostic value.CONCLUSION Expression levels of CRABP2,MMP12,and TOP2A in LUAD were higher than those in normal lung tissue.This observation has diagnostic value,and is linked to poor LUAD prognosis.These genes may be biomarkers and therapeutic targets in LUAD,but further research is warranted to investigate their usefulness in these respects.展开更多
目的:研究3D-Slicer软件实性肺结节体积测量对不同观察者在肺部影像报告数据系统(lung CT screening reporting and data system,Lung-RADS)分类一致性中的影响。方法:纳入76例患者中的76个实性结节。由3位放射科医师分别采用手动和3D-S...目的:研究3D-Slicer软件实性肺结节体积测量对不同观察者在肺部影像报告数据系统(lung CT screening reporting and data system,Lung-RADS)分类一致性中的影响。方法:纳入76例患者中的76个实性结节。由3位放射科医师分别采用手动和3D-Slicer软件半自动体积测量方法获得结节的直径与体积,并转化为相应的Lung-RADS评分,其中2分为阴性,3分及以上为阳性。采用同类相关系数(intraclass correlation coefficient,ICC)及Bland-Altman指数来评价观察者间直径与体积测量的一致性,Kappa分析评价观察者间Lung-RADS评分及阳性/阴性组间的一致性。结果:ICC分析结果显示手动直径测量的一致性(0.994~0.996)明显低于半自动体积测量的一致性(0.997~0.998),同时Bland-Altman指数分析结果显示手动直径测量的偏倚高于半自动体积测量。采用半自动体积测量,能够比手动直径测量明显提高观察者间Lung-RADS评分及阳性/阴性之间的一致性(0.963~0.975及0.957~0.977 vs.0.833~0.866及0.863~0.892)。结论:3D-Slicer半自动体积测量能够提高实性肺结节观察者间大小测量的一致性,相应的Lung-RADS分类一致性也随之提高。展开更多
Encouraging and astonishing developments have recently been achieved in image-based diagnostic technology.Modern medical care and imaging technology are becoming increasingly inseparable.However,the current diagnosis ...Encouraging and astonishing developments have recently been achieved in image-based diagnostic technology.Modern medical care and imaging technology are becoming increasingly inseparable.However,the current diagnosis pattern of signal to image to knowledge inevitably leads to information distortion and noise introduction in the procedure of image reconstruction(from signal to image).Artificial intelligence(AI)technologies that can mine knowledge from vast amounts of data offer opportunities to disrupt established workflows.In this prospective study,for the first time,we develop an AI-based signal-toknowledge diagnostic scheme for lung nodule classification directly from the computed tomography(CT)raw data(the signal).We find that the raw data achieves almost comparable performance with CT,indicating that it is possible to diagnose diseases without reconstructing images.Moreover,the incorporation of raw data through three common convolutional network structures greatly improves the performance of the CT models in all cohorts(with a gain ranging from 0.01 to 0.12),demonstrating that raw data contains diagnostic information that CT does not possess.Our results break new ground and demonstrate the potential for direct signal-to-knowledge domain analysis.展开更多
文摘Lung cancer remains a significant global health challenge and identifying lung cancer at an early stage is essential for enhancing patient outcomes. The study focuses on developing and optimizing gene expression-based models for classifying cancer types using machine learning techniques. By applying Log2 normalization to gene expression data and conducting Wilcoxon rank sum tests, the researchers employed various classifiers and Incremental Feature Selection (IFS) strategies. The study culminated in two optimized models using the XGBoost classifier, comprising 10 and 74 genes respectively. The 10-gene model, due to its simplicity, is proposed for easier clinical implementation, whereas the 74-gene model exhibited superior performance in terms of Specificity, AUC (Area Under the Curve), and Precision. These models were evaluated based on their sensitivity, AUC, and specificity, aiming to achieve high sensitivity and AUC while maintaining reasonable specificity.
文摘Objective: To discuss the clinical and imaging diagnostic rules of peripheral lung cancer by data mining technique, and to explore new ideas in the diagnosis of peripheral lung cancer, and to obtain early-stage technology and knowledge support of computer-aided detecting (CAD). Methods: 58 cases of peripheral lung cancer confirmed by clinical pathology were collected. The data were imported into the database after the standardization of the clinical and CT findings attributes were identified. The data was studied comparatively based on Association Rules (AR) of the knowledge discovery process and the Rough Set (RS) reduction algorithm and Genetic Algorithm(GA) of the generic data analysis tool (ROSETTA), respectively. Results: The genetic classification algorithm of ROSETTA generates 5 000 or so diagnosis rules. The RS reduction algorithm of Johnson's Algorithm generates 51 diagnosis rules and the AR algorithm generates 123 diagnosis rules. Three data mining methods basically consider gender, age, cough, location, lobulation sign, shape, ground-glass density attributes as the main basis for the diagnosis of peripheral lung cancer. Conclusion: These diagnosis rules for peripheral lung cancer with three data mining technology is same as clinical diagnostic rules, and these rules also can be used to build the knowledge base of expert system. This study demonstrated the potential values of data mining technology in clinical imaging diagnosis and differential diagnosis.
文摘Lung cancer is the most common cancer type worldwide and has the highest and second highest mortality rate for men and women respectively in Germany.Yet,the role of comorbid illnesses in lung cancer patient prognosis is still debated.We analyzed administrative claims data from one of the largest statutory health insurance(SHI)funds in Germany,covering close to 9 million people(11%of the national population);observation period was from 2005 to 2019.Lung cancer patients and their concomitant diseases were identified by ICD-10-GM codes.Comorbidities were classified according to the Charlson Comorbidity Index(CCI).Incidence,comorbidity prevalence and survival are estimated considering sex,age at diagnosis,and place of residence.Kaplan Meier curves with 95%confidence intervals were built in relation to common comorbidities.We identified 70,698 lung cancer incident cases in the sample.Incidence and survival figures are comparable to official statistics in Germany.Most prevalent comorbidities are chronic obstructive pulmonary disease(COPD)(36.7%),followed by peripheral vascular disease(PVD)(18.7%),diabetes without chronic complications(17.4%),congestive heart failure(CHF)(16.5%)and renal disease(14.7%).Relative to overall survival,lung cancer patients with CHF,cerebrovascular disease(CEVD)and renal disease are associated with largest drops in survival probabilities(9%or higher),while those with PVD and diabetes without chronic complications with moderate drops(7%or lower).The study showed a negative association between survival and most common comorbidities among lung cancer patients,based on a large sample for Germany.Further research needs to explore the individual effect of comorbidities disentangled from that of other patient characteristics such as cancer stage and histology.
文摘Objective: To analyze the experience of chief physician Xiong Lu in treating metaphase and advanced lung cancer through using TCM inheritance support system (V2.5). Methods: Collecting the prescriptions used for metaphase and advanced lung cancer from November 1, 2014 to February 1, 2015, then the data were entered into the TCM inheritance support system. Based on principle analysis, revised mutual information, complex system entropy cluster and unsupervised hierarchical clustering composing principles were analyzed. Results: Based on the analysis of 228 cases of prescriptions, the frequency of each Chinese medicinal herb and association rules among herbs included in the database were computed. 15 core combinations and 2 new prescriptions were explored from the database. Conclusion: In treating metaphase and advanced lung cancer, chief physician Xiong Lu pay attention to Fuzheng Peiben (Therapy for support Zheng-qi to propup root), according to the different situation cooperate with Tong Luo (dredging collaterals), San Jie (Dissipating a mass), Huo Xue (Activating blood), Gong Du (Counteracting toxic substance) and so on. Xiong Lu is also good at using toxic drugs and incompatible medicaments.
基金supported this research through the National Research Foundation of Korea (NRF)funded by the Ministry of Science,ICT (2019M3F2A1073387)this work was supported by the Institute for Information&communications Technology Promotion (IITP) (NO.2022-0-00980Cooperative Intelligence Framework of Scene Perception for Autonomous IoT Device).
文摘The extent of the peril associated with cancer can be perceivedfrom the lack of treatment, ineffective early diagnosis techniques, and mostimportantly its fatality rate. Globally, cancer is the second leading cause ofdeath and among over a hundred types of cancer;lung cancer is the secondmost common type of cancer as well as the leading cause of cancer-relateddeaths. Anyhow, an accurate lung cancer diagnosis in a timely manner canelevate the likelihood of survival by a noticeable margin and medical imagingis a prevalent manner of cancer diagnosis since it is easily accessible to peoplearound the globe. Nonetheless, this is not eminently efficacious consideringhuman inspection of medical images can yield a high false positive rate. Ineffectiveand inefficient diagnosis is a crucial reason for such a high mortalityrate for this malady. However, the conspicuous advancements in deep learningand artificial intelligence have stimulated the development of exceedinglyprecise diagnosis systems. The development and performance of these systemsrely prominently on the data that is used to train these systems. A standardproblem witnessed in publicly available medical image datasets is the severeimbalance of data between different classes. This grave imbalance of data canmake a deep learning model biased towards the dominant class and unableto generalize. This study aims to present an end-to-end convolutional neuralnetwork that can accurately differentiate lung nodules from non-nodules andreduce the false positive rate to a bare minimum. To tackle the problem ofdata imbalance, we oversampled the data by transforming available images inthe minority class. The average false positive rate in the proposed method isa mere 1.5 percent. However, the average false negative rate is 31.76 percent.The proposed neural network has 68.66 percent sensitivity and 98.42 percentspecificity.
文摘BACKGROUND The objectives of this study were to identify hub genes and biological pathways involved in lung adenocarcinoma(LUAD)via bioinformatics analysis,and investigate potential therapeutic targets.AIM To determine reliable prognostic biomarkers for early diagnosis and treatment of LUAD.METHODS To identify potential therapeutic targets for LUAD,two microarray datasets derived from the Gene Expression Omnibus(GEO)database were analyzed,GSE3116959 and GSE118370.Differentially expressed genes(DEGs)in LUAD and normal tissues were identified using the GEO2R tool.The Hiplot database was then used to generate a volcanic map of the DEGs.Weighted gene co-expression network analysis was conducted to cluster the genes in GSE116959 and GSE-118370 into different modules,and identify immune genes shared between them.A protein-protein interaction network was established using the Search Tool for the Retrieval of Interacting Genes database,then the CytoNCA and CytoHubba components of Cytoscape software were used to visualize the genes.Hub genes with high scores and co-expression were identified,and the Database for Annotation,Visualization and Integrated Discovery was used to perform enrichment analysis of these genes.The diagnostic and prognostic values of the hub genes were calculated using receiver operating characteristic curves and Kaplan-Meier survival analysis,and gene-set enrichment analysis was conducted.The University of Alabama at Birmingham Cancer data analysis portal was used to analyze relationships between the hub genes and normal specimens,as well as their expression during tumor progression.Lastly,validation of protein expression was conducted on the identified hub genes via the Human Protein Atlas database.RESULTS Three hub genes with high connectivity were identified;cellular retinoic acid binding protein 2(CRABP2),matrix metallopeptidase 12(MMP12),and DNA topoisomerase II alpha(TOP2A).High expression of these genes was associated with a poor LUAD prognosis,and the genes exhibited high diagnostic value.CONCLUSION Expression levels of CRABP2,MMP12,and TOP2A in LUAD were higher than those in normal lung tissue.This observation has diagnostic value,and is linked to poor LUAD prognosis.These genes may be biomarkers and therapeutic targets in LUAD,but further research is warranted to investigate their usefulness in these respects.
文摘目的:研究3D-Slicer软件实性肺结节体积测量对不同观察者在肺部影像报告数据系统(lung CT screening reporting and data system,Lung-RADS)分类一致性中的影响。方法:纳入76例患者中的76个实性结节。由3位放射科医师分别采用手动和3D-Slicer软件半自动体积测量方法获得结节的直径与体积,并转化为相应的Lung-RADS评分,其中2分为阴性,3分及以上为阳性。采用同类相关系数(intraclass correlation coefficient,ICC)及Bland-Altman指数来评价观察者间直径与体积测量的一致性,Kappa分析评价观察者间Lung-RADS评分及阳性/阴性组间的一致性。结果:ICC分析结果显示手动直径测量的一致性(0.994~0.996)明显低于半自动体积测量的一致性(0.997~0.998),同时Bland-Altman指数分析结果显示手动直径测量的偏倚高于半自动体积测量。采用半自动体积测量,能够比手动直径测量明显提高观察者间Lung-RADS评分及阳性/阴性之间的一致性(0.963~0.975及0.957~0.977 vs.0.833~0.866及0.863~0.892)。结论:3D-Slicer半自动体积测量能够提高实性肺结节观察者间大小测量的一致性,相应的Lung-RADS分类一致性也随之提高。
基金supported by the National Key Research and Development Program of China (2017YFA0205200,2023YFC2415200,2021YFF1201003,and 2021YFC2500402)the National Natural Science Foundation of China (82022036,91959130,81971776,62027901,81930053,81771924,62333022,82361168664,62176013,and 82302317)+5 种基金the Beijing Natural Science Foundation (Z20J00105)Strategic Priority Research Program of Chinese Academy of Sciences (XDB38040200)Chinese Academy of Sciences (GJJSTD20170004 and QYZDJ-SSW-JSC005)the Project of High-Level Talents Team Introduction in Zhuhai City (Zhuhai HLHPTP201703)the Youth Innovation Promotion Association CAS (Y2021049)the China Postdoctoral Science Foundation (2021M700341).
文摘Encouraging and astonishing developments have recently been achieved in image-based diagnostic technology.Modern medical care and imaging technology are becoming increasingly inseparable.However,the current diagnosis pattern of signal to image to knowledge inevitably leads to information distortion and noise introduction in the procedure of image reconstruction(from signal to image).Artificial intelligence(AI)technologies that can mine knowledge from vast amounts of data offer opportunities to disrupt established workflows.In this prospective study,for the first time,we develop an AI-based signal-toknowledge diagnostic scheme for lung nodule classification directly from the computed tomography(CT)raw data(the signal).We find that the raw data achieves almost comparable performance with CT,indicating that it is possible to diagnose diseases without reconstructing images.Moreover,the incorporation of raw data through three common convolutional network structures greatly improves the performance of the CT models in all cohorts(with a gain ranging from 0.01 to 0.12),demonstrating that raw data contains diagnostic information that CT does not possess.Our results break new ground and demonstrate the potential for direct signal-to-knowledge domain analysis.