Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both extern...Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both external and internal validation.Methods The ULDor system consists of a convolutional neural network(CNN)trained on around 80 K lesion annotations from about 12 K CT studies in the DeepLesion dataset and 5 other public organ-specific datasets.During the validation process,the test sets include two parts:the external validation dataset which was comprised of 164 sets of non-contrasted chest and upper abdomen CT scans from a comprehensive hospital,and the internal validation dataset which was comprised of 187 sets of low-dose helical CT scans from the National Lung Screening Trial(NLST).We ran the model on the two test sets to output lesion detection.Three board-certified radiologists read the CT scans and verified the detection results of ULDor.We used positive predictive value(PPV)and sensitivity to evaluate the performance of the model in detecting space-occupying lesions at all extra-pulmonary organs visualized on CT images,including liver,kidney,pancreas,adrenal,spleen,esophagus,thyroid,lymph nodes,body wall,thoracic spine,etc.Results In the external validation,the lesion-level PPV and sensitivity of the model were 57.9%and 67.0%,respectively.On average,the model detected 2.1 findings per set,and among them,0.9 were false positives.ULDor worked well for detecting liver lesions,with a PPV of 78.9%and a sensitivity of 92.7%,followed by kidney,with a PPV of 70.0%and a sensitivity of 58.3%.In internal validation with NLST test set,ULDor obtained a PPV of 75.3%and a sensitivity of 52.0%despite the relatively high noise level of soft tissue on images.Conclusions The performance tests of ULDor with the external real-world data have shown its high effectiveness in multiple-purposed detection for lesions in certain organs.With further optimisation and iterative upgrades,ULDor may be well suited for extensive application to external data.展开更多
BACKGROUND Hepatic steatosis is a major cause of chronic liver disease.Two-dimensional(2D)ultrasound is the most widely used non-invasive tool for screening and monitoring,but associated diagnoses are highly subjectiv...BACKGROUND Hepatic steatosis is a major cause of chronic liver disease.Two-dimensional(2D)ultrasound is the most widely used non-invasive tool for screening and monitoring,but associated diagnoses are highly subjective.AIM To develop a scalable deep learning(DL)algorithm for quantitative scoring of liver steatosis from 2D ultrasound images.METHODS Using multi-view ultrasound data from 3310 patients,19513 studies,and 228075 images from a retrospective cohort of patients received elastography,we trained a DL algorithm to diagnose steatosis stages(healthy,mild,moderate,or severe)from clinical ultrasound diagnoses.Performance was validated on two multiscanner unblinded and blinded(initially to DL developer)histology-proven cohorts(147 and 112 patients)with histopathology fatty cell percentage diagnoses and a subset with FibroScan diagnoses.We also quantified reliability across scanners and viewpoints.Results were evaluated using Bland-Altman and receiver operating characteristic(ROC)analysis.RESULTS The DL algorithm demonstrated repeatable measurements with a moderate number of images(three for each viewpoint)and high agreement across three premium ultrasound scanners.High diagnostic performance was observed across all viewpoints:Areas under the curve of the ROC to classify mild,moderate,and severe steatosis grades were 0.85,0.91,and 0.93,respectively.The DL algorithm outperformed or performed at least comparably to FibroScan control attenuation parameter(CAP)with statistically significant improvements for all levels on the unblinded histology-proven cohort and for“=severe”steatosis on the blinded histology-proven cohort.CONCLUSION The DL algorithm provides a reliable quantitative steatosis assessment across view and scanners on two multi-scanner cohorts.Diagnostic performance was high with comparable or better performance than the CAP.展开更多
文摘Objective We developed a universal lesion detector(ULDor)which showed good performance in in-lab experiments.The study aims to evaluate the performance and its ability to generalize in clinical setting via both external and internal validation.Methods The ULDor system consists of a convolutional neural network(CNN)trained on around 80 K lesion annotations from about 12 K CT studies in the DeepLesion dataset and 5 other public organ-specific datasets.During the validation process,the test sets include two parts:the external validation dataset which was comprised of 164 sets of non-contrasted chest and upper abdomen CT scans from a comprehensive hospital,and the internal validation dataset which was comprised of 187 sets of low-dose helical CT scans from the National Lung Screening Trial(NLST).We ran the model on the two test sets to output lesion detection.Three board-certified radiologists read the CT scans and verified the detection results of ULDor.We used positive predictive value(PPV)and sensitivity to evaluate the performance of the model in detecting space-occupying lesions at all extra-pulmonary organs visualized on CT images,including liver,kidney,pancreas,adrenal,spleen,esophagus,thyroid,lymph nodes,body wall,thoracic spine,etc.Results In the external validation,the lesion-level PPV and sensitivity of the model were 57.9%and 67.0%,respectively.On average,the model detected 2.1 findings per set,and among them,0.9 were false positives.ULDor worked well for detecting liver lesions,with a PPV of 78.9%and a sensitivity of 92.7%,followed by kidney,with a PPV of 70.0%and a sensitivity of 58.3%.In internal validation with NLST test set,ULDor obtained a PPV of 75.3%and a sensitivity of 52.0%despite the relatively high noise level of soft tissue on images.Conclusions The performance tests of ULDor with the external real-world data have shown its high effectiveness in multiple-purposed detection for lesions in certain organs.With further optimisation and iterative upgrades,ULDor may be well suited for extensive application to external data.
基金Supported by the Maintenance Project of the Center for Artificial Intelligence,No.CLRPG3H0012 and No.SMRPG3I0011.
文摘BACKGROUND Hepatic steatosis is a major cause of chronic liver disease.Two-dimensional(2D)ultrasound is the most widely used non-invasive tool for screening and monitoring,but associated diagnoses are highly subjective.AIM To develop a scalable deep learning(DL)algorithm for quantitative scoring of liver steatosis from 2D ultrasound images.METHODS Using multi-view ultrasound data from 3310 patients,19513 studies,and 228075 images from a retrospective cohort of patients received elastography,we trained a DL algorithm to diagnose steatosis stages(healthy,mild,moderate,or severe)from clinical ultrasound diagnoses.Performance was validated on two multiscanner unblinded and blinded(initially to DL developer)histology-proven cohorts(147 and 112 patients)with histopathology fatty cell percentage diagnoses and a subset with FibroScan diagnoses.We also quantified reliability across scanners and viewpoints.Results were evaluated using Bland-Altman and receiver operating characteristic(ROC)analysis.RESULTS The DL algorithm demonstrated repeatable measurements with a moderate number of images(three for each viewpoint)and high agreement across three premium ultrasound scanners.High diagnostic performance was observed across all viewpoints:Areas under the curve of the ROC to classify mild,moderate,and severe steatosis grades were 0.85,0.91,and 0.93,respectively.The DL algorithm outperformed or performed at least comparably to FibroScan control attenuation parameter(CAP)with statistically significant improvements for all levels on the unblinded histology-proven cohort and for“=severe”steatosis on the blinded histology-proven cohort.CONCLUSION The DL algorithm provides a reliable quantitative steatosis assessment across view and scanners on two multi-scanner cohorts.Diagnostic performance was high with comparable or better performance than the CAP.