The concept and advantage of reconfigurable technology is introduced. A kind of processor architecture of re configurable macro processor (RMP) model based on FPGA array and DSP is put forward and has been implemented...The concept and advantage of reconfigurable technology is introduced. A kind of processor architecture of re configurable macro processor (RMP) model based on FPGA array and DSP is put forward and has been implemented. Two image algorithms are developed: template-based automatic target recognition and zone labeling. One is estimating for motion direction in the infrared image background, another is line picking-up algorithm based on image zone labeling and phase grouping technique. It is a kind of 'hardware' function that can be called by the DSP in high-level algorithm. It is also a kind of hardware algorithm of the DSP. The results of experiments show the reconfigurable computing technology based on RMP is an ideal accelerating means to deal with the high-speed image processing tasks. High real time performance is obtained in our two applications on RMP.展开更多
The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label...The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label images for machine learning tasks.However,most image note texts are unstructured with heterogeneity and short-paragraph characters,which fail traditional keyword-based techniques.We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers.Bidirectional encoder representations from transformers trained on medical image notes corpus(MinBERT)were proposed.We applied the proposed techniques to two typical classification tasks:Medical image type identification and clinical diagnosis identification.The two methods significantly outperformed baseline methods and presented high accuracies of 99.56%and 99.72%in image type identification and of 94.56%and 92.45%in clinical diagnosis identification.Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions.Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information.Hence,it could serve as a powerful tool for exploring useful training data in various medical AI applications.展开更多
Rapid histology of brain tissues with sufficient diagnostic information has the great potential to aid neurosurgons during operations.Stimulated Raman Scattering(SRS)microscopy is an emerging label-free imaging techni...Rapid histology of brain tissues with sufficient diagnostic information has the great potential to aid neurosurgons during operations.Stimulated Raman Scattering(SRS)microscopy is an emerging label-free imaging technique,with the intrinsic chemical resolutions to delineate brain tumors from normal tissues without the nood of time-consuming tissue processing.Growing number of studies have shown SRS as a“virtual histology"tool for rapid diagnosis of various types of brain tumors.In this review,we focus on the basic principles and current developments of SRS microscopy,as well as its applications for brain tumor imaging.展开更多
Crowdsourcing is an effective method to obtain large databases of manually-labeled images, which is especially important for image understanding with supervised machine learning algorithms. However, for several kinds ...Crowdsourcing is an effective method to obtain large databases of manually-labeled images, which is especially important for image understanding with supervised machine learning algorithms. However, for several kinds of tasks regarding image labeling, e.g., dog breed recognition, it is hard to achieve high-quality results. Therefore, further optimizing crowdsourcing workflow mainly involves task allocation and result inference. For task allocation, we design a two-round crowdsourcing framework, which contains a smart decision mechanism based on information entropy to determine whether to perform the second round task allocation. Regarding result inference, after quantifying the similarity of all labels, two graphical models are proposed to describe the labeling process and corresponding inference algorithms are designed to further improve the result quality of image labeling. Extensive experiments on real-world tasks in Crowdflower and synthesis datasets were conducted. The experimental results demonstrate the superiority of these methods in comparison with state-of-the-art methods.展开更多
A literature analysis has shown that object search,recognition,and tracking systems are becoming increasingly popular.However,such systems do not achieve high practical results in analyzing small moving living objects...A literature analysis has shown that object search,recognition,and tracking systems are becoming increasingly popular.However,such systems do not achieve high practical results in analyzing small moving living objects ranging from 8 to 14 mm.This article examines methods and tools for recognizing and tracking the class of small moving objects,such as ants.To fulfill those aims,a customized You Only Look Once Ants Recognition(YOLO_AR)Convolutional Neural Network(CNN)has been trained to recognize Messor Structor ants in the laboratory using the LabelImg object marker tool.The proposed model is an extension of the You Only Look Once v4(Yolov4)512×512 model with an additional Self Regularized Non–Monotonic(Mish)activation function.Additionally,the scalable solution for continuous object recognizing and tracking was implemented.This solution is based on the OpenDatacam system,with extended Object Tracking modules that allow for tracking and counting objects that have crossed the custom boundary line.During the study,the methods of the alignment algorithm for finding the trajectory of moving objects were modified.I discovered that the Hungarian algorithm showed better results in tracking small objects than the K–D dimensional tree(k-d tree)matching algorithm used in OpenDataCam.Remarkably,such an algorithm showed better results with the implemented YOLO_AR model due to the lack of False Positives(FP).Therefore,I provided a new tracker module with a Hungarian matching algorithm verified on the Multiple Object Tracking(MOT)benchmark.Furthermore,additional customization parameters for object recognition and tracking results parsing and filtering were added,like boundary angle threshold(BAT)and past frames trajectory prediction(PFTP).Experimental tests confirmed the results of the study on a mobile device.During the experiment,parameters such as the quality of recognition and tracking of moving objects,the PFTP and BAT,and the configuration parameters of the neural network and boundary line model were analyzed.The results showed an increased tracking accuracy with the proposed methods by 50%.The study results confirmed the relevance of the topic and the effectiveness of the implemented methods and tools.展开更多
文摘The concept and advantage of reconfigurable technology is introduced. A kind of processor architecture of re configurable macro processor (RMP) model based on FPGA array and DSP is put forward and has been implemented. Two image algorithms are developed: template-based automatic target recognition and zone labeling. One is estimating for motion direction in the infrared image background, another is line picking-up algorithm based on image zone labeling and phase grouping technique. It is a kind of 'hardware' function that can be called by the DSP in high-level algorithm. It is also a kind of hardware algorithm of the DSP. The results of experiments show the reconfigurable computing technology based on RMP is an ideal accelerating means to deal with the high-speed image processing tasks. High real time performance is obtained in our two applications on RMP.
基金This work was supported in part by the Shenzhen Science and Technology Program(No.JCYJ20180703145002040)the Strategic Priority Research Program of Chinese Academy of Sciences(No.XDB38050100)the Shenzhen Science and Technology Program(No.JCYJ20180507182818013).
文摘The lack of labeled image data poses a serious challenge to the application of artificial intelligence(AI)in medical image diagnosis.Medical image notes contain valuable patient information that could be used to label images for machine learning tasks.However,most image note texts are unstructured with heterogeneity and short-paragraph characters,which fail traditional keyword-based techniques.We utilized a deep learning approach to recover missing labels for medical image notes automatically by using a combination of deep word embedding and deep neural network classifiers.Bidirectional encoder representations from transformers trained on medical image notes corpus(MinBERT)were proposed.We applied the proposed techniques to two typical classification tasks:Medical image type identification and clinical diagnosis identification.The two methods significantly outperformed baseline methods and presented high accuracies of 99.56%and 99.72%in image type identification and of 94.56%and 92.45%in clinical diagnosis identification.Visualization analysis further indicated that word embedding could efficiently capture semantic similarities and regularities across diverse expressions.Results indicated that our proposed framework could accurately recover the missing label information of medical images through the automatic extraction of electronic medical record information.Hence,it could serve as a powerful tool for exploring useful training data in various medical AI applications.
基金supports from the National Key Research and Development Program of China (2016YFC0102100,2016YFA0301000,2016YFA0203900)National Natural Science Foundation of China (81671725)+1 种基金Shanghai Rising Star Program (15QA1400500)Shanghai Action Plan for Scientific and Technological Innovation Program (16441909200).
文摘Rapid histology of brain tissues with sufficient diagnostic information has the great potential to aid neurosurgons during operations.Stimulated Raman Scattering(SRS)microscopy is an emerging label-free imaging technique,with the intrinsic chemical resolutions to delineate brain tumors from normal tissues without the nood of time-consuming tissue processing.Growing number of studies have shown SRS as a“virtual histology"tool for rapid diagnosis of various types of brain tumors.In this review,we focus on the basic principles and current developments of SRS microscopy,as well as its applications for brain tumor imaging.
文摘Crowdsourcing is an effective method to obtain large databases of manually-labeled images, which is especially important for image understanding with supervised machine learning algorithms. However, for several kinds of tasks regarding image labeling, e.g., dog breed recognition, it is hard to achieve high-quality results. Therefore, further optimizing crowdsourcing workflow mainly involves task allocation and result inference. For task allocation, we design a two-round crowdsourcing framework, which contains a smart decision mechanism based on information entropy to determine whether to perform the second round task allocation. Regarding result inference, after quantifying the similarity of all labels, two graphical models are proposed to describe the labeling process and corresponding inference algorithms are designed to further improve the result quality of image labeling. Extensive experiments on real-world tasks in Crowdflower and synthesis datasets were conducted. The experimental results demonstrate the superiority of these methods in comparison with state-of-the-art methods.
文摘A literature analysis has shown that object search,recognition,and tracking systems are becoming increasingly popular.However,such systems do not achieve high practical results in analyzing small moving living objects ranging from 8 to 14 mm.This article examines methods and tools for recognizing and tracking the class of small moving objects,such as ants.To fulfill those aims,a customized You Only Look Once Ants Recognition(YOLO_AR)Convolutional Neural Network(CNN)has been trained to recognize Messor Structor ants in the laboratory using the LabelImg object marker tool.The proposed model is an extension of the You Only Look Once v4(Yolov4)512×512 model with an additional Self Regularized Non–Monotonic(Mish)activation function.Additionally,the scalable solution for continuous object recognizing and tracking was implemented.This solution is based on the OpenDatacam system,with extended Object Tracking modules that allow for tracking and counting objects that have crossed the custom boundary line.During the study,the methods of the alignment algorithm for finding the trajectory of moving objects were modified.I discovered that the Hungarian algorithm showed better results in tracking small objects than the K–D dimensional tree(k-d tree)matching algorithm used in OpenDataCam.Remarkably,such an algorithm showed better results with the implemented YOLO_AR model due to the lack of False Positives(FP).Therefore,I provided a new tracker module with a Hungarian matching algorithm verified on the Multiple Object Tracking(MOT)benchmark.Furthermore,additional customization parameters for object recognition and tracking results parsing and filtering were added,like boundary angle threshold(BAT)and past frames trajectory prediction(PFTP).Experimental tests confirmed the results of the study on a mobile device.During the experiment,parameters such as the quality of recognition and tracking of moving objects,the PFTP and BAT,and the configuration parameters of the neural network and boundary line model were analyzed.The results showed an increased tracking accuracy with the proposed methods by 50%.The study results confirmed the relevance of the topic and the effectiveness of the implemented methods and tools.