An effective nonrigid image registration method is developed within the optical flow field (OFF) framework for registering images with complex structures. In our method, a new force is modeled and integrated into the original optical flow equation so that the two forces jointly drive the motion direction of pixels. At any point in the offset field, the force generated by the OFF model, derived from local gradient information, drives the pixels of the floating image to infiltrate the reference pixel set. In addition, a new “guiding force”, derived from the overall global grayscale trend in a given neighborhood system, helps the pixels spread more properly into the corresponding reference pixel set, particularly when the gradient field of the reference image is unstable. In the experiment, a data set containing several images with complex structures was used to validate the registration model. The test results show that the method registers complex images quickly and efficiently and is robust to image noise.
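For orientation, the classical optical-flow (demons-style) force that OFF-based registration methods commonly start from can be written as below. This is a sketch of the standard textbook formulation, not the authors' equation: the symbols f (reference image), m (floating image), u (per-pixel displacement) and the weight λ are assumptions introduced here, and the abstract does not state the form of the guiding force g, so it is left abstract.

```latex
% Optical flow constraint, linearized around the reference image f
\nabla f(\mathbf{x}) \cdot \mathbf{u}(\mathbf{x}) = m(\mathbf{x}) - f(\mathbf{x})

% Stabilized local-gradient force (Thirion's demons form; the sign depends on convention)
\mathbf{u}_{\mathrm{OFF}}(\mathbf{x}) =
  \frac{\bigl(m(\mathbf{x}) - f(\mathbf{x})\bigr)\,\nabla f(\mathbf{x})}
       {\lVert \nabla f(\mathbf{x}) \rVert^{2} + \bigl(m(\mathbf{x}) - f(\mathbf{x})\bigr)^{2}}

% Hypothetical combination with a neighborhood-derived guiding force g (form not given in the abstract)
\mathbf{u}(\mathbf{x}) = \mathbf{u}_{\mathrm{OFF}}(\mathbf{x}) + \lambda\,\mathbf{g}(\mathbf{x})
```

When the gradient of f is close to zero or noisy, the first force carries little reliable direction information, which is the situation the abstract describes as motivating an additional guiding term.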
Objective: Most Western music consists of a melody and an accompaniment. The melody is referred to as the foreground and the accompaniment as the background. In visual processing, the lateral occipital complex (LOC) is known to participate in foreground and background segregation. We investigated the role of the LOC in music processing using positron emission tomography (PET). Method: Musically naive subjects listened to unfamiliar novel melodies with the accompaniment (accompaniment condition) and without it (melodic condition). Using a PET subtraction technique, we studied changes in regional cerebral blood flow (rCBF) during the accompaniment condition compared to the melodic condition. Results: The accompaniment condition was associated with a bilateral increase of rCBF at the lateral and medial surfaces of both occipital lobes, the medial parts of the fusiform gyri, the cingulate gyri, the precentral gyri, the insular cortices, and the cerebellum. During the melodic condition, activation was observed at the anterior and posterior portions of the temporal lobes, the medial surface of the frontal lobes, the inferior frontal gyri, the orbitofrontal cortices, the inferior parietal lobules, and the cerebellum. Conclusions: The LOC participates in the recognition of melody with accompaniment, a phenomenon that can be regarded as foreground and background segregation in auditory processing. The fusiform cortex, which is known to participate in color recognition, might be activated by the recognition of flourish sounds in the accompaniment, compared to the melodic condition. The LOC and fusiform cortex are thought to play similar roles across different sensory modalities.
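The most critical step in a line-structured-light 3D scanning and modeling system is extracting the centerline of the light stripe, but interference from various environmental factors makes centerline extraction difficult. A solution is proposed for line-structured-light stripe images that suffer from light-spot interference, uneven intensity distribution, large variation in stripe width, and complex backgrounds. First, the structured-light image is binarized with the Otsu method. Second, an improved DBSCAN (density-based spatial clustering of applications with noise) algorithm retains the core points and removes border points and noise points. Finally, the core points are taken as input to build a graph data structure, and a shortest-path search algorithm suited to line-structured-light stripe images yields the stripe centerline. Experimental results show that the algorithm runs within 150 ms with an error within 0.2 pixels, is applicable to a variety of complex environments, and meets the requirements for real-time performance, accuracy, and stability.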
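The first two steps of this pipeline can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: it uses the standard Otsu criterion and the standard DBSCAN core-point test rather than the "improved" DBSCAN the abstract mentions, and the function names, the `eps` and `min_pts` values, and the toy image are all hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[int, int]  # (row, col)


def otsu_threshold(pixels: Sequence[int], levels: int = 256) -> int:
    """Return the gray level t that maximizes between-class variance.

    Pixels <= t are treated as background, pixels > t as foreground.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    w_bg = 0      # number of background pixels so far
    sum_bg = 0    # sum of background gray levels so far
    best_t, best_var = 0, -1.0
    for t in range(levels):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue  # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break     # no foreground pixels left
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def core_points(points: List[Point], eps: float, min_pts: int) -> List[Point]:
    """Keep points with at least min_pts neighbours (self included) within eps.

    This is the plain DBSCAN core-point test; border and noise points are dropped.
    """
    eps2 = eps * eps
    core = []
    for (r, c) in points:
        n = sum(1 for (u, v) in points if (r - u) ** 2 + (c - v) ** 2 <= eps2)
        if n >= min_pts:
            core.append((r, c))
    return core


# Toy image: a bright horizontal stripe (rows 1..3) on a dark background.
image = [[10] * 7] + [[200] * 7 for _ in range(3)] + [[10] * 7]
flat = [p for row in image for p in row]
t = otsu_threshold(flat)
foreground = [(r, c) for r, row in enumerate(image)
              for c, p in enumerate(row) if p > t]
core = core_points(foreground, eps=1.5, min_pts=9)
print(core)  # [(2, 1), (2, 2), (2, 3), (2, 4), (2, 5)]
```

On this toy stripe the core-point filter already thins the three-pixel-wide band to its middle row. The graph construction and shortest-path search described in the abstract would then operate on these core points.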
Documents are often printed with text on a complex color background. The readability of the text in such documents is very poor because of the complexity of the background and the mixing of foreground text colors with background colors. Automatic segmentation of the foreground text in such document images is essential for smooth reading of the contents by either humans or machines. In this paper we propose a novel approach to extracting the foreground text from color document images with complex backgrounds. The proposed approach is a hybrid that combines connected component analysis with texture feature analysis of potential text regions. It uses the Canny edge detector to detect all possible text edge pixels. Connected component analysis is then performed on these edge pixels to identify candidate text regions. Because of the background complexity, a non-text region may be identified as a text region; this problem is overcome by analyzing the texture features of the potential text region corresponding to each connected component. An unsupervised local thresholding method is devised to perform foreground segmentation in the detected text regions. Finally, noisy text regions are identified and reprocessed to further enhance the quality of the retrieved foreground. The approach can handle document images with varying backgrounds of multiple colors and textures, and foreground text in any color, font, size, and orientation. Experimental results show that the proposed algorithm detects on average 97.12% of the text regions in the source document. The readability of the extracted foreground text is illustrated through optical character recognition (OCR) for English text. The proposed approach is compared with some existing methods of foreground separation in document images, and the experimental results show that it performs better.
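The connected component step can be illustrated with a small sketch. It assumes a binary edge map is already available (for example from a Canny detector, which is not reimplemented here) and uses 8-connectivity, which the abstract does not specify; the function name and the toy edge map are hypothetical.

```python
from collections import deque
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def candidate_regions(edge_map: List[List[int]]) -> List[Box]:
    """Group edge pixels into 8-connected components and return their bounding boxes."""
    rows, cols = len(edge_map), len(edge_map[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes: List[Box] = []
    for r in range(rows):
        for c in range(cols):
            if not edge_map[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited edge pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and edge_map[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


edges = [
    [1, 1, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 0, 1, 1],
]
print(candidate_regions(edges))  # [(0, 0, 1, 1), (1, 3, 2, 4)]
```

Each bounding box is only a candidate text region. In the approach described above, texture features of each region would then be used to reject non-text components before local thresholding.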
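In ship detection in SAR images, the complex background of near-shore ports prevents feature information from being accurately extracted for overlapping ship targets, which leads to missed and false detections of near-shore ships. To address this problem, a SAR image ship detection algorithm for complex scenes is proposed, based on an improved YOLOv5. An SPPF structure is adopted to strengthen feature extraction, and its output is fused with the features extracted by the original YOLOv5 SPP structure; this parallel fusion of multi-level pyramid modules detects multi-scale ship targets effectively and improves feature representation. Next, the GIOU of the original model is replaced with CIOU so that the predicted box positions are regressed accurately. Finally, to filter predicted boxes above the threshold more reasonably, NMS (Non-Maximum Suppression) is improved: the Soft-NMS method penalizes and decays the scores of boxes above the threshold, so that predicted boxes are removed reasonably. Experimental results show that, compared with the original model, the improved model raises mAP (mean Average Precision) by 5.15% and 5.06% on the SSDD and SAR-Ship-Dataset data sets, respectively, and that it can effectively detect ships in SAR images against complex near-shore backgrounds.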
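The score-decay idea behind Soft-NMS can be sketched as follows. This is a minimal illustration of the linear Soft-NMS variant, assuming boxes in `(x1, y1, x2, y2)` form; the abstract does not say which decay function or thresholds the paper uses, so `iou_thresh`, `score_thresh`, and the function names are assumptions.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes: Sequence[Box], scores: Sequence[float],
             iou_thresh: float = 0.5,
             score_thresh: float = 0.001) -> List[Tuple[Box, float]]:
    """Linear Soft-NMS: decay, rather than discard, boxes overlapping a better one."""
    remaining = list(zip(boxes, scores))
    kept: List[Tuple[Box, float]] = []
    while remaining:
        # Pick the highest-scoring box still in play.
        best = max(range(len(remaining)), key=lambda i: remaining[i][1])
        box, score = remaining.pop(best)
        kept.append((box, score))
        decayed = []
        for other, s in remaining:
            overlap = iou(box, other)
            if overlap > iou_thresh:
                s *= 1.0 - overlap  # penalize in proportion to the overlap
            if s >= score_thresh:   # drop boxes whose score has decayed away
                decayed.append((other, s))
        remaining = decayed
    return kept


boxes = [(0, 0, 10, 10), (1, 0, 11, 10), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
for box, score in soft_nms(boxes, scores):
    print(box, round(score, 3))
```

Hard NMS would discard the second box outright because it overlaps the first by more than the threshold. Here it is kept with a reduced score, which is the behavior that helps when overlapping ships are genuinely distinct targets.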
Funding: Supported in part by the National Key Research and Development Program of China under Grant No. 2020YFB1806403.