Visual odometry is critical in visual simultaneous localization and mapping for robot navigation.However,the pose estimation performance of most current visual odometry algorithms degrades in scenes with unevenly dist...Visual odometry is critical in visual simultaneous localization and mapping for robot navigation.However,the pose estimation performance of most current visual odometry algorithms degrades in scenes with unevenly distributed features because dense features occupy excessive weight.Herein,a new human visual attention mechanism for point-and-line stereo visual odometry,which is called point-line-weight-mechanism visual odometry(PLWM-VO),is proposed to describe scene features in a global and balanced manner.A weight-adaptive model based on region partition and region growth is generated for the human visual attention mechanism,where sufficient attention is assigned to position-distinctive objects(sparse features in the environment).Furthermore,the sum of absolute differences algorithm is used to improve the accuracy of initialization for line features.Compared with the state-of-the-art method(ORB-VO),PLWM-VO show a 36.79%reduction in the absolute trajectory error on the Kitti and Euroc datasets.Although the time consumption of PLWM-VO is higher than that of ORB-VO,online test results indicate that PLWM-VO satisfies the real-time demand.The proposed algorithm not only significantly promotes the environmental adaptability of visual odometry,but also quantitatively demonstrates the superiority of the human visual attention mechanism.展开更多
To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement al...To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement algorithm. This algorithm introduces fuzzy entropy, makes full use of neighborhood information, fuzzy information and human visual characteristics.To enhance an image, this paper first carries out the reasonable fuzzy-3 partition of its histogram into the dark region, intermediate region and bright region. It then extracts the statistical characteristics of the three regions and adaptively selects the parameter αaccording to the statistical characteristics of the image’s gray-scale values. It also adds a useful nonlinear transform, thus increasing the ubiquity of the algorithm. Finally, the causes for the gray-scale value overcorrection that occurs in the traditional image enhancement algorithms are analyzed and their solutions are proposed.The simulation results show that our image enhancement algorithm can effectively suppress the noise of an image, enhance its contrast and visual effect, sharpen its edge and adjust its dynamic range.展开更多
A Robust Adaptive Video Encoder (RAVE) based on human visual model is proposed. The encoder combines the best features of Fine Granularity Scalable (FGS) coding, framedropping coding, video redundancy coding, and huma...A Robust Adaptive Video Encoder (RAVE) based on human visual model is proposed. The encoder combines the best features of Fine Granularity Scalable (FGS) coding, framedropping coding, video redundancy coding, and human visual model. According to packet loss and available bandwidth of the network, the encoder adjust the output bit rate by jointly adapting quantization step-size instructed by human visual model, rate shaping, and periodically inserting key frame. The proposed encoder is implemented based on MPEG-4 encoder and is compared with the case of a conventional FGS algorithm. It is shown that RAVE is a very efficient robust video encoder that provides improved visual quality for the receiver and consumes equal or less network resource. Results are confirmed by subjective tests and simulation tests.展开更多
The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet d...The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet domain Inclusive-OR denoising algorithm(WDIDA), which distinguishes the wavelet coefficients belonging to image or noise by considering their phases and modulus maxima simultaneously. Using this new algorithm, the denoising effects are improved and the computation time is reduced. Furthermore, in order to enhance the edges of the image but not magnify noise, a contrast nonlinear enhancing algorithm is presented according to human visual properties. Compared with traditional enhancing algorithms, the algorithm that we proposed has a better noise reducing performanee , preserving edges and improving the visual quality of images.展开更多
AIM: To investigate the visual pathway in normal subjects and patients with lesion involved by diffusion tensor imaging (DTI) and diffusion tensor tractography (DTT). METHODS: Thirty normal volunteers, 3 subjects with...AIM: To investigate the visual pathway in normal subjects and patients with lesion involved by diffusion tensor imaging (DTI) and diffusion tensor tractography (DTT). METHODS: Thirty normal volunteers, 3 subjects with orbital tumors involved the optic nerve (ON) and 33 subjects with occipital lobe tumors involved the optic radiation (OR) (10 gliomas, 6 meningiomas and 17 cerebral metastases) undertook routine cranium magnetic resonance imaging (MRI), DTI and DTT. Visual pathway fibers were analyzed by DTI and DTT images. Test fractional anisotropy (FA) and mean diffusivity (MD) values in different part of the visual pathway. RESULTS: The whole visual pathway but optic chiasm manifested as hyperintensity in FA maps and homogenous green signal in the direction encoded color maps. The optic chiasm did not display clearly. There was no significant difference between the bilateral FA values and MD values of normal visual pathway but optic chiasm, which the FA values tested were much too low (all P>0.05). The ONs of subjects with orbital tumors were compressed and displaced. Only one subject had lower FA values and higher MD values. OR of 9 gliomas subjects were infiltrated, with displacement in 2 and disruption in 7 subjects. All OR in 6 meniongiomas subjects were displaced. OR in 17 cerebral metastases subjects all developed displacement while 7 of them had disruption also. CONCLUSION: MR-DTI is highly sensitive in manifesting visual pathway. Visual pathway can be analyzed quantitatively in FA and MD values. DTT supplies accurate three dimensional conformations of visual pathway. But optic chiasm's manifestation still needs to improve.展开更多
The Drovers’ Paths are remnants of important land access roads from Rio Grande do Sul to São Paulo at the time of Colonial Brazil. They were built and used between the 18th and 20th centuries, particularly i...The Drovers’ Paths are remnants of important land access roads from Rio Grande do Sul to São Paulo at the time of Colonial Brazil. They were built and used between the 18th and 20th centuries, particularly in the region of Coxilha Rica. The main objective of this research is to develop a method for decision-making applied to the territorial landscape management in the Coxilha Rica. The method consisted of generating criteria to map the visibility spot reached from the main selected points;define the human visual acuity, realize bibliographic research, use cartographic and historical documents, inter-views, as well as field surveys that enabled the identification, characterization and mapping of historical farms and drovers’ paths. After data processing, the information was entered into the cartographic database;the data were cross-checked and analysis was made of the visibility of the surrounding farms and stone-walled corridors. Quality assessments showed that, with the visibility polygons, and through the use of cartographic tools, we could cross-check between different levels of information and analyze landscape intervention alternatives in order to minimize environmental impacts. When applying the method in the Coxilha Rica it was possible mapping the visibility polygon, taking human visual acuity into consideration, based on historical farms and stone-walled corridors;and making spatial analyses to explore alternatives to intervention (installation of power transmission systems) in order to preserve the scenic environment of the region. In the end, the decision was by does not construct the system.展开更多
The concept of receptive field(RF) is central to sensory neuroscience. Neuronal RF properties have been substantially studied in animals,while those in humans remain nearly unexplored. Here, we measured neuronal RFs w...The concept of receptive field(RF) is central to sensory neuroscience. Neuronal RF properties have been substantially studied in animals,while those in humans remain nearly unexplored. Here, we measured neuronal RFs with intracranial local field potentials(LFPs) and spiking activity in human visual cortex(V1/V2/V3). We recorded LFPs via macro-contacts and discovered that RF sizes estimated from lowfrequency activity(LFA, 0.5–30 Hz) were larger than those estimated from low-gamma activity(LGA, 30–60 Hz) and high-gamma activity(HGA, 60–150 Hz). We then took a rare opportunity to record LFPs and spiking activity via microwires in V1 simultaneously. We found that RF sizes and temporal profiles measured from LGA and HGA closely matched those from spiking activity. In sum, this study reveals that spiking activity of neurons in human visual cortex could be well approximated by LGA and HGA in RF estimation and temporal profile measurement, implying the pivotal functions of LGA and HGA in early visual information processing.展开更多
This study explores the complex relationship between climate change and human development. The aim is to understand how climate change affects human development across countries, regions, and the global population. Vi...This study explores the complex relationship between climate change and human development. The aim is to understand how climate change affects human development across countries, regions, and the global population. Visual analytics were used to examine the impact of various climate change indicators on different aspects of human development. The study highlights the urgent need for climate change action and encourages policymakers to make decisive moves. Climate change adversely affects numerous aspects of daily life, leading to significant consequences that must be addressed through policy changes and global governance recommendations. Key findings include that regions with higher CO2 emissions experience a significantly higher incidence of life-threatening diseases compared to regions with lower emissions. Additionally, higher CO2 emissions correlate with consistent death rates. Increased pollution exposure is associated with a higher prevalence of life-threatening diseases and higher rates of malnutrition. Moreover, greater mineral depletion is linked to more frequent life-threatening diseases, suggesting that industrialization contributes to adverse health effects. These results provide valuable insights for policy and decision-making aimed at mitigating the impact of climate change on human development.展开更多
In times of digitalisation, visual assistance systems in assembly are increasingly important. The design of these assembly systems needs to be highly complex to meet the requirements. Due to the increasing number of v...In times of digitalisation, visual assistance systems in assembly are increasingly important. The design of these assembly systems needs to be highly complex to meet the requirements. Due to the increasing number of variants in production processes, as well as shorter innovation and product life cycles, assistance systems should improve quality and reduce complexity of assembly processes. However, many large kitchen manufacturers still assemble kitchen cabinets manually, due to the high variety of components, such as rails and fittings. This paper focuses on the analysis and evaluation of virtual assistance systems to improve quality and usability in individualised kitchen cabinet assembly processes at a large German manufacturer. A solution is identified and detailed.展开更多
In order to achieve higher image compression ratio and improve visual perception of the decompressed image, a novel color image compression scheme based on the contrast sensitivity characteristics of the human visual ...In order to achieve higher image compression ratio and improve visual perception of the decompressed image, a novel color image compression scheme based on the contrast sensitivity characteristics of the human visual system (HVS) is proposed. In the proposed scheme, firstly the image is converted into the YCrCb color space and divided into sub-blocks. Afterwards, the discrete cosine transform is carried out for each sub-block, and three quantization matrices are built to quantize the frequency spectrum coefficients of the images by combining the contrast sensitivity characteristics of HVS. The Huffman algorithm is used to encode the quantized data. The inverse process involves decompression and matching to reconstruct the decompressed color image. And simulations are carried out for two color images. The results show that the average structural similarity index measurement (SSIM) and peak signal to noise ratio (PSNR) under the approximate compression ratio could be increased by 2.78% and 5.48%, respectively, compared with the joint photographic experts group (JPEG) compression. The results indicate that the proposed compression algorithm in the text is feasible and effective to achieve higher compression ratio under ensuring the encoding and image quality, which can fully meet the needs of storage and transmission of color images in daily life.展开更多
Objective image quality assessment(IQA)plays an important role in various visual communication systems,which can automatically and efficiently predict the perceived quality of images.The human eye is the ultimate eval...Objective image quality assessment(IQA)plays an important role in various visual communication systems,which can automatically and efficiently predict the perceived quality of images.The human eye is the ultimate evaluator for visual experience,thus the modeling of human visual system(HVS)is a core issue for objective IQA and visual experience optimization.The traditional model based on black box fitting has low interpretability and it is difficult to guide the experience optimization effectively,while the model based on physiological simulation is hard to integrate into practical visual communication services due to its high computational complexity.For bridging the gap between signal distortion and visual experience,in this paper,we propose a novel perceptual no-reference(NR)IQA algorithm based on structural computational modeling of HVS.According to the mechanism of the human brain,we divide the visual signal processing into a low-level visual layer,a middle-level visual layer and a high-level visual layer,which conduct pixel information processing,primitive information processing and global image information processing,respectively.The natural scene statistics(NSS)based features,deep features and free-energy based features are extracted from these three layers.The support vector regression(SVR)is employed to aggregate features to the final quality prediction.Extensive experimental comparisons on three widely used benchmark IQA databases(LIVE,CSIQ and TID2013)demonstrate that our proposed metric is highly competitive with or outperforms the state-of-the-art NR IQA measures.展开更多
Vision-simulated imagery―the process of generating images that mimic the human visual system―is a valuable tool with a wide spectrum of possible applications, including visual acuity measurements, personalized plann...Vision-simulated imagery―the process of generating images that mimic the human visual system―is a valuable tool with a wide spectrum of possible applications, including visual acuity measurements, personalized planning of corrective lenses and surgeries, vision-correcting displays, vision-related hardware development, and extended reality discomfort reduction. A critical property of human vision is that it is imperfect because of the highly influential wavefront aberrations that vary from person to person. This study provides an overview of the existing computational image generation techniques that properly simulate human vision in the presence of wavefront aberrations. These algorithms typically apply ray tracing with a detailed description of the simulated eye or utilize the point-spread func-tion of the eye to perform convolution on the input image. Based on the description of the vision simulation tech-niques, several of their characteristic features have been evaluated and some potential application areas and research directions have been outlined.展开更多
AIM: To explore whether ectopic expression of human melanopsin can effectively and safely restore visual function in rd1 mice.· METHODS: Hematoxylin-eosin staining of retinal sections from rd1 mice was used to ...AIM: To explore whether ectopic expression of human melanopsin can effectively and safely restore visual function in rd1 mice.· METHODS: Hematoxylin-eosin staining of retinal sections from rd1 mice was used to detect the thickness of the outer nuclear layer to determine the timing of surgery. We constructed a human melanopsinAAV2/8 viral vector and injected it into the subretinal space of rd1 mice. The Phoenix Micron IV system was used to exclude the aborted injections, and immunohistochemistry was used to validate the ectopic expression of human melanopsin. Furthermore, visual electrophysiology and behavioral tests were used to detect visual function 30 and 45 d after the injection. The structure of the retina was compared between the human melanopsin-injected group and phosphate buffer saline(PBS)-injected group.·RESULTS: Retinas of rd1 mice lost almost all of their photoreceptors on postnatal day 28(P28). We therefore injected the human melanopsin-adeno-associated virus(AAV) 2/8 viral vector into P30 rd1 mice. After excluding aborted injections, we used immunohistochemistry of the whole mount retina to confirm the ectopic expression of human melanopsin by co-expression of human melanopsin and YFP that was carried by a viral vector. At30 d post-injection, visual electrophysiology and the behavioral test significantly improved. However,restoration of vision disappeared 45 d after human melanopsin injection. Notably, human melanopsin-injected mice did not show any structural differences in their retinas compared with PBS-injected mice.·CONCLUSION: Ectopic expression of human melanopsin effectively and safely restores visual function in rd1展开更多
红外小目标的检测一直是红外追踪系统的关键技术,针对现有红外小目标检测方法在复杂背景下易造成虚警、检测速度慢的不足,从人类视觉系统的角度出发,参考了多尺度局部能量因子检测方法(multiscale local contrast measure using a local...红外小目标的检测一直是红外追踪系统的关键技术,针对现有红外小目标检测方法在复杂背景下易造成虚警、检测速度慢的不足,从人类视觉系统的角度出发,参考了多尺度局部能量因子检测方法(multiscale local contrast measure using a local energy factor,MLCM-LEF),提出了一种基于双层局部能量因子的红外小目标检测方法.从局部能量差异与局部亮度差异两个角度进行目标检测,使用双层局部能量因子从能量角度描述小目标与背景的相异程度,同时采取加权亮度差因子从亮度角度对图像进行目标检测,通过二维高斯融合上述二者的处理结果,最终利用图像均值和标准差进行自适应阈值分割,提取红外小目标.经过公开数据集实验测试,该方法在抑制背景噪声、减低虚警概率的表现上比主流的检测方法有所提升,与MLCM-LEF算法相比,基于双层局部能量因子的方法将单帧检测时间降低至三分之一.展开更多
基金Supported by Tianjin Municipal Natural Science Foundation of China(Grant No.19JCJQJC61600)Hebei Provincial Natural Science Foundation of China(Grant Nos.F2020202051,F2020202053).
文摘Visual odometry is critical in visual simultaneous localization and mapping for robot navigation.However,the pose estimation performance of most current visual odometry algorithms degrades in scenes with unevenly distributed features because dense features occupy excessive weight.Herein,a new human visual attention mechanism for point-and-line stereo visual odometry,which is called point-line-weight-mechanism visual odometry(PLWM-VO),is proposed to describe scene features in a global and balanced manner.A weight-adaptive model based on region partition and region growth is generated for the human visual attention mechanism,where sufficient attention is assigned to position-distinctive objects(sparse features in the environment).Furthermore,the sum of absolute differences algorithm is used to improve the accuracy of initialization for line features.Compared with the state-of-the-art method(ORB-VO),PLWM-VO show a 36.79%reduction in the absolute trajectory error on the Kitti and Euroc datasets.Although the time consumption of PLWM-VO is higher than that of ORB-VO,online test results indicate that PLWM-VO satisfies the real-time demand.The proposed algorithm not only significantly promotes the environmental adaptability of visual odometry,but also quantitatively demonstrates the superiority of the human visual attention mechanism.
基金supported by the National Natural Science Foundation of China(61472324)
文摘To overcome the shortcomings of the Lee image enhancement algorithm and its improvement based on the logarithmic image processing(LIP) model, this paper proposes what we believe to be an effective image enhancement algorithm. This algorithm introduces fuzzy entropy, makes full use of neighborhood information, fuzzy information and human visual characteristics.To enhance an image, this paper first carries out the reasonable fuzzy-3 partition of its histogram into the dark region, intermediate region and bright region. It then extracts the statistical characteristics of the three regions and adaptively selects the parameter αaccording to the statistical characteristics of the image’s gray-scale values. It also adds a useful nonlinear transform, thus increasing the ubiquity of the algorithm. Finally, the causes for the gray-scale value overcorrection that occurs in the traditional image enhancement algorithms are analyzed and their solutions are proposed.The simulation results show that our image enhancement algorithm can effectively suppress the noise of an image, enhance its contrast and visual effect, sharpen its edge and adjust its dynamic range.
基金Supported by Innovation Fund of China(00C26224210641)
文摘A Robust Adaptive Video Encoder (RAVE) based on human visual model is proposed. The encoder combines the best features of Fine Granularity Scalable (FGS) coding, framedropping coding, video redundancy coding, and human visual model. According to packet loss and available bandwidth of the network, the encoder adjust the output bit rate by jointly adapting quantization step-size instructed by human visual model, rate shaping, and periodically inserting key frame. The proposed encoder is implemented based on MPEG-4 encoder and is compared with the case of a conventional FGS algorithm. It is shown that RAVE is a very efficient robust video encoder that provides improved visual quality for the receiver and consumes equal or less network resource. Results are confirmed by subjective tests and simulation tests.
文摘The key to the wavelet based denoising teehniquea is how to manipulate the wavelet coefficients. By referring to the idea of Inclusive-OR in the design of circuits, this paper proposes a new algorithm called wavelet domain Inclusive-OR denoising algorithm(WDIDA), which distinguishes the wavelet coefficients belonging to image or noise by considering their phases and modulus maxima simultaneously. Using this new algorithm, the denoising effects are improved and the computation time is reduced. Furthermore, in order to enhance the edges of the image but not magnify noise, a contrast nonlinear enhancing algorithm is presented according to human visual properties. Compared with traditional enhancing algorithms, the algorithm that we proposed has a better noise reducing performanee , preserving edges and improving the visual quality of images.
基金Fundamental Research Funds of State Key Laboratory of Ophthalmology,China
文摘AIM: To investigate the visual pathway in normal subjects and patients with lesion involved by diffusion tensor imaging (DTI) and diffusion tensor tractography (DTT). METHODS: Thirty normal volunteers, 3 subjects with orbital tumors involved the optic nerve (ON) and 33 subjects with occipital lobe tumors involved the optic radiation (OR) (10 gliomas, 6 meningiomas and 17 cerebral metastases) undertook routine cranium magnetic resonance imaging (MRI), DTI and DTT. Visual pathway fibers were analyzed by DTI and DTT images. Test fractional anisotropy (FA) and mean diffusivity (MD) values in different part of the visual pathway. RESULTS: The whole visual pathway but optic chiasm manifested as hyperintensity in FA maps and homogenous green signal in the direction encoded color maps. The optic chiasm did not display clearly. There was no significant difference between the bilateral FA values and MD values of normal visual pathway but optic chiasm, which the FA values tested were much too low (all P>0.05). The ONs of subjects with orbital tumors were compressed and displaced. Only one subject had lower FA values and higher MD values. OR of 9 gliomas subjects were infiltrated, with displacement in 2 and disruption in 7 subjects. All OR in 6 meniongiomas subjects were displaced. OR in 17 cerebral metastases subjects all developed displacement while 7 of them had disruption also. CONCLUSION: MR-DTI is highly sensitive in manifesting visual pathway. Visual pathway can be analyzed quantitatively in FA and MD values. DTT supplies accurate three dimensional conformations of visual pathway. But optic chiasm's manifestation still needs to improve.
文摘The Drovers’ Paths are remnants of important land access roads from Rio Grande do Sul to São Paulo at the time of Colonial Brazil. They were built and used between the 18th and 20th centuries, particularly in the region of Coxilha Rica. The main objective of this research is to develop a method for decision-making applied to the territorial landscape management in the Coxilha Rica. The method consisted of generating criteria to map the visibility spot reached from the main selected points;define the human visual acuity, realize bibliographic research, use cartographic and historical documents, inter-views, as well as field surveys that enabled the identification, characterization and mapping of historical farms and drovers’ paths. After data processing, the information was entered into the cartographic database;the data were cross-checked and analysis was made of the visibility of the surrounding farms and stone-walled corridors. Quality assessments showed that, with the visibility polygons, and through the use of cartographic tools, we could cross-check between different levels of information and analyze landscape intervention alternatives in order to minimize environmental impacts. When applying the method in the Coxilha Rica it was possible mapping the visibility polygon, taking human visual acuity into consideration, based on historical farms and stone-walled corridors;and making spatial analyses to explore alternatives to intervention (installation of power transmission systems) in order to preserve the scenic environment of the region. In the end, the decision was by does not construct the system.
基金supported by the National Science and Technology Innovation 2030 Major Program(2022ZD0204802,2022ZD0204804)the National Natural Science Foundation of China(31930053,32171039)Beijing Academy of Artificial Intelligence(BAAI)。
文摘The concept of receptive field(RF) is central to sensory neuroscience. Neuronal RF properties have been substantially studied in animals,while those in humans remain nearly unexplored. Here, we measured neuronal RFs with intracranial local field potentials(LFPs) and spiking activity in human visual cortex(V1/V2/V3). We recorded LFPs via macro-contacts and discovered that RF sizes estimated from lowfrequency activity(LFA, 0.5–30 Hz) were larger than those estimated from low-gamma activity(LGA, 30–60 Hz) and high-gamma activity(HGA, 60–150 Hz). We then took a rare opportunity to record LFPs and spiking activity via microwires in V1 simultaneously. We found that RF sizes and temporal profiles measured from LGA and HGA closely matched those from spiking activity. In sum, this study reveals that spiking activity of neurons in human visual cortex could be well approximated by LGA and HGA in RF estimation and temporal profile measurement, implying the pivotal functions of LGA and HGA in early visual information processing.
文摘This study explores the complex relationship between climate change and human development. The aim is to understand how climate change affects human development across countries, regions, and the global population. Visual analytics were used to examine the impact of various climate change indicators on different aspects of human development. The study highlights the urgent need for climate change action and encourages policymakers to make decisive moves. Climate change adversely affects numerous aspects of daily life, leading to significant consequences that must be addressed through policy changes and global governance recommendations. Key findings include that regions with higher CO2 emissions experience a significantly higher incidence of life-threatening diseases compared to regions with lower emissions. Additionally, higher CO2 emissions correlate with consistent death rates. Increased pollution exposure is associated with a higher prevalence of life-threatening diseases and higher rates of malnutrition. Moreover, greater mineral depletion is linked to more frequent life-threatening diseases, suggesting that industrialization contributes to adverse health effects. These results provide valuable insights for policy and decision-making aimed at mitigating the impact of climate change on human development.
文摘In times of digitalisation, visual assistance systems in assembly are increasingly important. The design of these assembly systems needs to be highly complex to meet the requirements. Due to the increasing number of variants in production processes, as well as shorter innovation and product life cycles, assistance systems should improve quality and reduce complexity of assembly processes. However, many large kitchen manufacturers still assemble kitchen cabinets manually, due to the high variety of components, such as rails and fittings. This paper focuses on the analysis and evaluation of virtual assistance systems to improve quality and usability in individualised kitchen cabinet assembly processes at a large German manufacturer. A solution is identified and detailed.
文摘In order to achieve higher image compression ratio and improve visual perception of the decompressed image, a novel color image compression scheme based on the contrast sensitivity characteristics of the human visual system (HVS) is proposed. In the proposed scheme, firstly the image is converted into the YCrCb color space and divided into sub-blocks. Afterwards, the discrete cosine transform is carried out for each sub-block, and three quantization matrices are built to quantize the frequency spectrum coefficients of the images by combining the contrast sensitivity characteristics of HVS. The Huffman algorithm is used to encode the quantized data. The inverse process involves decompression and matching to reconstruct the decompressed color image. And simulations are carried out for two color images. The results show that the average structural similarity index measurement (SSIM) and peak signal to noise ratio (PSNR) under the approximate compression ratio could be increased by 2.78% and 5.48%, respectively, compared with the joint photographic experts group (JPEG) compression. The results indicate that the proposed compression algorithm in the text is feasible and effective to achieve higher compression ratio under ensuring the encoding and image quality, which can fully meet the needs of storage and transmission of color images in daily life.
基金This work was supported by National Natural Science Foundation of China(Nos.61831015 and 61901260)Key Research and Development Program of China(No.2019YFB1405902).
文摘Objective image quality assessment(IQA)plays an important role in various visual communication systems,which can automatically and efficiently predict the perceived quality of images.The human eye is the ultimate evaluator for visual experience,thus the modeling of human visual system(HVS)is a core issue for objective IQA and visual experience optimization.The traditional model based on black box fitting has low interpretability and it is difficult to guide the experience optimization effectively,while the model based on physiological simulation is hard to integrate into practical visual communication services due to its high computational complexity.For bridging the gap between signal distortion and visual experience,in this paper,we propose a novel perceptual no-reference(NR)IQA algorithm based on structural computational modeling of HVS.According to the mechanism of the human brain,we divide the visual signal processing into a low-level visual layer,a middle-level visual layer and a high-level visual layer,which conduct pixel information processing,primitive information processing and global image information processing,respectively.The natural scene statistics(NSS)based features,deep features and free-energy based features are extracted from these three layers.The support vector regression(SVR)is employed to aggregate features to the final quality prediction.Extensive experimental comparisons on three widely used benchmark IQA databases(LIVE,CSIQ and TID2013)demonstrate that our proposed metric is highly competitive with or outperforms the state-of-the-art NR IQA measures.
文摘Vision-simulated imagery―the process of generating images that mimic the human visual system―is a valuable tool with a wide spectrum of possible applications, including visual acuity measurements, personalized planning of corrective lenses and surgeries, vision-correcting displays, vision-related hardware development, and extended reality discomfort reduction. A critical property of human vision is that it is imperfect because of the highly influential wavefront aberrations that vary from person to person. This study provides an overview of the existing computational image generation techniques that properly simulate human vision in the presence of wavefront aberrations. These algorithms typically apply ray tracing with a detailed description of the simulated eye or utilize the point-spread func-tion of the eye to perform convolution on the input image. Based on the description of the vision simulation tech-niques, several of their characteristic features have been evaluated and some potential application areas and research directions have been outlined.
基金Supported by the Chongqing Internationa Cooperation Key Projects(No.CSTC2013GJHZ10004)National Basic Research Program of China(973 Program No.2013CB967002)
文摘AIM: To explore whether ectopic expression of human melanopsin can effectively and safely restore visual function in rd1 mice.· METHODS: Hematoxylin-eosin staining of retinal sections from rd1 mice was used to detect the thickness of the outer nuclear layer to determine the timing of surgery. We constructed a human melanopsinAAV2/8 viral vector and injected it into the subretinal space of rd1 mice. The Phoenix Micron IV system was used to exclude the aborted injections, and immunohistochemistry was used to validate the ectopic expression of human melanopsin. Furthermore, visual electrophysiology and behavioral tests were used to detect visual function 30 and 45 d after the injection. The structure of the retina was compared between the human melanopsin-injected group and phosphate buffer saline(PBS)-injected group.·RESULTS: Retinas of rd1 mice lost almost all of their photoreceptors on postnatal day 28(P28). We therefore injected the human melanopsin-adeno-associated virus(AAV) 2/8 viral vector into P30 rd1 mice. After excluding aborted injections, we used immunohistochemistry of the whole mount retina to confirm the ectopic expression of human melanopsin by co-expression of human melanopsin and YFP that was carried by a viral vector. At30 d post-injection, visual electrophysiology and the behavioral test significantly improved. However,restoration of vision disappeared 45 d after human melanopsin injection. Notably, human melanopsin-injected mice did not show any structural differences in their retinas compared with PBS-injected mice.·CONCLUSION: Ectopic expression of human melanopsin effectively and safely restores visual function in rd1
文摘红外小目标的检测一直是红外追踪系统的关键技术,针对现有红外小目标检测方法在复杂背景下易造成虚警、检测速度慢的不足,从人类视觉系统的角度出发,参考了多尺度局部能量因子检测方法(multiscale local contrast measure using a local energy factor,MLCM-LEF),提出了一种基于双层局部能量因子的红外小目标检测方法.从局部能量差异与局部亮度差异两个角度进行目标检测,使用双层局部能量因子从能量角度描述小目标与背景的相异程度,同时采取加权亮度差因子从亮度角度对图像进行目标检测,通过二维高斯融合上述二者的处理结果,最终利用图像均值和标准差进行自适应阈值分割,提取红外小目标.经过公开数据集实验测试,该方法在抑制背景噪声、减低虚警概率的表现上比主流的检测方法有所提升,与MLCM-LEF算法相比,基于双层局部能量因子的方法将单帧检测时间降低至三分之一.