Point-based rendering is a common method widely used in point cloud rendering.It realizes rendering by turning the points into the base geometry.The critical step in point-based rendering is to set an appropriate rend...Point-based rendering is a common method widely used in point cloud rendering.It realizes rendering by turning the points into the base geometry.The critical step in point-based rendering is to set an appropriate rendering radius for the base geometry,usually calculated using the average Euclidean distance of the N nearest neighboring points to the rendered point.This method effectively reduces the appearance of empty spaces between points in rendering.However,it also causes the problem that the rendering radius of outlier points far away from the central region of the point cloud sequence could be large,which impacts the perceptual quality.To solve the above problem,we propose an algorithm for point-based point cloud rendering through outlier detection to optimize the perceptual quality of rendering.The algorithm determines whether the detected points are outliers using a combination of local and global geometric features.For the detected outliers,the minimum radius is used for rendering.We examine the performance of the proposed method in terms of both objective quality and perceptual quality.The experimental results show that the peak signal-to-noise ratio(PSNR)of the point cloud sequences is improved under all geometric quantization,and the PSNR improvement ratio is more evident in dense point clouds.Specifically,the PSNR of the point cloud sequences is improved by 3.6%on average compared with the original algorithm.The proposed method significantly improves the perceptual quality of the rendered point clouds and the results of ablation studies prove the feasibility and effectiveness of the proposed method.展开更多
BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the...BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the treatment of amblyopia. Perceptual learning, a new method for treating amblyopia, improves the ability to process signals from the cerebral optic nerve system by specific visual stimulation and visual learning, as well as activation of the visual signal pathway utilizing brain nervous system plasticity. OBJECTIVE: This study investigated and evaluated the curative effects of perceptual learning, which can directionally increase brain plasticity, on the treatment of amblyopia in children. The relationship between curative effect and time was also analyzed. DESIGN: A self-control experiment. SETTING: Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region. PARTICIPANTS: A total of 125 amblyopic children (250 amblyopic eyes), 73 males, 52 females, averaging (6±2) years of age, received treatment at the Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region between September 2006 and February 2007 and were recruited for this study. All children presented with no structural disease of the eyeballs. Written informed consent for therapeutic regiments was obtained from each child's parent. The protocol received approval from the Hospital's Ethics Committee. METHODS: Visual function was tested with a perceptual learning system (Research Center for Human Health and Development of Sun Yat-sen University, National Engineering Technique Research Center for Medical Care Implement) for visual noise, position noise, contour discrimination, contrast sensitivity, grating stereogram, and random-dot fusion. These tests helped to evaluate the efficiency of visual information processing of these children, and to determine the degree of defects of the optic nerve cells and the connections of visual cortical neurons. According to results of visual function tests, individualized treatment was adopted for each amblyopia patient using perceptual learning system. One course of treatment lasted one month, and treatment was performed twice every day with two training procedures (each training procedure lasted for ten minutes). There was a ten-minute time interval between the two training procedures. The training treatment was performed in a quiet and dark environment. Visual acuity and recovery of visual function were tested every month. Original training procedure was continued or adjusted according to the results of visual function. MAIN OUTCOME MEASURES: Visual function change; relationship of curative effects and curative time. RESULTS: A total of 125 amblyopia children were included in the final analysis. The total efficiency of perceptual learning for treating amblyopia in children was 75.2%. Visual acuity began to greatly increase 3 months after treatment (P 〈 0.05). Visual acuity was best corrected from 0.60 ± 0.23 before treatment to 0.86 ± 0.26 after treatment (P 〈 0.05). The mean time to reach improved levels with curative effects was (2.82 ± 1.30) months, and to reach a basically cured level was (2.87 ±1.40) months. Percentage of improved visual acuity was the highest [98% (39/40)] in children that received 3 months of treatment and the lowest [55% (31/56)] in children that received 1 month of treatment (P 〈 0.05). The percentage of basically cured levels with curative effects increased with length of learning time and was the greatest in children that received 4 months of treatment [67% (31/46), P 〈 0.05]. CONCLUSION: Perceptual learning rapidly and remarkably improves visual function of amblyopia children; however, the curative effects are first apparent two and three months after intervention.展开更多
The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the...The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the academia as well as the industry. On the other hand, efficient search of the huge amount of images has become a great challenge. Image hashing is a technique suitable for use in image authentication and content based image retrieval (CBIR). In this article, we review some representative image hashing techniques proposed in the recent years, with emphases on how to meet the conflicting requirements of perceptual robustness and security. Following a brief introduction to some earlier methods, we focus on a typical two-stage structure and some geometric-distortion resilient techniques. We then introduce two image hashing approaches developed in our own research, and reveal security problems in some existing methods due to the absence of secret keys in certain stage of the image feature extraction, or availability of a large quantity of images, keys, or the hash function to the adversary. More research efforts are needed in developing truly robust and secure image hashing techniques.展开更多
Most of Image Quality Assessment (IQA) metrics consist of two processes. In the first process, quality map of image is measured locally. In the second process, the last quality score is converted from the quality map ...Most of Image Quality Assessment (IQA) metrics consist of two processes. In the first process, quality map of image is measured locally. In the second process, the last quality score is converted from the quality map by using the pooling strategy. The first process had been made effective and significant progresses, while the second process was always done in simple ways. In the second process of the pooling strategy, the optimal perceptual pooling weights should be determined and computed according to Human Visual System (HVS). Thus, a reliable spatial pooling mathematical model based on HVS is an important issue worthy of study. In this paper, a new Visual Perceptual Pooling Strategy (VPPS) for IQA is presented based on contrast sensitivity and luminance sensitivity of HVS. Experimental results with the LIVE database show that the visual perceptual weights, obtained by the proposed pooling strategy, can effectively and significantly improve the performances of the IQA metrics with Mean Structural SIMilarity (MSSIM) or Phase Quantization Code (PQC). It is confirmed that the proposed VPPS demonstrates promising results for improving the performances of existing IQA metrics.展开更多
Currently,polarization visualization strategies are accomplished by mapping polarization information into a perceptually uniform color appearance model CAM02-UCS.However,the deviation of the CAM02-UCS space from the l...Currently,polarization visualization strategies are accomplished by mapping polarization information into a perceptually uniform color appearance model CAM02-UCS.However,the deviation of the CAM02-UCS space from the lightness prediction results in an inaccurate match between the polarization information and the perceptual information.In this paper,we propose a novel polarization visualization strategy based on the perceptual uniform space Jzazbz.The polarization visualization be completed by placing the polarization information into the lightness Jz,colorfulness Cz and hue angle hz channels of the Jzazbz space.The experimental results show that the proposed method can significantly improve the lightness of the low irradiance and high polarization region,hence more polarization information can be sensed by human visual system.展开更多
Perceptual image quality assessment(IQA)is one of the most indispensable yet challenging problems in image processing and computer vision.It is quite necessary to develop automatic and efficient approaches that can ac...Perceptual image quality assessment(IQA)is one of the most indispensable yet challenging problems in image processing and computer vision.It is quite necessary to develop automatic and efficient approaches that can accurately predict perceptual image quality consistently with human subjective evaluation.To further improve the prediction accuracy for the distortion of color images,in this paper,we propose a novel effective and efficient IQA model,called perceptual gradient similarity deviation(PGSD).Based on the gradient magnitude similarity,we proposed a gradient direction selection method to automatically determine the pixel-wise perceptual gradient.The luminance and chrominance channels are both took into account to characterize the quality degradation caused by intensity and color distortions.Finally,a multi-scale strategy is utilized and pooled with different weights to incorporate image details at different resolutions.Experimental results on LIVE,CSIQ and TID2013 databases demonstrate the superior performances of the proposed algorithm.展开更多
Image classifiers that based on Deep Neural Networks(DNNs)have been proved to be easily fooled by well-designed perturbations.Previous defense methods have the limitations of requiring expensive computation or reducin...Image classifiers that based on Deep Neural Networks(DNNs)have been proved to be easily fooled by well-designed perturbations.Previous defense methods have the limitations of requiring expensive computation or reducing the accuracy of the image classifiers.In this paper,we propose a novel defense method which based on perceptual hash.Our main goal is to destroy the process of perturbations generation by comparing the similarities of images thus achieve the purpose of defense.To verify our idea,we defended against two main attack methods(a white-box attack and a black-box attack)in different DNN-based image classifiers and show that,after using our defense method,the attack-success-rate for all DNN-based image classifiers decreases significantly.More specifically,for the white-box attack,the attack-success-rate is reduced by an average of 36.3%.For the black-box attack,the average attack-success-rate of targeted attack and non-targeted attack has been reduced by 72.8%and 76.7%respectively.The proposed method is a simple and effective defense method and provides a new way to defend against adversarial samples.展开更多
On the basis of psychological acoustic theories and experiments, this paper proposes an acoustic model which is based on acoustic perceptual feature. Compared with the physiological acoustics based acoustic model, thi...On the basis of psychological acoustic theories and experiments, this paper proposes an acoustic model which is based on acoustic perceptual feature. Compared with the physiological acoustics based acoustic model, this model is more suitable to represent human’s perceptual features of continuous speech, so it is suitable for recognition of continuous speech.展开更多
Campus is a direct carrier for higher education and culture communication. Good landscape sculpture can produce great cohesion and impetus, and it can play an important role in campus landscape construction and cultur...Campus is a direct carrier for higher education and culture communication. Good landscape sculpture can produce great cohesion and impetus, and it can play an important role in campus landscape construction and cultural communication. Based on the premise of constructing a high-level experience campus culture, this paper analyzes and studies the landscape sculptures of three polytechnic universities in Chengdu from the perspective of perceptual engineering, and analyzes the historical inheritance and school-running characteristics of the three universities, to compare the campus spirit conveyed by the sculptures and the perceptual cognition felt by the teachers and students. The differences between the existing campus landscape sculptures and the campus culture construction goal are obtained, and the corresponding campus landscape sculpture promotion strategy is proposed.展开更多
Based on perceptual control theory,a task analysis approach is proposed to describe more accurately user tasks in dynamic environments,which is of more powerful and flexible descriptive ability. Theoretically,a task m...Based on perceptual control theory,a task analysis approach is proposed to describe more accurately user tasks in dynamic environments,which is of more powerful and flexible descriptive ability. Theoretically,a task meta model is established to describe the interactive process in an individual,dynamic,and flexible way.Methodologically,an implementation framework is illustrated to map the user-oriented description into implementation-oriented models,which will be as a technical tool to transform from a task model to a user interface prototype.展开更多
Virtual reality(VR) environment can provide immersive experience to viewers.Under the VR environment, providing a good quality of experience is extremely important.Therefore, in this paper, we present an image quality...Virtual reality(VR) environment can provide immersive experience to viewers.Under the VR environment, providing a good quality of experience is extremely important.Therefore, in this paper, we present an image quality assessment(IQA) study on omnidirectional images. We first build an omnidirectional IQA(OIQA) database, including 16 source images with their corresponding 320 distorted images. We add four commonly encountered distortions. These distortions are JPEG compression, JPEG2000 compression, Gaussian blur, and Gaussian noise. Then we conduct a subjective quality evaluation study in the VR environment based on the OIQA database. Considering that visual attention is more important in VR environment, head and eye movement data are also tracked and collected during the quality rating experiments. The 16 raw and their corresponding distorted images,subjective quality assessment scores, and the head-orientation data and eye-gaze data together constitute the OIQA database. Based on the OIQA database, we test some state-of-the-art full-reference IQA(FR-IQA) measures on equirectangular format or cubic formatomnidirectional images. The results show that applying FR-IQA metrics on cubic format omnidirectional images could improve their performance. The performance of some FR-IQA metrics combining the saliency weight of three different types are also tested based on our database. Some new phenomena different from traditional IQA are observed.展开更多
A watermarking scheme designed for remote sensing images needs to meet the same demand of both invisibility as for ordinary digital images. Due to specific perceptual characteristics of Synthetic Aperture Radar(SAR) i...A watermarking scheme designed for remote sensing images needs to meet the same demand of both invisibility as for ordinary digital images. Due to specific perceptual characteristics of Synthetic Aperture Radar(SAR) images, the watermarking algorithms with consideration of Human Vision System(HVS) modeling from optical images give poor performance when applied on SAR images. This paper examines a variety of factors affecting the noise sensitivity, and further proposes a refined pixel-wise masking approach for watermarking on SAR images. The proposed approach is applied on logarithmic transformed SAR images, and has increased the acceptable watermark embedding strength by about 6 dB to 10 dB while achieving the same levels of watermarked image visual quality. Experimental results show that this approach enhanced the perceptual invisibility of watermarking based on wavelet decomposition.展开更多
A mathematical model of perceptual symbol system is developed. This development requires new mathematical methods of dynamic logic (DL), which have overcome limitations of classical artificial intelligence and connect...A mathematical model of perceptual symbol system is developed. This development requires new mathematical methods of dynamic logic (DL), which have overcome limitations of classical artificial intelligence and connectionist approaches. The paper discusses these past limitations, relates them to combinatorial complexity (exponential explosion) of algorithms in the past, and relates it further to the static nature of classical logic. DL is a process-logic;its salient property is evolution of vague representations into crisp. We first consider one aspect of PSS: situation learning from object perceptions. Next DL is related to PSS mechanisms of concepts, simulators, grounding, embodiment, productiveity, binding, recursion, and to the mechanisms relating embodied-grounded and amodal symbols. We discuss DL capability for modeling cognition on multiple levels of abstraction. PSS is extended toward interaction between cognition and language. Experimental predictions of the theory are discussed. They might influence experimental psychology and impact future theoretical developments in cognitive science, including knowledge representation, and mechanisms of interaction between perception, cognition, and language. All mathematical equations are also discussed conceptually, so mathematical understanding is not required. Experimental evidence for DL and PSS in brain imaging is discussed as well as future research directions.展开更多
In this work a new technique for global perceptual codes (GPCs) extraction using genetic algorithms (GA) is presented. GAs are employed to extract the GPCs in order to reduce the original number of features and to pro...In this work a new technique for global perceptual codes (GPCs) extraction using genetic algorithms (GA) is presented. GAs are employed to extract the GPCs in order to reduce the original number of features and to provide meaningful representations of the original data. In this technique the GPCs are build from a certain combination of elementary perceptual codes (EPCs) which are provided by the Beta-elliptic model for the generation of complex handwriting movements. Indeed, in this model each script is modelled by a set of elliptic arcs. We associate to each arc an EPC. In the proposed technique we defined four types of EPCs. The GPCs can be formed by many possible combinations of EPCs depending on their number and types. So that, the problem of choosing the right combination for each GPC can be regarded as a global optimization problem which is treated in this work using the GAs. Several simulation examples are presented to evaluate the interest and the efficiency of the proposed technique.展开更多
Recent theories on natural and synthetic consciousness overlook the geometric structure necessary for awareness of 3-dimensional space, as strikingly illustrated by left-neglect disorder. Furthermore, awareness of 3-d...Recent theories on natural and synthetic consciousness overlook the geometric structure necessary for awareness of 3-dimensional space, as strikingly illustrated by left-neglect disorder. Furthermore, awareness of 3-dimensional space entails some surprisingly tenacious optical illusions, as demonstrated by an experiment in the text. Awareness of linear time is also crucial and complex. As a consequence, synthetic consciousness cannot be realized by simply intercomnecting a large number of electronic circuits constructed from ordinary chips and transistors. Since consciousness is a subjective experience, there is no sufficient condition for consciousness that can be experimentally confirmed. The most we can hope for is agreement on the necessary conditions for consciousness. Toward that end, this paper reviews some relevant clinical phenomena.展开更多
As mankind was one kind of aesthetic species,the visual study of Marxist philosophy has profound enlightenment in understanding the New Year paintings' visual meaning,form,development,cultivation,and features etc....As mankind was one kind of aesthetic species,the visual study of Marxist philosophy has profound enlightenment in understanding the New Year paintings' visual meaning,form,development,cultivation,and features etc.In essence,Chinese folk New Year paintings is a visual expression,its nature of being pasted at home makes it a visual encirclement for the Chinese people,this vision encirclement in the home space gradually expands,constitutes a spiritual home with different cultural space,living environment,image schema,and secular and religious faith scope,in this visual encirclement,people's home life achieves the satisfaction,peace and self–comfort in diversity.The"childrenization"of New Year paintings reveals a close relationship between the immature of folk art and the original art as human childhood,and from the both aspects of form and content reflects the visual encirclement is actually a kind of siege with cultural significance,this siege has multiple levels in value ideas,historical tradition,ethical order and life knowledge,aesthetic taste and so on,makes us have a new aesthetic anthropology on the New Year paintings.展开更多
The two mast cameras, Mastcams, onboard Mars rover Curiosity are multispectral imagers with nine bands in each. Currently, the images are compressed losslessly using JPEG, which can achieve only two to three times of ...The two mast cameras, Mastcams, onboard Mars rover Curiosity are multispectral imagers with nine bands in each. Currently, the images are compressed losslessly using JPEG, which can achieve only two to three times of compression. We present a comparative study of four approaches to compressing multispectral Mastcam images. The first approach is to divide the nine bands into three groups with each group having three bands. Since the multispectral bands have strong correlation, we treat the three groups of images as video frames. We call this approach the Video approach. The second approach is to compress each group separately and we call it the split band (SB) approach. The third one is to apply a two-step approach in which the first step uses principal component analysis (PCA) to compress a nine-band image cube to six bands and a second step compresses the six PCA bands using conventional codecs. The fourth one is to apply PCA only. In addition, we also present subjective and objective assessment results for compressing RGB images because RGB images have been used for stereo and disparity map generation. Five well-known compression codecs, including JPEG, JPEG-2000 (J2K), X264, X265, and Daala in the literature, have been applied and compared in each approach. The performance of different algorithms was assessed using four well-known performance metrics. Two are conventional and another two are known to have good correlation with human perception. Extensive experiments using actual Mastcam images have been performed to demonstrate the various approaches. We observed that perceptually lossless compression can be achieved at 10:1 compression ratio. In particular, the performance gain of the SB approach with Daala is at least 5 dBs in terms peak signal-to-noise ratio (PSNR) at 10:1 compression ratio over that of JPEG. Subjective comparisons also corroborated with the objective metrics in that perceptually lossless compression can be achieved even at 20 to 1 compression.展开更多
Numerous perceptual hashing algorithms have been developed for identification and verification of multimedia objects in recent years. Many application schemes have been adopted for various commercial objects. Develope...Numerous perceptual hashing algorithms have been developed for identification and verification of multimedia objects in recent years. Many application schemes have been adopted for various commercial objects. Developers and users are looking for a benchmark tool to compare and evaluate their current algorithms or technologies. In this paper, a novel benchmark platform is presented. PHABS provides an open framework and lets its users define their own test strategy, perform tests, collect and analyze test data. With PHABS, various performance parameters of algorithms can be tested, and different algorithms or algorithms with different parameters can be evaluated and compared easily.展开更多
文摘Point-based rendering is a common method widely used in point cloud rendering.It realizes rendering by turning the points into the base geometry.The critical step in point-based rendering is to set an appropriate rendering radius for the base geometry,usually calculated using the average Euclidean distance of the N nearest neighboring points to the rendered point.This method effectively reduces the appearance of empty spaces between points in rendering.However,it also causes the problem that the rendering radius of outlier points far away from the central region of the point cloud sequence could be large,which impacts the perceptual quality.To solve the above problem,we propose an algorithm for point-based point cloud rendering through outlier detection to optimize the perceptual quality of rendering.The algorithm determines whether the detected points are outliers using a combination of local and global geometric features.For the detected outliers,the minimum radius is used for rendering.We examine the performance of the proposed method in terms of both objective quality and perceptual quality.The experimental results show that the peak signal-to-noise ratio(PSNR)of the point cloud sequences is improved under all geometric quantization,and the PSNR improvement ratio is more evident in dense point clouds.Specifically,the PSNR of the point cloud sequences is improved by 3.6%on average compared with the original algorithm.The proposed method significantly improves the perceptual quality of the rendered point clouds and the results of ablation studies prove the feasibility and effectiveness of the proposed method.
基金Grant from Major Scientific Research Program of Medical Treatment and Public Health of Guangxi Zhuang Autonomous Region, No.200730
文摘BACKGROUND: Conventional methods (such as occlusion therapy, fine manipulation, complementary, and alternative medicine) take effects slowly, are time and labor consuming, and have uncertain curative effects in the treatment of amblyopia. Perceptual learning, a new method for treating amblyopia, improves the ability to process signals from the cerebral optic nerve system by specific visual stimulation and visual learning, as well as activation of the visual signal pathway utilizing brain nervous system plasticity. OBJECTIVE: This study investigated and evaluated the curative effects of perceptual learning, which can directionally increase brain plasticity, on the treatment of amblyopia in children. The relationship between curative effect and time was also analyzed. DESIGN: A self-control experiment. SETTING: Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region. PARTICIPANTS: A total of 125 amblyopic children (250 amblyopic eyes), 73 males, 52 females, averaging (6±2) years of age, received treatment at the Visual Science and Optometry Center, People's Hospital of Guangxi Zhuang Autonomous Region between September 2006 and February 2007 and were recruited for this study. All children presented with no structural disease of the eyeballs. Written informed consent for therapeutic regiments was obtained from each child's parent. The protocol received approval from the Hospital's Ethics Committee. METHODS: Visual function was tested with a perceptual learning system (Research Center for Human Health and Development of Sun Yat-sen University, National Engineering Technique Research Center for Medical Care Implement) for visual noise, position noise, contour discrimination, contrast sensitivity, grating stereogram, and random-dot fusion. These tests helped to evaluate the efficiency of visual information processing of these children, and to determine the degree of defects of the optic nerve cells and the connections of visual cortical neurons. According to results of visual function tests, individualized treatment was adopted for each amblyopia patient using perceptual learning system. One course of treatment lasted one month, and treatment was performed twice every day with two training procedures (each training procedure lasted for ten minutes). There was a ten-minute time interval between the two training procedures. The training treatment was performed in a quiet and dark environment. Visual acuity and recovery of visual function were tested every month. Original training procedure was continued or adjusted according to the results of visual function. MAIN OUTCOME MEASURES: Visual function change; relationship of curative effects and curative time. RESULTS: A total of 125 amblyopia children were included in the final analysis. The total efficiency of perceptual learning for treating amblyopia in children was 75.2%. Visual acuity began to greatly increase 3 months after treatment (P 〈 0.05). Visual acuity was best corrected from 0.60 ± 0.23 before treatment to 0.86 ± 0.26 after treatment (P 〈 0.05). The mean time to reach improved levels with curative effects was (2.82 ± 1.30) months, and to reach a basically cured level was (2.87 ±1.40) months. Percentage of improved visual acuity was the highest [98% (39/40)] in children that received 3 months of treatment and the lowest [55% (31/56)] in children that received 1 month of treatment (P 〈 0.05). The percentage of basically cured levels with curative effects increased with length of learning time and was the greatest in children that received 4 months of treatment [67% (31/46), P 〈 0.05]. CONCLUSION: Perceptual learning rapidly and remarkably improves visual function of amblyopia children; however, the curative effects are first apparent two and three months after intervention.
基金supported by the National Natural Science Foundation of China(Grant No.60502039),the Shanghai Rising-Star Program(Grant No.06QA14022),and the Key project of Shanghai Municipality for Basic Research (Grant No.04JC14037)
文摘The easy generation, storage, transmission and reproduction of digital images have caused serious abuse and security problems. Assurance of the rightful ownership, integrity, and authenticity is a major concern to the academia as well as the industry. On the other hand, efficient search of the huge amount of images has become a great challenge. Image hashing is a technique suitable for use in image authentication and content based image retrieval (CBIR). In this article, we review some representative image hashing techniques proposed in the recent years, with emphases on how to meet the conflicting requirements of perceptual robustness and security. Following a brief introduction to some earlier methods, we focus on a typical two-stage structure and some geometric-distortion resilient techniques. We then introduce two image hashing approaches developed in our own research, and reveal security problems in some existing methods due to the absence of secret keys in certain stage of the image feature extraction, or availability of a large quantity of images, keys, or the hash function to the adversary. More research efforts are needed in developing truly robust and secure image hashing techniques.
基金Supported by the National Natural Science Foundation of China (No. 60832003, 60902096, 61171163, 61071120)the Scientific Research Foundation of Graduate School of Ningbo University
文摘Most of Image Quality Assessment (IQA) metrics consist of two processes. In the first process, quality map of image is measured locally. In the second process, the last quality score is converted from the quality map by using the pooling strategy. The first process had been made effective and significant progresses, while the second process was always done in simple ways. In the second process of the pooling strategy, the optimal perceptual pooling weights should be determined and computed according to Human Visual System (HVS). Thus, a reliable spatial pooling mathematical model based on HVS is an important issue worthy of study. In this paper, a new Visual Perceptual Pooling Strategy (VPPS) for IQA is presented based on contrast sensitivity and luminance sensitivity of HVS. Experimental results with the LIVE database show that the visual perceptual weights, obtained by the proposed pooling strategy, can effectively and significantly improve the performances of the IQA metrics with Mean Structural SIMilarity (MSSIM) or Phase Quantization Code (PQC). It is confirmed that the proposed VPPS demonstrates promising results for improving the performances of existing IQA metrics.
基金This work was supported by the Key Research and Development Program of Shaanxi(2018ZDXM-GY-091)the National Key Research and Development Project of China(2018YFB1309403)+2 种基金the Natural National Science Foundation of China(61805199)Natural Science Basic Research Plan in Shaanxi Province of China(2018JQ6065)We would like to sincerely thank all reviewers for their helpful comments and suggestions.
文摘Currently,polarization visualization strategies are accomplished by mapping polarization information into a perceptually uniform color appearance model CAM02-UCS.However,the deviation of the CAM02-UCS space from the lightness prediction results in an inaccurate match between the polarization information and the perceptual information.In this paper,we propose a novel polarization visualization strategy based on the perceptual uniform space Jzazbz.The polarization visualization be completed by placing the polarization information into the lightness Jz,colorfulness Cz and hue angle hz channels of the Jzazbz space.The experimental results show that the proposed method can significantly improve the lightness of the low irradiance and high polarization region,hence more polarization information can be sensed by human visual system.
文摘Perceptual image quality assessment(IQA)is one of the most indispensable yet challenging problems in image processing and computer vision.It is quite necessary to develop automatic and efficient approaches that can accurately predict perceptual image quality consistently with human subjective evaluation.To further improve the prediction accuracy for the distortion of color images,in this paper,we propose a novel effective and efficient IQA model,called perceptual gradient similarity deviation(PGSD).Based on the gradient magnitude similarity,we proposed a gradient direction selection method to automatically determine the pixel-wise perceptual gradient.The luminance and chrominance channels are both took into account to characterize the quality degradation caused by intensity and color distortions.Finally,a multi-scale strategy is utilized and pooled with different weights to incorporate image details at different resolutions.Experimental results on LIVE,CSIQ and TID2013 databases demonstrate the superior performances of the proposed algorithm.
基金The work is supported by the National Key Research Development Program of China(2016QY01W0200)the National Natural Science Foundation of China NSFC(U1636101,U1736211,U1636219).
文摘Image classifiers that based on Deep Neural Networks(DNNs)have been proved to be easily fooled by well-designed perturbations.Previous defense methods have the limitations of requiring expensive computation or reducing the accuracy of the image classifiers.In this paper,we propose a novel defense method which based on perceptual hash.Our main goal is to destroy the process of perturbations generation by comparing the similarities of images thus achieve the purpose of defense.To verify our idea,we defended against two main attack methods(a white-box attack and a black-box attack)in different DNN-based image classifiers and show that,after using our defense method,the attack-success-rate for all DNN-based image classifiers decreases significantly.More specifically,for the white-box attack,the attack-success-rate is reduced by an average of 36.3%.For the black-box attack,the average attack-success-rate of targeted attack and non-targeted attack has been reduced by 72.8%and 76.7%respectively.The proposed method is a simple and effective defense method and provides a new way to defend against adversarial samples.
基金Supported by National Natural Science Foundation of China(61473176,61105077,61402260,61074149) the Excellent Young and Middle-Aged Scientist Award Grant of Shandong Province of China(BS2012DX026,BS2013DX043) the Open Program from theState Key Laboratory of Management and Control for Complex Systems(20140102)
文摘On the basis of psychological acoustic theories and experiments, this paper proposes an acoustic model which is based on acoustic perceptual feature. Compared with the physiological acoustics based acoustic model, this model is more suitable to represent human’s perceptual features of continuous speech, so it is suitable for recognition of continuous speech.
文摘Campus is a direct carrier for higher education and culture communication. Good landscape sculpture can produce great cohesion and impetus, and it can play an important role in campus landscape construction and cultural communication. Based on the premise of constructing a high-level experience campus culture, this paper analyzes and studies the landscape sculptures of three polytechnic universities in Chengdu from the perspective of perceptual engineering, and analyzes the historical inheritance and school-running characteristics of the three universities, to compare the campus spirit conveyed by the sculptures and the perceptual cognition felt by the teachers and students. The differences between the existing campus landscape sculptures and the campus culture construction goal are obtained, and the corresponding campus landscape sculpture promotion strategy is proposed.
基金Supported by the National Natural Science Foundation of China(61272286)the Specialized Research Fund for the Doctoral Program of Higher Education of China(20126101110006)
文摘Based on perceptual control theory,a task analysis approach is proposed to describe more accurately user tasks in dynamic environments,which is of more powerful and flexible descriptive ability. Theoretically,a task meta model is established to describe the interactive process in an individual,dynamic,and flexible way.Methodologically,an implementation framework is illustrated to map the user-oriented description into implementation-oriented models,which will be as a technical tool to transform from a task model to a user interface prototype.
文摘Virtual reality(VR) environment can provide immersive experience to viewers.Under the VR environment, providing a good quality of experience is extremely important.Therefore, in this paper, we present an image quality assessment(IQA) study on omnidirectional images. We first build an omnidirectional IQA(OIQA) database, including 16 source images with their corresponding 320 distorted images. We add four commonly encountered distortions. These distortions are JPEG compression, JPEG2000 compression, Gaussian blur, and Gaussian noise. Then we conduct a subjective quality evaluation study in the VR environment based on the OIQA database. Considering that visual attention is more important in VR environment, head and eye movement data are also tracked and collected during the quality rating experiments. The 16 raw and their corresponding distorted images,subjective quality assessment scores, and the head-orientation data and eye-gaze data together constitute the OIQA database. Based on the OIQA database, we test some state-of-the-art full-reference IQA(FR-IQA) measures on equirectangular format or cubic formatomnidirectional images. The results show that applying FR-IQA metrics on cubic format omnidirectional images could improve their performance. The performance of some FR-IQA metrics combining the saliency weight of three different types are also tested based on our database. Some new phenomena different from traditional IQA are observed.
文摘A watermarking scheme designed for remote sensing images needs to meet the same demand of both invisibility as for ordinary digital images. Due to specific perceptual characteristics of Synthetic Aperture Radar(SAR) images, the watermarking algorithms with consideration of Human Vision System(HVS) modeling from optical images give poor performance when applied on SAR images. This paper examines a variety of factors affecting the noise sensitivity, and further proposes a refined pixel-wise masking approach for watermarking on SAR images. The proposed approach is applied on logarithmic transformed SAR images, and has increased the acceptable watermark embedding strength by about 6 dB to 10 dB while achieving the same levels of watermarked image visual quality. Experimental results show that this approach enhanced the perceptual invisibility of watermarking based on wavelet decomposition.
文摘A mathematical model of perceptual symbol system is developed. This development requires new mathematical methods of dynamic logic (DL), which have overcome limitations of classical artificial intelligence and connectionist approaches. The paper discusses these past limitations, relates them to combinatorial complexity (exponential explosion) of algorithms in the past, and relates it further to the static nature of classical logic. DL is a process-logic;its salient property is evolution of vague representations into crisp. We first consider one aspect of PSS: situation learning from object perceptions. Next DL is related to PSS mechanisms of concepts, simulators, grounding, embodiment, productiveity, binding, recursion, and to the mechanisms relating embodied-grounded and amodal symbols. We discuss DL capability for modeling cognition on multiple levels of abstraction. PSS is extended toward interaction between cognition and language. Experimental predictions of the theory are discussed. They might influence experimental psychology and impact future theoretical developments in cognitive science, including knowledge representation, and mechanisms of interaction between perception, cognition, and language. All mathematical equations are also discussed conceptually, so mathematical understanding is not required. Experimental evidence for DL and PSS in brain imaging is discussed as well as future research directions.
文摘In this work a new technique for global perceptual codes (GPCs) extraction using genetic algorithms (GA) is presented. GAs are employed to extract the GPCs in order to reduce the original number of features and to provide meaningful representations of the original data. In this technique the GPCs are build from a certain combination of elementary perceptual codes (EPCs) which are provided by the Beta-elliptic model for the generation of complex handwriting movements. Indeed, in this model each script is modelled by a set of elliptic arcs. We associate to each arc an EPC. In the proposed technique we defined four types of EPCs. The GPCs can be formed by many possible combinations of EPCs depending on their number and types. So that, the problem of choosing the right combination for each GPC can be regarded as a global optimization problem which is treated in this work using the GAs. Several simulation examples are presented to evaluate the interest and the efficiency of the proposed technique.
文摘Recent theories on natural and synthetic consciousness overlook the geometric structure necessary for awareness of 3-dimensional space, as strikingly illustrated by left-neglect disorder. Furthermore, awareness of 3-dimensional space entails some surprisingly tenacious optical illusions, as demonstrated by an experiment in the text. Awareness of linear time is also crucial and complex. As a consequence, synthetic consciousness cannot be realized by simply intercomnecting a large number of electronic circuits constructed from ordinary chips and transistors. Since consciousness is a subjective experience, there is no sufficient condition for consciousness that can be experimentally confirmed. The most we can hope for is agreement on the necessary conditions for consciousness. Toward that end, this paper reviews some relevant clinical phenomena.
文摘As mankind was one kind of aesthetic species,the visual study of Marxist philosophy has profound enlightenment in understanding the New Year paintings' visual meaning,form,development,cultivation,and features etc.In essence,Chinese folk New Year paintings is a visual expression,its nature of being pasted at home makes it a visual encirclement for the Chinese people,this vision encirclement in the home space gradually expands,constitutes a spiritual home with different cultural space,living environment,image schema,and secular and religious faith scope,in this visual encirclement,people's home life achieves the satisfaction,peace and self–comfort in diversity.The"childrenization"of New Year paintings reveals a close relationship between the immature of folk art and the original art as human childhood,and from the both aspects of form and content reflects the visual encirclement is actually a kind of siege with cultural significance,this siege has multiple levels in value ideas,historical tradition,ethical order and life knowledge,aesthetic taste and so on,makes us have a new aesthetic anthropology on the New Year paintings.
文摘The two mast cameras, Mastcams, onboard Mars rover Curiosity are multispectral imagers with nine bands in each. Currently, the images are compressed losslessly using JPEG, which can achieve only two to three times of compression. We present a comparative study of four approaches to compressing multispectral Mastcam images. The first approach is to divide the nine bands into three groups with each group having three bands. Since the multispectral bands have strong correlation, we treat the three groups of images as video frames. We call this approach the Video approach. The second approach is to compress each group separately and we call it the split band (SB) approach. The third one is to apply a two-step approach in which the first step uses principal component analysis (PCA) to compress a nine-band image cube to six bands and a second step compresses the six PCA bands using conventional codecs. The fourth one is to apply PCA only. In addition, we also present subjective and objective assessment results for compressing RGB images because RGB images have been used for stereo and disparity map generation. Five well-known compression codecs, including JPEG, JPEG-2000 (J2K), X264, X265, and Daala in the literature, have been applied and compared in each approach. The performance of different algorithms was assessed using four well-known performance metrics. Two are conventional and another two are known to have good correlation with human perception. Extensive experiments using actual Mastcam images have been performed to demonstrate the various approaches. We observed that perceptually lossless compression can be achieved at 10:1 compression ratio. In particular, the performance gain of the SB approach with Daala is at least 5 dBs in terms peak signal-to-noise ratio (PSNR) at 10:1 compression ratio over that of JPEG. Subjective comparisons also corroborated with the objective metrics in that perceptually lossless compression can be achieved even at 20 to 1 compression.
基金European Network of Excellence for cryptology, the National Natural Science Foundation of China(60671064)the Foundation for the Author of National Excellent Doctoral Dissertation of China (FANEDD-200238)+1 种基金the Foundation for the ExcellentYouth of Heilongjiang Provincethe Program for New Century Excellent Talents in University (NCET-04-0330)
文摘Numerous perceptual hashing algorithms have been developed for identification and verification of multimedia objects in recent years. Many application schemes have been adopted for various commercial objects. Developers and users are looking for a benchmark tool to compare and evaluate their current algorithms or technologies. In this paper, a novel benchmark platform is presented. PHABS provides an open framework and lets its users define their own test strategy, perform tests, collect and analyze test data. With PHABS, various performance parameters of algorithms can be tested, and different algorithms or algorithms with different parameters can be evaluated and compared easily.