Information flow between the prefrontal and visual cortices is critical for visual behaviors such as visual search. To investigate its mechanisms, we simultaneously recorded spike and local field potential (LFP) signa...Information flow between the prefrontal and visual cortices is critical for visual behaviors such as visual search. To investigate its mechanisms, we simultaneously recorded spike and local field potential (LFP) signals in the frontal eye field (FEF) and area V4 while monkeys performed a free-gaze visual search task. During free-gaze search, spike-LFP coherence between FEF and V4 was enhanced in the theta rhythm (4–8 Hz) but suppressed in the alpha rhythm (8–13 Hz). Cross-frequency couplings during the Cue period before the search phase were related to monkey performance, with higher FEF theta-V4 gamma coupling and lower FEF alpha-V4 gamma coupling associated with faster search. Finally, feature-based attention during search enhanced spike-LFP coherence between FEF and V4 in the gamma and beta rhythms, whereas overt spatial attention reduced coherence at frequencies up to 30 Hz. These results suggest that oscillatory coupling may play an important role in mediating interactions between the prefrontal and visual cortices during visual search.展开更多
Visual search has been a long-standing problem in applications such as location recognition and product search. Much research has been done on image representation, matching, indexing, and retrieval. Key component tec...Visual search has been a long-standing problem in applications such as location recognition and product search. Much research has been done on image representation, matching, indexing, and retrieval. Key component technologies for visual search have been developed, and numerous real-world applications are emerging. To ensure application interoperability, the Moving Picture Experts Group (MPEG) has begun standardizing visuaJ search technologies and is developing the compact descriptors for visua) search (CDVS) standard. MPEG seeks to develop a collaborative platform for evaluating existing visual search technologies. Peking University has participated in this standardization since the 94th MPEG meeting, and significant progress has been made with the various proposals. A test model (TM) has been selected to determine the basic pipeline and key components of visual search. However, the first-version TM has high computational complexity and imperfect retrieval and matching. Core experiments have therefore been set up to improve TM. In this article, we summarize key technologies for visual search and report the progress of MPEG CDVS. We discuss Peking University' s efforts in CDVS and also discuss unresolved issues.展开更多
This paper introduces an approach for visual tracking of multi-target with occlusion occurrence. Based on the author's previous work in which the Overlap Coefficient (OC) is used to detect the occlusion, in this p...This paper introduces an approach for visual tracking of multi-target with occlusion occurrence. Based on the author's previous work in which the Overlap Coefficient (OC) is used to detect the occlusion, in this paper a method of combining Bhattacharyya Coefficient (BC) and Kalman filter innovation term is proposed as the criteria for jointly detecting the occlusion occurrence. Fragmentation of target is introduced in order to closely monitor the occlusion development. In the course of occlusion, the Kalman predictor is applied to determine the location of the occluded target, and the criterion for checking the re-appearance of the occluded target is also presented. The proposed approach is put to test on a standard video sequence, suggesting the satisfactory performance in multi-target tracking.展开更多
In this paper,two methods are proposed to embed visual watermark into direct binary search(DBS)halftone images,which are called Adjusted Direct Binary Search(ADBS)and Dual Adjusted Direct Binary Search(DADBS).DADBS is...In this paper,two methods are proposed to embed visual watermark into direct binary search(DBS)halftone images,which are called Adjusted Direct Binary Search(ADBS)and Dual Adjusted Direct Binary Search(DADBS).DADBS is an improved version of ADBS.By using the proposed methods,the visual watermark will be embedded into two halftone images separately,thus,the watermark can be revealed when these two halftone images are overlaid.Experimental results show that both methods can achieve excellent image visual quality and decoded visual patterns.展开更多
Background:Age related macular degeneration(AMD)is one of the main causes of vision loss in older adults,generating,in most cases,a central scotoma that reduces central visual acuity(Noble&Chaudhary,2010).People a...Background:Age related macular degeneration(AMD)is one of the main causes of vision loss in older adults,generating,in most cases,a central scotoma that reduces central visual acuity(Noble&Chaudhary,2010).People affected by AMD have to rely on peripheral visual information and would highly benefit from efficiently allocating their attention to the periphery.Indeed,attention can improve peripheral spatial resolution(Carrasco,Ling&Read,2004)and can be allocated to a certain expanse of space outside of the central visual span,known as the attentional span.Attentional span has been shown to be decreased in people with AMD with less attention allocated to the periphery and more to the central visual field(Cheong et al.,2008),however it remains unknown whether aging is also a contributing factor.Methods:Fourteen healthy younger(mean age=21.8 years,SD=1.5)and 8 older adults(mean age=69.6 years,SD=7.3)performed a pop-out and a serial version of a visual search task,in the presence of different sized gaze-contingent invisible and visible artificial central scotomata(no scotoma,3°diameter,5°and 7°).Participants were asked to indicate as quickly as possible whether a target was present or not among distractors whose number varied(16,32 or 64 objects).We wished to determine whether the size of the scotoma,occluding different degrees of central vision,affected visual search differently for younger vs.older participants.Results:Both the younger and older participants showed higher reaction times(RTs)to find the target for the serial version(M=2,074 ms for younger adults,M=3,853 ms for older adults)compared to the pop-out version(M=866 ms,M=1,475 ms,P<0.001)and for more distractors(32 distractors compared to 16,and 64 compared to 32,P<0.01).Older adults showed longer RTs than younger adults for both versions of the task(P<0.01).We found a significant effect of scotoma size on older adults(3°scotoma M=3,276 ms;7°scotoma M=3,877 ms,P<0.05),however,accurate performance was higher with no scotoma(96%vs.92%,P<0.05)in the pop-out search task.This suggests that older participants privileged a fast decision at the expense of performance in those cases.For the younger adults,RTs were higher in the serial search task in the presence of a scotoma(M=2,074 ms)compared to the control condition(M=1,665 ms,P>0.05).Conclusions:These results suggest that older adults take longer to perform visual search compared to younger adults and tend to use peripheral visual less than younger adults;larger central scotomas disrupted their performance but not that of younger participants,who performed equally well with different central scotoma sizes.These findings suggest that aging is a contributing factor in the decrease of the peripheral attentional span.展开更多
Background:It has been suggested that older adults show a reduced attentional field compared to younger adults.This may be attributed to a poorer utilization of peripheral vision(i.e.,peripheral attentional allocation...Background:It has been suggested that older adults show a reduced attentional field compared to younger adults.This may be attributed to a poorer utilization of peripheral vision(i.e.,peripheral attentional allocation)and a higher reliance on central vision compared to younger adults.To test this,we examined the importance of central,peri-foveal and near periphery information in younger and older adults by comparing their visual search performance while their central vision was blocked,in the presence of different sized artificial central scotomas.We tested participants in two versions of visual search,pop-out and serial search,because they require a different use of central and peripheral attention.Pop-out search relies on processing of the entire visual scene(i.e.,global processing)whereas serial search requires processing of each feature serially(i.e.,local processing).Methods:Thirteen healthy younger(M=21.8,SD=1.5)and 15 older adults(M=69.1 years,SD=7.3)performed a pop-out and a serial version of a visual search task in the presence of different sized gaze-contingent artificial central scotomas(no scotoma,3°diameter,5°and 7°).Participants were asked to indicate as quickly as possible whether a target was present or not among distractors whose number varied(16,32 or 64 objects).Results:We found evidence for a greater decline in peripheral processing in older adults compared to younger in pop-out but not in serial search.For the pop-out condition with no scotoma,we found that the further the target in the periphery,the longer the search time,and that this increase was proportionally greater for older adults compared to younger adults.Further,increases in scotoma size were associated with a greater increase in reaction times for older adults compared to younger participants.For the serial condition,both groups showed similar increases in reaction times with target distance from center and scotoma size.We surmise that this may be due to task difficulty in serial search;central vision is necessary for both groups.Conclusions:In conclusion,these findings suggest that,in global processing,older adults distribute more resources towards central vision compared to younger adults.展开更多
In order to discuss the efficiency of visual search of the taekwondo athletes with different kinds of trait anxieties, this article has selected 30 taekwondo athletes with high trait anxieties and another 30 ones with...In order to discuss the efficiency of visual search of the taekwondo athletes with different kinds of trait anxieties, this article has selected 30 taekwondo athletes with high trait anxieties and another 30 ones with low trait anxieties as the testees so as to conduct respective investigations on their visual search reaction time and accuracy of reaction. The results show that the reaction time of individuals with high trait anxieties is significantly longer than that of the individuals with low trait anxieties; the reaction time under threatening stimuli is significantly longer than that under no conditions of threatening stimuli; the reaction accuracy rate of visual search reaction of taekwondo athletes under threatening stimuli is significantly lower than that under no threatening stimuli.展开更多
When using traditional image search engines, smartphone users often complain about their poor user interface including poor user experience, and weak interaction. Moreover, users are unable to find a desired picture p...When using traditional image search engines, smartphone users often complain about their poor user interface including poor user experience, and weak interaction. Moreover, users are unable to find a desired picture partly due to the unclear key words. This paper proposes the word-bag co-occurrence scheme by defining the correlation between images. Through exploratory search, the search range can be expanded and help users refine retrieval of the expected images. Firstly, the proposed scheme applied the bag of visual words (BoVW) vector by processing images on Hadoop. Secondly, similarity matrix was constructed to organize the image data. Finally, the images in which users were interested was visually displayed on the android mobile phone via exploratory search. Comparing the proposed method to current methods by testing with image data sets on ImageNet, the experimental results show that the former is superior to the latter on visual representation, and the proposed scheme can provide a better user experience.展开更多
Visual impairment is one of the major problems among people of all age groups across the globe.Visually Impaired Persons(VIPs)require help from others to carry out their day-to-day tasks.Since they experience several ...Visual impairment is one of the major problems among people of all age groups across the globe.Visually Impaired Persons(VIPs)require help from others to carry out their day-to-day tasks.Since they experience several problems in their daily lives,technical intervention can help them resolve the challenges.In this background,an automatic object detection tool is the need of the hour to empower VIPs with safe navigation.The recent advances in the Internet of Things(IoT)and Deep Learning(DL)techniques make it possible.The current study proposes IoT-assisted Transient Search Optimization with a Lightweight RetinaNetbased object detection(TSOLWR-ODVIP)model to help VIPs.The primary aim of the presented TSOLWR-ODVIP technique is to identify different objects surrounding VIPs and to convey the information via audio message to them.For data acquisition,IoT devices are used in this study.Then,the Lightweight RetinaNet(LWR)model is applied to detect objects accurately.Next,the TSO algorithm is employed for fine-tuning the hyperparameters involved in the LWR model.Finally,the Long Short-Term Memory(LSTM)model is exploited for classifying objects.The performance of the proposed TSOLWR-ODVIP technique was evaluated using a set of objects,and the results were examined under distinct aspects.The comparison study outcomes confirmed that the TSOLWR-ODVIP model could effectually detect and classify the objects,enhancing the quality of life of VIPs.展开更多
Mobile location-based services(MLBS)refer to services around geographic location data.Mobile terminals use wireless communication networks(or satellite positioning systems)to obtain users’geographic location coordina...Mobile location-based services(MLBS)refer to services around geographic location data.Mobile terminals use wireless communication networks(or satellite positioning systems)to obtain users’geographic location coordinate information based on spatial databases and integrate with other information to provide users with required location-related services.The development of systems based on MLBS has significance and practical value.In this paper a visualization management information system for personnel in major events based on microservices,namely MEPMIS,is designed and implemented by using MLBS.The system consists of a server and a client app,and it has some functions including map search and query,personnel positioning and scheduling,location management,messaging,and location service.Managers of the events can quickly search and locate the staff on the specific area of the map in real-time,and make broadcasting messages to the staff,and manage the staff.The client app is developed on the Android system,by which staff users can send the positions information to the server timely.The client users can search fuzzily near their peers and list their locations,and also call near peers through sending messages or query the history record of staff locations.In the design of the system,several new proposed techniques,including visual annotation method for overlapping locations,correcting trajectory drift algorithm,microservices-based overall system architecture methodology and other new techniques,which are applied to the implementation of the system.Also,HTML5,JQuery,MLBS APIs(Application Program Interfaces)related programming techniques have been used and combined with loading Ajax asynchronously and Json data encapsulation,map marker optimization techniques,that can improve the positioning accuracy and the performance of the system.The developed system with practical functions can enhance the efficiencies of the organization and management of major events.展开更多
基金supported by the National Key R&D Program of China(2017YFC1307500)the National Natural Science Foundation of China(31800900)+2 种基金the CAS-Iranian Vice presidency for Science and Technology Joint Research Project(172644KYSB20160175)Guangdong Innovative and Entrepreneurial Research Team Program(2014ZT05S020)Shenzhen Municipal Grants(KQJSCX20170731164702657,JCYJ20151030140325151,JCYJ20170413165053031,GJHZ20160229200136090,KQTD20140630180249366)
文摘Information flow between the prefrontal and visual cortices is critical for visual behaviors such as visual search. To investigate its mechanisms, we simultaneously recorded spike and local field potential (LFP) signals in the frontal eye field (FEF) and area V4 while monkeys performed a free-gaze visual search task. During free-gaze search, spike-LFP coherence between FEF and V4 was enhanced in the theta rhythm (4–8 Hz) but suppressed in the alpha rhythm (8–13 Hz). Cross-frequency couplings during the Cue period before the search phase were related to monkey performance, with higher FEF theta-V4 gamma coupling and lower FEF alpha-V4 gamma coupling associated with faster search. Finally, feature-based attention during search enhanced spike-LFP coherence between FEF and V4 in the gamma and beta rhythms, whereas overt spatial attention reduced coherence at frequencies up to 30 Hz. These results suggest that oscillatory coupling may play an important role in mediating interactions between the prefrontal and visual cortices during visual search.
基金supported by National Basic Research "(973") Program of China(2009CB320902)the Chinese National Nature Science Foundation (60902057)
文摘Visual search has been a long-standing problem in applications such as location recognition and product search. Much research has been done on image representation, matching, indexing, and retrieval. Key component technologies for visual search have been developed, and numerous real-world applications are emerging. To ensure application interoperability, the Moving Picture Experts Group (MPEG) has begun standardizing visuaJ search technologies and is developing the compact descriptors for visua) search (CDVS) standard. MPEG seeks to develop a collaborative platform for evaluating existing visual search technologies. Peking University has participated in this standardization since the 94th MPEG meeting, and significant progress has been made with the various proposals. A test model (TM) has been selected to determine the basic pipeline and key components of visual search. However, the first-version TM has high computational complexity and imperfect retrieval and matching. Core experiments have therefore been set up to improve TM. In this article, we summarize key technologies for visual search and report the progress of MPEG CDVS. We discuss Peking University' s efforts in CDVS and also discuss unresolved issues.
基金Supported by the Program for Technology Innovation Team of Ningbo Government (No. 2011B81002)the Ningbo University Science Research Foundation (No.xkl11075)
文摘This paper introduces an approach for visual tracking of multi-target with occlusion occurrence. Based on the author's previous work in which the Overlap Coefficient (OC) is used to detect the occlusion, in this paper a method of combining Bhattacharyya Coefficient (BC) and Kalman filter innovation term is proposed as the criteria for jointly detecting the occlusion occurrence. Fragmentation of target is introduced in order to closely monitor the occlusion development. In the course of occlusion, the Kalman predictor is applied to determine the location of the occluded target, and the criterion for checking the re-appearance of the occluded target is also presented. The proposed approach is put to test on a standard video sequence, suggesting the satisfactory performance in multi-target tracking.
文摘In this paper,two methods are proposed to embed visual watermark into direct binary search(DBS)halftone images,which are called Adjusted Direct Binary Search(ADBS)and Dual Adjusted Direct Binary Search(DADBS).DADBS is an improved version of ADBS.By using the proposed methods,the visual watermark will be embedded into two halftone images separately,thus,the watermark can be revealed when these two halftone images are overlaid.Experimental results show that both methods can achieve excellent image visual quality and decoded visual patterns.
文摘Background:Age related macular degeneration(AMD)is one of the main causes of vision loss in older adults,generating,in most cases,a central scotoma that reduces central visual acuity(Noble&Chaudhary,2010).People affected by AMD have to rely on peripheral visual information and would highly benefit from efficiently allocating their attention to the periphery.Indeed,attention can improve peripheral spatial resolution(Carrasco,Ling&Read,2004)and can be allocated to a certain expanse of space outside of the central visual span,known as the attentional span.Attentional span has been shown to be decreased in people with AMD with less attention allocated to the periphery and more to the central visual field(Cheong et al.,2008),however it remains unknown whether aging is also a contributing factor.Methods:Fourteen healthy younger(mean age=21.8 years,SD=1.5)and 8 older adults(mean age=69.6 years,SD=7.3)performed a pop-out and a serial version of a visual search task,in the presence of different sized gaze-contingent invisible and visible artificial central scotomata(no scotoma,3°diameter,5°and 7°).Participants were asked to indicate as quickly as possible whether a target was present or not among distractors whose number varied(16,32 or 64 objects).We wished to determine whether the size of the scotoma,occluding different degrees of central vision,affected visual search differently for younger vs.older participants.Results:Both the younger and older participants showed higher reaction times(RTs)to find the target for the serial version(M=2,074 ms for younger adults,M=3,853 ms for older adults)compared to the pop-out version(M=866 ms,M=1,475 ms,P<0.001)and for more distractors(32 distractors compared to 16,and 64 compared to 32,P<0.01).Older adults showed longer RTs than younger adults for both versions of the task(P<0.01).We found a significant effect of scotoma size on older adults(3°scotoma M=3,276 ms;7°scotoma M=3,877 ms,P<0.05),however,accurate performance was higher with no scotoma(96%vs.92%,P<0.05)in the pop-out search task.This suggests that older participants privileged a fast decision at the expense of performance in those cases.For the younger adults,RTs were higher in the serial search task in the presence of a scotoma(M=2,074 ms)compared to the control condition(M=1,665 ms,P>0.05).Conclusions:These results suggest that older adults take longer to perform visual search compared to younger adults and tend to use peripheral visual less than younger adults;larger central scotomas disrupted their performance but not that of younger participants,who performed equally well with different central scotoma sizes.These findings suggest that aging is a contributing factor in the decrease of the peripheral attentional span.
文摘Background:It has been suggested that older adults show a reduced attentional field compared to younger adults.This may be attributed to a poorer utilization of peripheral vision(i.e.,peripheral attentional allocation)and a higher reliance on central vision compared to younger adults.To test this,we examined the importance of central,peri-foveal and near periphery information in younger and older adults by comparing their visual search performance while their central vision was blocked,in the presence of different sized artificial central scotomas.We tested participants in two versions of visual search,pop-out and serial search,because they require a different use of central and peripheral attention.Pop-out search relies on processing of the entire visual scene(i.e.,global processing)whereas serial search requires processing of each feature serially(i.e.,local processing).Methods:Thirteen healthy younger(M=21.8,SD=1.5)and 15 older adults(M=69.1 years,SD=7.3)performed a pop-out and a serial version of a visual search task in the presence of different sized gaze-contingent artificial central scotomas(no scotoma,3°diameter,5°and 7°).Participants were asked to indicate as quickly as possible whether a target was present or not among distractors whose number varied(16,32 or 64 objects).Results:We found evidence for a greater decline in peripheral processing in older adults compared to younger in pop-out but not in serial search.For the pop-out condition with no scotoma,we found that the further the target in the periphery,the longer the search time,and that this increase was proportionally greater for older adults compared to younger adults.Further,increases in scotoma size were associated with a greater increase in reaction times for older adults compared to younger participants.For the serial condition,both groups showed similar increases in reaction times with target distance from center and scotoma size.We surmise that this may be due to task difficulty in serial search;central vision is necessary for both groups.Conclusions:In conclusion,these findings suggest that,in global processing,older adults distribute more resources towards central vision compared to younger adults.
文摘In order to discuss the efficiency of visual search of the taekwondo athletes with different kinds of trait anxieties, this article has selected 30 taekwondo athletes with high trait anxieties and another 30 ones with low trait anxieties as the testees so as to conduct respective investigations on their visual search reaction time and accuracy of reaction. The results show that the reaction time of individuals with high trait anxieties is significantly longer than that of the individuals with low trait anxieties; the reaction time under threatening stimuli is significantly longer than that under no conditions of threatening stimuli; the reaction accuracy rate of visual search reaction of taekwondo athletes under threatening stimuli is significantly lower than that under no threatening stimuli.
文摘When using traditional image search engines, smartphone users often complain about their poor user interface including poor user experience, and weak interaction. Moreover, users are unable to find a desired picture partly due to the unclear key words. This paper proposes the word-bag co-occurrence scheme by defining the correlation between images. Through exploratory search, the search range can be expanded and help users refine retrieval of the expected images. Firstly, the proposed scheme applied the bag of visual words (BoVW) vector by processing images on Hadoop. Secondly, similarity matrix was constructed to organize the image data. Finally, the images in which users were interested was visually displayed on the android mobile phone via exploratory search. Comparing the proposed method to current methods by testing with image data sets on ImageNet, the experimental results show that the former is superior to the latter on visual representation, and the proposed scheme can provide a better user experience.
基金The authors extend their appreciation to the King Salman center for Disability Research for funding this work through Research Group no KSRG-2022-030。
文摘Visual impairment is one of the major problems among people of all age groups across the globe.Visually Impaired Persons(VIPs)require help from others to carry out their day-to-day tasks.Since they experience several problems in their daily lives,technical intervention can help them resolve the challenges.In this background,an automatic object detection tool is the need of the hour to empower VIPs with safe navigation.The recent advances in the Internet of Things(IoT)and Deep Learning(DL)techniques make it possible.The current study proposes IoT-assisted Transient Search Optimization with a Lightweight RetinaNetbased object detection(TSOLWR-ODVIP)model to help VIPs.The primary aim of the presented TSOLWR-ODVIP technique is to identify different objects surrounding VIPs and to convey the information via audio message to them.For data acquisition,IoT devices are used in this study.Then,the Lightweight RetinaNet(LWR)model is applied to detect objects accurately.Next,the TSO algorithm is employed for fine-tuning the hyperparameters involved in the LWR model.Finally,the Long Short-Term Memory(LSTM)model is exploited for classifying objects.The performance of the proposed TSOLWR-ODVIP technique was evaluated using a set of objects,and the results were examined under distinct aspects.The comparison study outcomes confirmed that the TSOLWR-ODVIP model could effectually detect and classify the objects,enhancing the quality of life of VIPs.
基金The work is supported by the Tianjin Planning Project of Philosophy and Social Science under Grant No.TJGL20-018 for Dr.L.J.Hou of Tianjin Normal University,China。
文摘Mobile location-based services(MLBS)refer to services around geographic location data.Mobile terminals use wireless communication networks(or satellite positioning systems)to obtain users’geographic location coordinate information based on spatial databases and integrate with other information to provide users with required location-related services.The development of systems based on MLBS has significance and practical value.In this paper a visualization management information system for personnel in major events based on microservices,namely MEPMIS,is designed and implemented by using MLBS.The system consists of a server and a client app,and it has some functions including map search and query,personnel positioning and scheduling,location management,messaging,and location service.Managers of the events can quickly search and locate the staff on the specific area of the map in real-time,and make broadcasting messages to the staff,and manage the staff.The client app is developed on the Android system,by which staff users can send the positions information to the server timely.The client users can search fuzzily near their peers and list their locations,and also call near peers through sending messages or query the history record of staff locations.In the design of the system,several new proposed techniques,including visual annotation method for overlapping locations,correcting trajectory drift algorithm,microservices-based overall system architecture methodology and other new techniques,which are applied to the implementation of the system.Also,HTML5,JQuery,MLBS APIs(Application Program Interfaces)related programming techniques have been used and combined with loading Ajax asynchronously and Json data encapsulation,map marker optimization techniques,that can improve the positioning accuracy and the performance of the system.The developed system with practical functions can enhance the efficiencies of the organization and management of major events.