Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-u...Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-up attention and the object-driven, top-down attention, whereas the previous attention model has mostly focused on either the bottom-up or top-down attention. By the bottom-up component, the whole scene is segmented into the ground region and the salient regions. Guided by top-down strategy which is achieved by a topological graph, the object regions are separated from the salient regions. The salient regions except the object regions are the barrier regions. In order to estimate the model, a mobile robot platform is developed, on which some experiments are implemented. The experimental results indicate that processing an image with a resolution of 752 × 480 pixels takes less than 200 ms and the object regions are unabridged. The analysis obtained by comparing the proposed model with the existing model demonstrates that the proposed model has some advantages in robot object tracking in terms of speed and efficiency.展开更多
In this study, we explored the neural mechanism of global topological perception in the human visual system. We showed strong evidence that the retinotectal pathway in the archicortex of the human brain is responsible...In this study, we explored the neural mechanism of global topological perception in the human visual system. We showed strong evidence that the retinotectal pathway in the archicortex of the human brain is responsible for global topological perception, and for modulating the local feature processing in the classical ventral visual pathway. Inspired by this recent cognitive discovery,we developed a novel CogNet architecture to emulate the global-local dichotomy of human visual cognitive mechanisms. The thorough experimental results indicate that the proposed CogNet not only significantly improves image classification accuracies but also effectively addresses the texture bias problem observed in baseline CNN models. We have also conducted mathematical analysis for the generalization gap for general neural networks. Our theoretical derivations suggest that the Hurst parameter, a measure of the curvature of the loss landscape, can closely bind the generalization gap. A larger Hurst parameter corresponds to a better generalization ability. We found that our proposed CogNet achieves a lower test error and attains a larger Hurst parameter,strengthening its superiority over the baseline CNN models further.展开更多
基金supported by National Basic Research Program of China (973 Program) (No. 2006CB300407)National Natural Science Foundation of China (No. 50775017)
文摘Inspired by human behaviors, a robot object tracking model is proposed on the basis of visual attention mechanism, which is fit for the theory of topological perception. The model integrates the image-driven, bottom-up attention and the object-driven, top-down attention, whereas the previous attention model has mostly focused on either the bottom-up or top-down attention. By the bottom-up component, the whole scene is segmented into the ground region and the salient regions. Guided by top-down strategy which is achieved by a topological graph, the object regions are separated from the salient regions. The salient regions except the object regions are the barrier regions. In order to estimate the model, a mobile robot platform is developed, on which some experiments are implemented. The experimental results indicate that processing an image with a resolution of 752 × 480 pixels takes less than 200 ms and the object regions are unabridged. The analysis obtained by comparing the proposed model with the existing model demonstrates that the proposed model has some advantages in robot object tracking in terms of speed and efficiency.
基金supported by the National Key Research and Development Project of China (Grant No. 2020AAA0105600)the National Natural Science Foundation of China (Grant Nos. U21B2048 and 62276208)+1 种基金Shenzhen Key Technical Projects (Grant No. CJGJZD2022051714160501)the Chinese Academy of Sciences (Grant Nos. 2021091 and YSBR-068)。
文摘In this study, we explored the neural mechanism of global topological perception in the human visual system. We showed strong evidence that the retinotectal pathway in the archicortex of the human brain is responsible for global topological perception, and for modulating the local feature processing in the classical ventral visual pathway. Inspired by this recent cognitive discovery,we developed a novel CogNet architecture to emulate the global-local dichotomy of human visual cognitive mechanisms. The thorough experimental results indicate that the proposed CogNet not only significantly improves image classification accuracies but also effectively addresses the texture bias problem observed in baseline CNN models. We have also conducted mathematical analysis for the generalization gap for general neural networks. Our theoretical derivations suggest that the Hurst parameter, a measure of the curvature of the loss landscape, can closely bind the generalization gap. A larger Hurst parameter corresponds to a better generalization ability. We found that our proposed CogNet achieves a lower test error and attains a larger Hurst parameter,strengthening its superiority over the baseline CNN models further.