Binaural rendering is of great interest to virtual reality and immersive media. Although humans can naturally use their two ears to perceive the spatial information contained in sounds, it is a challenging task for ma...Binaural rendering is of great interest to virtual reality and immersive media. Although humans can naturally use their two ears to perceive the spatial information contained in sounds, it is a challenging task for machines to achieve binaural rendering since the description of a sound field often requires multiple channels and even the metadata of the sound sources. In addition, the perceived sound varies from person to person even in the same sound field. Previous methods generally rely on individual-dependent head-related transferred function(HRTF)datasets and optimization algorithms that act on HRTFs. In practical applications, there are two major drawbacks to existing methods. The first is a high personalization cost, as traditional methods achieve personalized needs by measuring HRTFs. The second is insufficient accuracy because the optimization goal of traditional methods is to retain another part of information that is more important in perception at the cost of discarding a part of the information. Therefore, it is desirable to develop novel techniques to achieve personalization and accuracy at a low cost. To this end, we focus on the binaural rendering of ambisonic and propose 1) channel-shared encoder and channel-compared attention integrated into neural networks and 2) a loss function quantifying interaural level differences to deal with spatial information. To verify the proposed method, we collect and release the first paired ambisonic-binaural dataset and introduce three metrics to evaluate the content information and spatial information accuracy of the end-to-end methods. Extensive experimental results on the collected dataset demonstrate the superior performance of the proposed method and the shortcomings of previous methods.展开更多
Three-dimensional surfaces are typically modeled as implicit surfaces.However,direct rendering of implicit surfaces is not simple,especially when such surfaces contain finely detailed shapes.One approach is ray-castin...Three-dimensional surfaces are typically modeled as implicit surfaces.However,direct rendering of implicit surfaces is not simple,especially when such surfaces contain finely detailed shapes.One approach is ray-casting,where the field of the implicit surface is assumed to be piecewise polynomials defined on the grid of a rectangular domain.A critical issue for direct rendering based on ray-casting is the computational cost of finding intersections between surfaces and rays.In particular,ray-casting requires many function evaluations along each ray,severely slowing the rendering speed.In this paper,a method is proposed to achieve direct rendering of polynomial-based implicit surfaces in real-time by strategically narrowing the search range and designing the shader to exploit the structure of piecewise polynomials.In experiments,the proposed method achieved a high framerate performance for different test cases,with a speed-up factor ranging from 1.1 to 218.2.In addition,the proposed method demonstrated better efficiency with high cell resolution.In terms of memory consumption,the proposed method saved between 90.94%and 99.64%in different test cases.Generally,the proposed method became more memory-efficient as the cell resolution increased.展开更多
With the development of virtual reality (VR) technology, more and more industries are beginning to integrate with VR technology. In response to the problem of not being able to directly render the lighting effect of C...With the development of virtual reality (VR) technology, more and more industries are beginning to integrate with VR technology. In response to the problem of not being able to directly render the lighting effect of Caideng in digital Caideng scenes, this article analyzes the lighting model. It combines it with the lighting effect of Caideng scenes to design an optimized lighting model algorithm that fuses the bidirectional transmission distribution function (BTDF) model. This algorithm can efficiently render the lighting effect of Caideng models in a virtual environment. And using image optimization processing methods, the immersive experience effect on the VR is enhanced. Finally, a Caideng roaming interactive system was designed based on this method. The results show that the frame rate of the system is stable during operation, maintained above 60 fps, and has a good immersive experience.展开更多
Ray casting algorithm can obtain a better quality image in volume rendering, however, it exists some problems, such as powerful computing capacity and slow rendering speed. How to improve the re-sampled speed is a key...Ray casting algorithm can obtain a better quality image in volume rendering, however, it exists some problems, such as powerful computing capacity and slow rendering speed. How to improve the re-sampled speed is a key to speed up the ray casting algorithm. An algorithm is introduced to reduce matrix computation by matrix transformation characteristics of re-sampling points in a two coordinate system. The projection of 3-D datasets on image plane is adopted to reduce the number of rays. Utilizing boundary box technique avoids the sampling in empty voxel. By extending the Bresenham algorithm to three dimensions, each re-sampling point is calculated. Experimental results show that a two to three-fold improvement in rendering speed using the optimized algorithm, and the similar image quality to traditional algorithm can be achieved. The optimized algorithm can produce the required quality images, thus reducing the total operations and speeding up the volume rendering.展开更多
总结VR立体视觉系统硬件建设的创新技术,提出了人工智能(Artificial Intelligence,AI)生成基于模型系统工程(Model-Based Systems Engineering,MBSE)数字模型VR视觉的创新方法。通过优选大屏幕式、小间距LED显示屏、主动立体显示硬件,...总结VR立体视觉系统硬件建设的创新技术,提出了人工智能(Artificial Intelligence,AI)生成基于模型系统工程(Model-Based Systems Engineering,MBSE)数字模型VR视觉的创新方法。通过优选大屏幕式、小间距LED显示屏、主动立体显示硬件,创新实现了集科研演示验证、会议讨论功能于一体的高分辨率、多通道立体同步、多通道立体拼接的VR视觉硬件系统,并通过数据驱动的AI算法,准确高效地生成MBSE系统工程各类数字模型的VR视觉,为VR系统的软、硬件建设提供技术借鉴。展开更多
基金supported in part by the National Natural Science Foundation of China (62176059, 62101136)。
文摘Binaural rendering is of great interest to virtual reality and immersive media. Although humans can naturally use their two ears to perceive the spatial information contained in sounds, it is a challenging task for machines to achieve binaural rendering since the description of a sound field often requires multiple channels and even the metadata of the sound sources. In addition, the perceived sound varies from person to person even in the same sound field. Previous methods generally rely on individual-dependent head-related transferred function(HRTF)datasets and optimization algorithms that act on HRTFs. In practical applications, there are two major drawbacks to existing methods. The first is a high personalization cost, as traditional methods achieve personalized needs by measuring HRTFs. The second is insufficient accuracy because the optimization goal of traditional methods is to retain another part of information that is more important in perception at the cost of discarding a part of the information. Therefore, it is desirable to develop novel techniques to achieve personalization and accuracy at a low cost. To this end, we focus on the binaural rendering of ambisonic and propose 1) channel-shared encoder and channel-compared attention integrated into neural networks and 2) a loss function quantifying interaural level differences to deal with spatial information. To verify the proposed method, we collect and release the first paired ambisonic-binaural dataset and introduce three metrics to evaluate the content information and spatial information accuracy of the end-to-end methods. Extensive experimental results on the collected dataset demonstrate the superior performance of the proposed method and the shortcomings of previous methods.
基金supported by JSPS KAKENHI Grant Number 21K11928。
文摘Three-dimensional surfaces are typically modeled as implicit surfaces.However,direct rendering of implicit surfaces is not simple,especially when such surfaces contain finely detailed shapes.One approach is ray-casting,where the field of the implicit surface is assumed to be piecewise polynomials defined on the grid of a rectangular domain.A critical issue for direct rendering based on ray-casting is the computational cost of finding intersections between surfaces and rays.In particular,ray-casting requires many function evaluations along each ray,severely slowing the rendering speed.In this paper,a method is proposed to achieve direct rendering of polynomial-based implicit surfaces in real-time by strategically narrowing the search range and designing the shader to exploit the structure of piecewise polynomials.In experiments,the proposed method achieved a high framerate performance for different test cases,with a speed-up factor ranging from 1.1 to 218.2.In addition,the proposed method demonstrated better efficiency with high cell resolution.In terms of memory consumption,the proposed method saved between 90.94%and 99.64%in different test cases.Generally,the proposed method became more memory-efficient as the cell resolution increased.
文摘With the development of virtual reality (VR) technology, more and more industries are beginning to integrate with VR technology. In response to the problem of not being able to directly render the lighting effect of Caideng in digital Caideng scenes, this article analyzes the lighting model. It combines it with the lighting effect of Caideng scenes to design an optimized lighting model algorithm that fuses the bidirectional transmission distribution function (BTDF) model. This algorithm can efficiently render the lighting effect of Caideng models in a virtual environment. And using image optimization processing methods, the immersive experience effect on the VR is enhanced. Finally, a Caideng roaming interactive system was designed based on this method. The results show that the frame rate of the system is stable during operation, maintained above 60 fps, and has a good immersive experience.
文摘Ray casting algorithm can obtain a better quality image in volume rendering, however, it exists some problems, such as powerful computing capacity and slow rendering speed. How to improve the re-sampled speed is a key to speed up the ray casting algorithm. An algorithm is introduced to reduce matrix computation by matrix transformation characteristics of re-sampling points in a two coordinate system. The projection of 3-D datasets on image plane is adopted to reduce the number of rays. Utilizing boundary box technique avoids the sampling in empty voxel. By extending the Bresenham algorithm to three dimensions, each re-sampling point is calculated. Experimental results show that a two to three-fold improvement in rendering speed using the optimized algorithm, and the similar image quality to traditional algorithm can be achieved. The optimized algorithm can produce the required quality images, thus reducing the total operations and speeding up the volume rendering.
文摘总结VR立体视觉系统硬件建设的创新技术,提出了人工智能(Artificial Intelligence,AI)生成基于模型系统工程(Model-Based Systems Engineering,MBSE)数字模型VR视觉的创新方法。通过优选大屏幕式、小间距LED显示屏、主动立体显示硬件,创新实现了集科研演示验证、会议讨论功能于一体的高分辨率、多通道立体同步、多通道立体拼接的VR视觉硬件系统,并通过数据驱动的AI算法,准确高效地生成MBSE系统工程各类数字模型的VR视觉,为VR系统的软、硬件建设提供技术借鉴。