Funding: supported by the National Natural Science Foundation of China (61772417, 61602377, 61634004, 61802304, 61834005), the Shaanxi Provincial Science and Technology Co-ordination Innovation Project (2016KTZDGY0204-02), and the Shaanxi Provincial Key Research and Development Program (2017GY-060).
Abstract: Real-time graphics processing has long been a crucial task for mobile devices, and it is conventionally supported by programmable graphics processing units (GPUs). These GPUs are designed to support vertex and pixel processing flexibly, using classic techniques such as on-chip caches and dynamic programmable pipelining. However, with separate vertex and pixel shaders it is difficult to achieve high utilization of hardware resources, even when the ratio of vertex processors to pixel processors is chosen to balance the load. In this paper, a unified render shader based on a very long instruction word (VLIW) processor is designed. The viewport transformation algorithm and the mipmap mapping algorithm are each mapped onto the shader, with the aim of providing an energy-efficient and flexible hardware platform for graphics processing on mobile devices. The implementation runs at up to 134 MHz on a Xilinx XC7Z045-2-FFG900 field-programmable gate array (FPGA), and the unified-architecture shader achieves a pixel fill rate of 134 Mpixels/s and a texel fill rate of 546 Mtexels/s.
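For readers unfamiliar with the two algorithms named in the abstract, the following is a minimal software sketch of a viewport transformation and a mipmap level selection. It assumes the common OpenGL-style formulation (normalized device coordinates in [-1, 1], level of detail taken as the base-2 logarithm of the larger screen-space texel footprint); the abstract does not describe how the paper maps these computations onto the VLIW shader. The function names `viewport_transform` and `mip_level` and all parameters are hypothetical and chosen only for illustration.

```python
import math


def viewport_transform(ndc, x0, y0, width, height, near=0.0, far=1.0):
    """Map a point from normalized device coordinates to window coordinates.

    Assumes OpenGL-style NDC, where x, y and z each lie in [-1, 1].
    (x0, y0) is the viewport origin and [near, far] is the depth range.
    """
    x, y, z = ndc
    xw = x0 + (x + 1.0) * 0.5 * width
    yw = y0 + (y + 1.0) * 0.5 * height
    zw = near + (z + 1.0) * 0.5 * (far - near)
    return (xw, yw, zw)


def mip_level(dudx, dvdx, dudy, dvdy, tex_w, tex_h, num_levels):
    """Pick a (fractional) mipmap level from screen-space UV derivatives.

    The derivatives are in normalized texture coordinates per pixel; they
    are scaled to texel units, and the longer of the two footprint vectors
    determines the level. The result is clamped to [0, num_levels - 1].
    """
    len_x = math.hypot(dudx * tex_w, dvdx * tex_h)  # footprint along screen x
    len_y = math.hypot(dudy * tex_w, dvdy * tex_h)  # footprint along screen y
    rho = max(len_x, len_y)
    if rho <= 1.0:
        # Magnification or 1:1 mapping: use the base level.
        return 0.0
    return min(math.log2(rho), float(num_levels - 1))


if __name__ == "__main__":
    # Centre of NDC space lands at the centre of a 640x480 viewport.
    print(viewport_transform((0.0, 0.0, 0.0), 0, 0, 640, 480))
    # One pixel step covers two texels of a 256x256 texture.
    print(mip_level(2 / 256, 0.0, 0.0, 2 / 256, 256, 256, 9))
```

In hardware, the multiply-add structure of the viewport transformation and the logarithm-and-clamp step of the level selection are the kind of independent operations a VLIW instruction can bundle, although the actual instruction scheduling used in the paper is not given in the abstract.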