Fractional motion estimation(FME) improves the video encoding efficiency significantly. However, its high computational complexity limits the real-time processing capability. Therefore, it is a key problem to reduce t...Fractional motion estimation(FME) improves the video encoding efficiency significantly. However, its high computational complexity limits the real-time processing capability. Therefore, it is a key problem to reduce the implementation complexity of FME, especially in hardware design. This paper presents a novel deeply pipelined interpolation architecture of FME for the real-time realization of H.265/HEVC full Ultra-HD video encoder. First, a pipelined interpolation architecture together with an elegant processing order is proposed to deal with different search positions in parallel without pipeline stall and data conflict. Second, interpolation results sharing strategies are exploited among search positions to reduce the memory cost. Finally, the structure of the interpolation filter is further optimized for an area efficient implementation. As a result, the proposed design costs 41 917 slice LUTs on the Xilinx Kintex-7 FPGA platform with a 308 MHz working frequency. The measured throughput reaches a record of 1.238 Gpixels/s, which is sufficient for the real-time encoding of 8192×4320@ 30 fps video.展开更多
基金Supported by the Zhejiang Provincial Natural Science Foundation of China(No.LQ15F010001,LY16F020029)the General Research Project of Zhejiang Provincial Education Department(No.Y201430479)
文摘Fractional motion estimation(FME) improves the video encoding efficiency significantly. However, its high computational complexity limits the real-time processing capability. Therefore, it is a key problem to reduce the implementation complexity of FME, especially in hardware design. This paper presents a novel deeply pipelined interpolation architecture of FME for the real-time realization of H.265/HEVC full Ultra-HD video encoder. First, a pipelined interpolation architecture together with an elegant processing order is proposed to deal with different search positions in parallel without pipeline stall and data conflict. Second, interpolation results sharing strategies are exploited among search positions to reduce the memory cost. Finally, the structure of the interpolation filter is further optimized for an area efficient implementation. As a result, the proposed design costs 41 917 slice LUTs on the Xilinx Kintex-7 FPGA platform with a 308 MHz working frequency. The measured throughput reaches a record of 1.238 Gpixels/s, which is sufficient for the real-time encoding of 8192×4320@ 30 fps video.
文摘针对室内全球导航卫星系统(Global navigation satellite system,GNSS)信号受遮挡时,农用车辆协同定位精度低、稳定性差、信号丢包等问题,本文开展面向超宽带(Ultra-wideband,UWB)调频技术的室内外农用车辆协同定位算法研究。首先,搭建三基站多边测距定位模型,实现主基站绝对位置标定以及辅助基站绝对位置坐标的变换求解;其次,提出全质心加权最小二乘的高速双边双向(Weighted least squares high double sided two-way ranging,WLS-HDS-TWR)农机协同定位算法,基于泰勒级数展开的WLS估计算法,求解主车位置。同时,提出面向室内环境的多状态基站组合的UWB定位模块布设模式,并验证其可行性;通过飞行时间法(Time of flight,TOF)获取主从车距离信息,融合GNSS标定位置信息、主车坐标信息以及测距信息,实现主从车协同定位。最后,基于Prescan/Simulink搭建联合仿真平台,验证提出算法的可靠性;通过农用履带车辆开展室内及室外协同定位实车试验,试验结果表明:全质心WLS-HDS-TWR协同定位算法可有效解决室内GNSS信号缺失问题,室内环境下,定位精度较HDS-TWR及全质心LS-HDS-TWR算法分别提高26.98%和22.03%,满足智能农机协同定位作业需求。