A GPU-Enabled Real-Time Framework for Compressing and Rendering Volumetric Videos

Dongxiao Yu, Ruopeng Chen, Xin Li, Mengbai Xiao, Guanghui Zhang, Yao Liu

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Nowadays, volumetric videos have emerged as an attractive multimedia application providing highly immersive watching experiences since viewers could adjust their viewports at 6 degrees-of-freedom. However, the point cloud frames composing the video are prohibitively large, and effective compression techniques should be developed. There are two classes of compression methods. One suggests exploiting the conventional video codecs (2D-based methods) and the other proposes to compress the points in 3D space directly (3D-based methods). Though the 3D-based methods feature fast coding speeds, their compression ratios are low since the failure of leveraging inter-frame redundancy. To resolve this problem, we design a patch-wise compression framework working in the 3D space. Specifically, we search rigid moves of patches via the iterative closest point algorithm and construct a common geometric structure, which is followed by color compensation. We implement our decoder on a GPU platform so that real-time decoding and rendering are realized. We compare our method with GROOT, the state-of-the-art 3D-based compression method, and it reduces the bitrate by up to 5.98×. Moreover, by trimming invisible content, our scheme achieves comparable bandwidth demand of V-PCC, the representative 2D-based method, in FoV-adaptive streaming.

Original languageEnglish (US)
Pages (from-to)789-800
Number of pages12
JournalIEEE Transactions on Computers
Volume73
Issue number3
DOIs
StatePublished - Mar 1 2024

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computational Theory and Mathematics

Keywords

  • Volumetric video
  • point cloud compression

Fingerprint

Dive into the research topics of 'A GPU-Enabled Real-Time Framework for Compressing and Rendering Volumetric Videos'. Together they form a unique fingerprint.

Cite this