RABBIT: Live Transcoding of V-PCC Point Cloud Streams

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Michael Rudolph
  • Stefan Schneegass
  • Amr Rizk

External Research Organisations

  • University of Duisburg-Essen
View graph of relations

Details

Original languageEnglish
Title of host publicationMMSys '23
Subtitle of host publicationProceedings of the 14th ACM Multimedia Systems Conference
PublisherAssociation for Computing Machinery (ACM)
Pages97-107
Number of pages11
ISBN (electronic)9798400701481
Publication statusPublished - 7 Jun 2023
Externally publishedYes

Abstract

Point clouds are a mature representation format for volumetric objects in 6 degrees-of-freedom multimedia streaming. To handle the massive size of point cloud data for visually satisfying immersive media, MPEG standardized Video-based Point Cloud Compression (V-PCC), leveraging existing video codecs to achieve high compression ratios. A major challenge of V-PCC is the high encoding latency, which results in fallback solutions that exchange the compression ratio for faster point cloud codecs. This encoding effort rises significantly in adaptive streaming systems, where heterogeneous user requirements translate into a set of quality representations of the media.

In this paper, we show that given one high quality media representation we can achieve live transcoding of video-based compressed point clouds to serve heterogeneous user quality requirements in real time. This stands in contrast to the slow, baseline transcoding that reconstructs and re-encodes the raw point cloud at a new quality setting. To address the high latency when employing the decoder-encoder stack of V-PCC during transcoding, we propose RABBIT, a novel technique that only re-encodes the underlying video sub-streams. This eliminates the overhead of the baseline decoding-encoding approach and decreases the latency further by applying optimized video codecs. We perform extensive evaluation of RABBIT in combination with different video codecs, showing on-par quality with the baseline V-PCC transcoding. Using a hardware-Accelerated video codec we demonstrate live transcoding performance of RABBIT and finally present a trade-off between rate, distortion and transcoding latency.

Keywords

    6DOF, adaptive streaming, point cloud, virtual reality

ASJC Scopus subject areas

Cite this

RABBIT: Live Transcoding of V-PCC Point Cloud Streams. / Rudolph, Michael; Schneegass, Stefan; Rizk, Amr.
MMSys '23: Proceedings of the 14th ACM Multimedia Systems Conference. Association for Computing Machinery (ACM), 2023. p. 97-107.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Rudolph, M, Schneegass, S & Rizk, A 2023, RABBIT: Live Transcoding of V-PCC Point Cloud Streams. in MMSys '23: Proceedings of the 14th ACM Multimedia Systems Conference. Association for Computing Machinery (ACM), pp. 97-107. https://doi.org/10.1145/3587819.3590978
Rudolph, M., Schneegass, S., & Rizk, A. (2023). RABBIT: Live Transcoding of V-PCC Point Cloud Streams. In MMSys '23: Proceedings of the 14th ACM Multimedia Systems Conference (pp. 97-107). Association for Computing Machinery (ACM). https://doi.org/10.1145/3587819.3590978
Rudolph M, Schneegass S, Rizk A. RABBIT: Live Transcoding of V-PCC Point Cloud Streams. In MMSys '23: Proceedings of the 14th ACM Multimedia Systems Conference. Association for Computing Machinery (ACM). 2023. p. 97-107 doi: 10.1145/3587819.3590978
Rudolph, Michael ; Schneegass, Stefan ; Rizk, Amr. / RABBIT : Live Transcoding of V-PCC Point Cloud Streams. MMSys '23: Proceedings of the 14th ACM Multimedia Systems Conference. Association for Computing Machinery (ACM), 2023. pp. 97-107
Download
@inproceedings{3c66efd356e7482f99cfc0e523c0444b,
title = "RABBIT: Live Transcoding of V-PCC Point Cloud Streams",
abstract = "Point clouds are a mature representation format for volumetric objects in 6 degrees-of-freedom multimedia streaming. To handle the massive size of point cloud data for visually satisfying immersive media, MPEG standardized Video-based Point Cloud Compression (V-PCC), leveraging existing video codecs to achieve high compression ratios. A major challenge of V-PCC is the high encoding latency, which results in fallback solutions that exchange the compression ratio for faster point cloud codecs. This encoding effort rises significantly in adaptive streaming systems, where heterogeneous user requirements translate into a set of quality representations of the media.In this paper, we show that given one high quality media representation we can achieve live transcoding of video-based compressed point clouds to serve heterogeneous user quality requirements in real time. This stands in contrast to the slow, baseline transcoding that reconstructs and re-encodes the raw point cloud at a new quality setting. To address the high latency when employing the decoder-encoder stack of V-PCC during transcoding, we propose RABBIT, a novel technique that only re-encodes the underlying video sub-streams. This eliminates the overhead of the baseline decoding-encoding approach and decreases the latency further by applying optimized video codecs. We perform extensive evaluation of RABBIT in combination with different video codecs, showing on-par quality with the baseline V-PCC transcoding. Using a hardware-Accelerated video codec we demonstrate live transcoding performance of RABBIT and finally present a trade-off between rate, distortion and transcoding latency.",
keywords = "6DOF, adaptive streaming, point cloud, virtual reality",
author = "Michael Rudolph and Stefan Schneegass and Amr Rizk",
note = "Publisher Copyright: {\textcopyright} 2023 ACM.",
year = "2023",
month = jun,
day = "7",
doi = "10.1145/3587819.3590978",
language = "English",
pages = "97--107",
booktitle = "MMSys '23",
publisher = "Association for Computing Machinery (ACM)",
address = "United States",

}

Download

TY - GEN

T1 - RABBIT

T2 - Live Transcoding of V-PCC Point Cloud Streams

AU - Rudolph, Michael

AU - Schneegass, Stefan

AU - Rizk, Amr

N1 - Publisher Copyright: © 2023 ACM.

PY - 2023/6/7

Y1 - 2023/6/7

N2 - Point clouds are a mature representation format for volumetric objects in 6 degrees-of-freedom multimedia streaming. To handle the massive size of point cloud data for visually satisfying immersive media, MPEG standardized Video-based Point Cloud Compression (V-PCC), leveraging existing video codecs to achieve high compression ratios. A major challenge of V-PCC is the high encoding latency, which results in fallback solutions that exchange the compression ratio for faster point cloud codecs. This encoding effort rises significantly in adaptive streaming systems, where heterogeneous user requirements translate into a set of quality representations of the media.In this paper, we show that given one high quality media representation we can achieve live transcoding of video-based compressed point clouds to serve heterogeneous user quality requirements in real time. This stands in contrast to the slow, baseline transcoding that reconstructs and re-encodes the raw point cloud at a new quality setting. To address the high latency when employing the decoder-encoder stack of V-PCC during transcoding, we propose RABBIT, a novel technique that only re-encodes the underlying video sub-streams. This eliminates the overhead of the baseline decoding-encoding approach and decreases the latency further by applying optimized video codecs. We perform extensive evaluation of RABBIT in combination with different video codecs, showing on-par quality with the baseline V-PCC transcoding. Using a hardware-Accelerated video codec we demonstrate live transcoding performance of RABBIT and finally present a trade-off between rate, distortion and transcoding latency.

AB - Point clouds are a mature representation format for volumetric objects in 6 degrees-of-freedom multimedia streaming. To handle the massive size of point cloud data for visually satisfying immersive media, MPEG standardized Video-based Point Cloud Compression (V-PCC), leveraging existing video codecs to achieve high compression ratios. A major challenge of V-PCC is the high encoding latency, which results in fallback solutions that exchange the compression ratio for faster point cloud codecs. This encoding effort rises significantly in adaptive streaming systems, where heterogeneous user requirements translate into a set of quality representations of the media.In this paper, we show that given one high quality media representation we can achieve live transcoding of video-based compressed point clouds to serve heterogeneous user quality requirements in real time. This stands in contrast to the slow, baseline transcoding that reconstructs and re-encodes the raw point cloud at a new quality setting. To address the high latency when employing the decoder-encoder stack of V-PCC during transcoding, we propose RABBIT, a novel technique that only re-encodes the underlying video sub-streams. This eliminates the overhead of the baseline decoding-encoding approach and decreases the latency further by applying optimized video codecs. We perform extensive evaluation of RABBIT in combination with different video codecs, showing on-par quality with the baseline V-PCC transcoding. Using a hardware-Accelerated video codec we demonstrate live transcoding performance of RABBIT and finally present a trade-off between rate, distortion and transcoding latency.

KW - 6DOF

KW - adaptive streaming

KW - point cloud

KW - virtual reality

UR - http://www.scopus.com/inward/record.url?scp=85163679403&partnerID=8YFLogxK

U2 - 10.1145/3587819.3590978

DO - 10.1145/3587819.3590978

M3 - Conference contribution

SP - 97

EP - 107

BT - MMSys '23

PB - Association for Computing Machinery (ACM)

ER -