Loading [MathJax]/extensions/tex2jax.js

Deep learning-based intra prediction mode decision for HEVC

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

Details

Original languageEnglish
Title of host publication2016 Picture Coding Symposium
Subtitle of host publicationPCS 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (electronic)9781509059669
Publication statusPublished - Apr 2017
Event2016 Picture Coding Symposium, PCS 2016 - Nuremberg, Germany
Duration: 4 Dec 20167 Dec 2016

Publication series

Name2016 Picture Coding Symposium, PCS 2016

Abstract

The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

ASJC Scopus subject areas

Cite this

Deep learning-based intra prediction mode decision for HEVC. / Laude, Thorsten; Ostermann, Jörn.
2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. 7906399 (2016 Picture Coding Symposium, PCS 2016).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Laude, T & Ostermann, J 2017, Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016., 7906399, 2016 Picture Coding Symposium, PCS 2016, Institute of Electrical and Electronics Engineers Inc., 2016 Picture Coding Symposium, PCS 2016, Nuremberg, Germany, 4 Dec 2016. https://doi.org/10.1109/pcs.2016.7906399
Laude, T., & Ostermann, J. (2017). Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016 Article 7906399 (2016 Picture Coding Symposium, PCS 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/pcs.2016.7906399
Laude T, Ostermann J. Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc. 2017. 7906399. (2016 Picture Coding Symposium, PCS 2016). doi: 10.1109/pcs.2016.7906399
Laude, Thorsten ; Ostermann, Jörn. / Deep learning-based intra prediction mode decision for HEVC. 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. (2016 Picture Coding Symposium, PCS 2016).
Download
@inproceedings{5a9261662afa4a07a64f5abfd5c5d548,
title = "Deep learning-based intra prediction mode decision for HEVC",
abstract = "The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.",
author = "Thorsten Laude and J{\"o}rn Ostermann",
year = "2017",
month = apr,
doi = "10.1109/pcs.2016.7906399",
language = "English",
series = "2016 Picture Coding Symposium, PCS 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2016 Picture Coding Symposium",
address = "United States",
note = "2016 Picture Coding Symposium, PCS 2016 ; Conference date: 04-12-2016 Through 07-12-2016",

}

Download

TY - GEN

T1 - Deep learning-based intra prediction mode decision for HEVC

AU - Laude, Thorsten

AU - Ostermann, Jörn

PY - 2017/4

Y1 - 2017/4

N2 - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

AB - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

UR - http://www.scopus.com/inward/record.url?scp=85019423425&partnerID=8YFLogxK

U2 - 10.1109/pcs.2016.7906399

DO - 10.1109/pcs.2016.7906399

M3 - Conference contribution

AN - SCOPUS:85019423425

T3 - 2016 Picture Coding Symposium, PCS 2016

BT - 2016 Picture Coding Symposium

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2016 Picture Coding Symposium, PCS 2016

Y2 - 4 December 2016 through 7 December 2016

ER -

By the same author(s)