Deep learning-based intra prediction mode decision for HEVC

Thorsten Laude; Jörn Ostermann

doi:10.1109/pcs.2016.7906399

Details

Original language	English
Title of host publication	2016 Picture Coding Symposium
Subtitle of host publication	PCS 2016
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (electronic)	9781509059669
Publication status	Published - Apr 2017
Event	2016 Picture Coding Symposium, PCS 2016 - Nuremberg, Germany Duration: 4 Dec 2016 → 7 Dec 2016

Publication series

Name	2016 Picture Coding Symposium, PCS 2016

Abstract

The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

ASJC Scopus subject areas

Engineering(all)
Media Technology
Computer Science(all)
Signal Processing

Cite this

Deep learning-based intra prediction mode decision for HEVC. / Laude, Thorsten; Ostermann, Jörn.
2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. 7906399 (2016 Picture Coding Symposium, PCS 2016).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Laude, T & Ostermann, J 2017, Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016., 7906399, 2016 Picture Coding Symposium, PCS 2016, Institute of Electrical and Electronics Engineers Inc., 2016 Picture Coding Symposium, PCS 2016, Nuremberg, Germany, 4 Dec 2016. https://doi.org/10.1109/pcs.2016.7906399

Laude, T., & Ostermann, J. (2017). Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016 Article 7906399 (2016 Picture Coding Symposium, PCS 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/pcs.2016.7906399

Laude T, Ostermann J. Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc. 2017. 7906399. (2016 Picture Coding Symposium, PCS 2016). doi: 10.1109/pcs.2016.7906399

Laude, Thorsten ; Ostermann, Jörn. / Deep learning-based intra prediction mode decision for HEVC. 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. (2016 Picture Coding Symposium, PCS 2016).

Download

@inproceedings{5a9261662afa4a07a64f5abfd5c5d548,

title = "Deep learning-based intra prediction mode decision for HEVC",

abstract = "The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.",

author = "Thorsten Laude and J{\"o}rn Ostermann",

year = "2017",

month = apr,

doi = "10.1109/pcs.2016.7906399",

language = "English",

series = "2016 Picture Coding Symposium, PCS 2016",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2016 Picture Coding Symposium",

address = "United States",

note = "2016 Picture Coding Symposium, PCS 2016 ; Conference date: 04-12-2016 Through 07-12-2016",

}

Download

TY - GEN

T1 - Deep learning-based intra prediction mode decision for HEVC

AU - Laude, Thorsten

AU - Ostermann, Jörn

PY - 2017/4

Y1 - 2017/4

N2 - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

AB - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

UR - http://www.scopus.com/inward/record.url?scp=85019423425&partnerID=8YFLogxK

U2 - 10.1109/pcs.2016.7906399

DO - 10.1109/pcs.2016.7906399

M3 - Conference contribution

AN - SCOPUS:85019423425

T3 - 2016 Picture Coding Symposium, PCS 2016

BT - 2016 Picture Coding Symposium

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2016 Picture Coding Symposium, PCS 2016

Y2 - 4 December 2016 through 7 December 2016

ER -

Research@Leibniz University

Deep learning-based intra prediction mode decision for HEVC

Authors

Research Organisations

Details

Publication series

Abstract

ASJC Scopus subject areas

Cite this

By the same author(s)

Wire Break Detection in Hybrid Towers of Wind Turbines: A Novel Application to Monitor Tendons Using Acoustic Emission Analysis

Quantized Inverse Design for Photonic Integrated Circuits

Pruning-Aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay

A flexible framework for large-scale FDTD simulations: open-source inverse design for 3D nanostructures

Inverse design of robust out-of-plane coupling elements

Wire Break Detection in Hybrid Towers of Wind Turbines: A Novel Application to Monitor Tendons Using Acoustic Emission Analysis

Quantized Inverse Design for Photonic Integrated Circuits

Pruning-Aware Loss Functions for STOI-Optimized Pruned Recurrent Autoencoders for the Compression of the Stimulation Patterns of Cochlear Implants at Zero Delay

A flexible framework for large-scale FDTD simulations: open-source inverse design for 3D nanostructures

Inverse design of robust out-of-plane coupling elements

Wire Break Detection in Hybrid Towers of Wind Turbines: A Novel Application to Monitor Tendons Using Acoustic Emission Analysis