Contact Person
Daniel Silhavy
Daniel Silhavy
Project Manager
Business Unit FAME
+49 30 3463-7680
Newsletter subscription
Please enter your e-mail address to subscribe to our newsletter.                     
 
E-Mail*
Title
First Name
Last Name
Company



The protection of your personal data is very important to us. Please refer to our data protection .

FAMIUM Deep Encode

What is it about?

Video streaming content differs in terms of complexity and requires title-specific encoding settings to achieve a certain visual quality. A classic “one-fits-all” encoding ladder ignores this characteristics and applies the same encoding settings to all video files. In the worst case this leads to a waste of bandwidth and storage, quality impairments, and a bad user experience. Our title-based encoding solution has the potential to significantly decrease the storage and delivery costs of video streams while improving the perceptual quality.

Title based encoding

The efficiency of a video-codec is closely linked to the spatial and temporal redundancy of the input video. A video that contains lots of movement and scene changes (e.g sports and action movies) is much “harder” to encode, than a video in which most parts are redundant or slowly changing over time (e.g animated movies). Hence, different types of content require different bitrate settings to achieve a certain quality. As a result, applying a title-based encoding-ladder introduces major advantages compared to the classic approach of a “one-fits-all” encoding ladder.

Standard per-title encoding solutions identify the optimal encoding settings for a single asset in a complexity analysis step, prior to the actual production encodes. The most common approach to determine this complexity is by performing multiple test encodes. Based on the resulting bitrate/quality (PSNR, SSIM,VMAF) value pairs, content specific bitrate/resolution ladders are derived and applied. The major downside of such a solution comes with the fact that the test encodes are very computational heavy and not applicable in a live video streaming scenario.

FAMIUM Deep Encode

The FAMIUM Deep Encode tool leverages different machine learning techniques to avoid the computational heavy test encodes. Examples of algorithms in this area include decision trees ( e. g. through Boosted Decision Trees) and Deep Neural Networks (through Recurrent Neural Networks or Deep Convolutional Networks). By constant validation and retraining of our models the accuracy and efficiency of the algorithms are improved with each input and the entire system learns on its own.

This image has no alt text.
Fraunhofer FOKUS
This image has no alt text.
Fraunhofer FOKUS

Key facts

  • Title-based encoding solution for live and VoD content leveraging  machine learning techniques like Decision Trees and Deep Neural Networks
  • Constant validation and retraining of the machine learning models to improve the accuracy and efficiency of the algorithms
  • Calculation of video quality metrics like PSNR, SSIM and VMAF
  • Detection of complex metadata like scene changes, object classification and labeling for each video
  • Codec agnostic approach, can be applied to H.264, H.265, VP9, AV1 etc.

Related Projects