- Lo
- Il
- Ma
Lounes Douar, Ilia Des, Matthew Lowe and others are enlisted in the challenge
Pre- and post-processing for improving video compression
Access to detailed topic description, code resources and dataThe objective of this task is to design a pair of pre-processing and post-processing filters to enhance the performance of an HEVC/h.265 [1] video codec, as illustrated in Fig. 1. Performance improvement is defined as either reducing the encoded video stream size while maintaining reconstruction quality, or enhancing reconstruction quality while keeping the stream size constant or a combination of both.
Fig.1: Block diagram of the video compression framework
To facilitate development, we provide a Python-based video compression framework, a test dataset, and benchmarking tools for performance evaluation. The framework includes placeholder modules for the pre- and post-processing filters, which participants are expected to replace with their own implementations.
In addition, the framework supports the transmission of metadata—arbitrary side information that can be embedded into the stream to aid post-processing. Importantly, the size of the metadata is counted toward the total stream size and must be considered when assessing overall performance.
Submissions will be evaluated based on the achieved BD-Rate [2], which reflects both the stream size and the quality of the decoded images.
Participants are free to adopt any approach, whether classical or learning-based. We encourage applicants to consult recent publications in top-tier conferences—such as CVPR and others for ideas and inspiration. According to recent studies [3], combining pre- and post-processing filters can lead to compression performance gains of 10% to 30%.
Provided input:
Expected output:
Evaluation:
Additional information:
The framework utilizes the HEVC/h.265 codec from the FFmpeg library to compress video frames. Participants are not allowed to modify the codec itself; only the pre-processing and post-processing modules may be altered. To assist participants in getting started, the following example strategies may serve as inspiration:
Reference:
[1] High Efficiency Video Coding, https://en.wikipedia.org/wiki/High_Efficiency_Video_Coding
[2] Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations. http://dx.doi.org/10.13140/RG.2.2.18622.05444
[3] Perceptual Video Compression with Neural Wrapping, https://cvpr.thecvf.com/virtual/2025/poster/34427
[4] YUV, https://en.wikipedia.org/wiki/Y%E2%80%B2UV
[5] Chroma Subsampling, https://en.wikipedia.org/wiki/Chroma_subsampling
[6] Peak signal-to-noise ratio, https://en.wikipedia.org/wiki/Peak_signal-to-noise_ratio
[7] Wang, Z.; Simoncelli, E.P.; Bovik, A.C. (2003-11-01). "Multiscale structural similarity for image quality assessment". The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, 2003. Vol. 2. pp. 1398–1402 Vol.2. https://doi.org/10.1109/ACSSC.2003.1292216
[8] Video Multimethod Assessment Fusion https://en.wikipedia.org/wiki/Video_Multimethod_Assessment_Fusion
[9] Adaptive Resolution Change for Versatile Video Coding, https://ieeexplore.ieee.org/document/9301762
[10] AOMedia Film Grain Synthesis 1, https://norkin.org/research/afgs1/
[11] Luma Mapping with Chroma Scalling in Versatile Video Coding, https://sigport.org/sites/default/files/docs/DCC2020_ID189_LMCS_present.pdf
The competition consists of two evaluation parts:
All teams are required to submit a final project approach by October 15, 2025.
The final deliverable will be evaluated by Huawei experts based on technical quality, innovation, and completeness.
This part contributes 80% of the total score.