main e93f882b3ab6 cached
1 files
48.5 KB
16.0k tokens
1 requests
Download .txt
Repository: ppingzhang/Awesome-Deep-Learning-Based-Video-Compression
Branch: main
Commit: e93f882b3ab6
Files: 1
Total size: 48.5 KB

Directory structure:
gitextract_de1i0u7m/

└── README.md

================================================
FILE CONTENTS
================================================

================================================
FILE: README.md
================================================

# <p align=center> Awesome 🎉Deep Learning Based Video Compression </p>
<!--# <p align=center>`# Awesome 🎉Deep Learning Based Video Compression🎉`</p>-->

[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) ![Visitors](https://visitor-badge.glitch.me/badge?page_id=ppingzhang/Awesome-Deep-Learning-Based-Video-Compression) ![GitHub stars](https://img.shields.io/github/stars/ppingzhang/Awesome-Deep-Learning-Based-Video-Compression.svg?color=red) 

# Contents (After June 2024)
- [Generative compression](#Generative)
- [Architecture](#ar)
- [MultiView & 360 Degree ](#multiv)
- [VCM & Feature Compression](#VCM)
- [Rate Control & Vraible rate](#RateControl)
- [Implicit neural representation](#implicit)
- [Low Complexity & Speed](#lowcomplexity)
- [Motion & Prediction](#motion)
- [Benchmark & Dataset & Survey](#bmk)




# Group by time (Before June 2024)
- [2024](#2024)
- [2023](#2023)
- [2022](#2022)
- [2021](#2021)
- [2020](#2020)
- [2019](#2019)
- [2018](#2018)
- [2017](#2017)

------


### <span id="Generative"> Generative compression

| Title | Pub. & Date
|:-----|:-----|
|[Ultra-Low Bitrate Face Video Compression Based on Conversions from 3D Keypoints to 2D Motion Map](http://arxiv.org/abs/2210.03335v1) | TIP 2024
|[Diffusion-based Perceptual Neural Video Compression with Temporal Diffusion Information Reuse](http://arxiv.org/abs/2501.13528v1) | arXiv 2025
|[Generative Human Video Compression with Multi-granularity Temporal Trajectory Factorization](https://arxiv.org/abs/2410.10171) | arXiv 2024
|[Beyond GFVC: A Progressive Face Video Compression Framework with Adaptive Visual Tokens](https://arxiv.org/pdf/2410.08485) | arXiv 2024
|[Multi-Reference Generative Face Video Compression with Contrastive Learning](https://arxiv.org/pdf/2409.01029) | arXiv 2024
|[When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding](https://arxiv.org/pdf/2408.08093) | Arxiv 2024
|[CodingHomo: Bootstrapping Deep Homography with Video Coding](https://ieeexplore.ieee.org/document/10570492/authors#authors) | TCSVT 2024|
|[I2VC: A Unified Framework for Intra- & Inter-frame Video Compression](https://arxiv.org/pdf/2405.14336) | Arixv 2024
|[PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding](https://openaccess.thecvf.com/content/CVPR2024/papers/) | Arxiv 2024
|[SMC++: Masked Learning of Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2406.04765/) | Arxiv 2024 |

### <span id="ar"> Architecture

| Title | Pub. & Date
|:-----|:-----|
|[Long-term Temporal Context Gathering for Neural Video Compression](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/08346.pdf)| ECCV 2025
|[CRAM: Large-scale Video Continual Learning with Bootstrapped Compression](https://www.arxiv.org/pdf/2508.05001)| ICCV 2025
|[Context Guided Transformer Entropy Modeling for Video Compression](https://iccv.thecvf.com/virtual/2025/poster/906)| ICCV 2025
|[EHVC: Efficient Hierarchical Reference and Quality Structure for Neural Video Coding](https://arxiv.org/pdf/2509.04118) | ACMMM 2025
|[Neural Video Compression with In-Loop Contextual Filtering and Out-of-Loop Reconstruction Enhancement](https://arxiv.org/pdf/2509.04051?) | arXiv 2025
|[An image to tailor: I-Frame Domain Adaptation in Neural Video Compression](https://openreview.net/pdf?id=6AU7JglYSV) | NeurIPSW 2024 
|[Adaptive Surveillance Video Compression With Background Hyperprior](http://arxiv.org/abs/2001.06590v3) | SPL 2024
|[Hybrid Scalable Video Coding with Neural Compression and Enhancement for Streaming Media](http://arxiv.org/abs/2107.05548v2) | ACM MM 2024
|[End-to-end Deep Video Compression Based on Hierarchical Temporal Context Learning](http://arxiv.org/abs/2204.11723v1) | TMM 2025
|[Motion Free B-frame Coding for Neural Video Compression](http://arxiv.org/abs/2309.13835v2) | arXiv 2024
|[GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting](https://arxiv.org/abs/2501.12060) | arXiv 2025
|[ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression](https://arxiv.org/pdf/2410.09706) | arXiv 2024
|[Joint Source-Channel Optimization for UAV Video Coding and Transmission](https://arxiv.org/pdf/2408.06667) | arXiv 2024
|[Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2409.11718) | ECCV 2025
|[VQ-DeepVSC: A Dual-Stage Vector Quantization Framework for Video Semantic Communication](https://arxiv.org/pdf/2409.03393) | arXiv 2024
|[Spatio-temporal convolutional neural network for enhanced inter prediction in video coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10648618) | TIP 2024
|[NVC-1B: A Large Neural Video Coding Model](https://arxiv.org/pdf/2407.19402) | arXiv 2024
|[Bi-Directional Deep Contextual Video Compression](https://arxiv.org/pdf/2408.08604) | arXiv 2024


### <span id="multiv"> MultiView & 360 Degree Video Compression
| Title | Pub. & Date
|:-----|:-----|
|[Beyond Perspective: Neural 360-Degree Video Compression](https://iccv.thecvf.com/virtual/2025/poster/879)|CVPR 2025
|[FV-NeRV: Neural Compression for Free Viewpoint Videos](https://openreview.net/pdf?id=hrXt6Fdl2P)| arXiv 2025
|[A Multi-Grid Implicit Neural Representation for Multi-View Videos](https://arxiv.org/pdf/2509.16706) |  arXiv 2025



### <span id="VCM"> Video Coding for Machine & Feature compression

| Title | Pub. & Date
|:-----|:-----|
|[Parameter-efficient instance-adaptive neural video compression](https://arxiv.org/abs/2405.08530) | ACCV 2024
|[DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines](None) | arXiv 2024
|[RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression](None) | arXiv 2025
|[Learned Multimodal Compression for Autonomous Driving](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10743791&casa_token=z5NqhvgXTigAAAAA:nOQNaJ2qxq5Cpr8a6ViW4up32c_fLiiLw5tHbDXsoYSWWL-xN3Xz4mdyvIR87gbjqHhoxbA) | MMSP 2024
| [DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy](https://arxiv.org/pdf/2410.18400) | arXiv 2024
| [Picture Partitioning Design of Neural Network-Based Intra Coding For Video Coding For Machines](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10647747) | ICIP 2024
| [ROI-DVC: A Region-of-Interest Based Deep Video Coding Framework](https://arxiv.org/pdf/2203.01978) | ICIP 2024
| [Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression](https://arxiv.org/pdf/2409.11718) | arXiv 2024
| [On Annotation-free Optimization of Video Coding for Machines](https://arxiv.org/pdf/2406.07938) | arXiv 2024
| [Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines](https://arxiv.org/pdf/2406.12367) | arXiv 2024
| [Deep Video Compression with Conditional Feature Coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10566367&casa_token=YrKZPj4xYCAAAAAA:w528F9C1IXPH1oYU0Tlcuviqv6MKIKcmoqduJayXaOE8mjgiNPnR8R54M86AH-SKI1B0ilQ) | PCS 2024





### <span id="RateControl"> Rate Control & Vraible rate
| Title | Pub. & Date
|:-----|:-----|
|[Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network](https://www.ecva.net/papers/eccv_2024/papers_ECCV/papers/11394.pdf)| ECCV 2025
|[Perception Loss Function Adaptive to Rate for Learned Video Compression](https://openreview.net/forum?id=XQN2sBSjuQ&referrer=%5Bthe%20profile%20of%20Buu%20Phan%5D(%2Fprofile%3Fid%3D~Buu_Phan3)) | NeurIPS 2024 
|[Content-Adaptive Rate Control Method for User-Generated Content Videos](http://arxiv.org/abs/2412.18834v1) | TCSVT 2024
|[Adaptive Rate Control for Deep Video Compression with Rate-Distortion Prediction](http://arxiv.org/abs/2412.18834v1) | arXiv 2024
| [Content-adaptive Variable Resolution Framework for Intra Coding](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10558245&casa_token=lukOUFOmaRQAAAAA:a33FUL_WUKxotgsitmwzYzHZNdeadpHVKFWmOWBqsu_2zQDjFPa8rNJEHrhaCdeqXxJ0cmk) | ISCAS 2024
| [Deep Video Codec Control for Vision Models](https://openaccess.thecvf.com/content/CVPR2024W/AI4Streaming/papers/Reich_Deep_Video_Codec_Control_for_Vision_Models_CVPRW_2024_paper.pdf) | CVPR 2024


### <span id="implicit"> Implicit Neural Representation
| Title | Pub. & Date
|:-----|:-----|
|[GIViC: Generative Implicit Video Compression](https://arxiv.org/pdf/2503.19604)| ICCV 2025
|[Towards Practical Real-Time Neural Video Compression](https://openaccess.thecvf.com/content/CVPR2025/papers/Jia_Towards_Practical_Real-Time_Neural_Video_Compression_CVPR_2025_paper.pdf)| CVPR 2025
|[A Multi-Grid Implicit Neural Representation for Multi-View Videos](https://arxiv.org/pdf/2509.16706) |  Arxiv 2025
|[SNeRV: Scalable Neural Representations for Video Coding](https://openreview.net/pdf?id=ZqN4bnXSSY) |  NeurIPSW 2024 
|[HFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for Videos](None) | ACM MM 2024
|[High-Frequency Enhanced Hybrid Neural Representation for Video Compression](http://arxiv.org/abs/2410.01654v2) | arXiv 2024
| [NVRC: Neural Video Representation Compression](https://arxiv.org/pdf/2409.07414) | NeurPIS 2024
| [PNVC: Towards Practical INR-based Video Compression](https://arxiv.org/pdf/2409.00953) | AAAI 2025
| [High-Frequency Enhanced Hybrid Neural Representation for Video Compression](https://arxiv.org/pdf/2411.06685) | arXiv 2024
| [Fast Encoding and Decoding for Implicit Video Representation](https://link.springer.com/chapter/10.1007/978-3-031-72933-1_23)  | ECCV 2024
| [QS-NeRV: Real-Time Quality-Scalable Decoding with Neural Representation for Videos](https://openreview.net/pdf?id=vJbyT9bYgf)  | ACM MM 2024
| [Temporal Enhanced Hybrid Neural Representation for Video Compression](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10566352&casa_token=Wj65ixeg3vgAAAAA:xqiVqUfQr-OJI5dKukLPbTE3wgpw9BpJYrywM999ul_5BKUBwI-Cwne-YQHiDU5bxbGcEt8) |  PCS 2024
| [Combining Frame and GOP Embeddings for Neural Video Representation](https://openaccess.thecvf.com/content/CVPR2024/papers/Saethre_Combining_Frame_and_GOP_Embeddings_for_Neural_Video_Representation_CVPR_2024_paper.pdf) | CVPR 2024


### <span id="lowcomplexity"> Low Complexity & Speed
| Title | Pub. & Date
|:-----|:-----|
| [Real-Time Semantic Video Communication of General Scenes](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10647591) | ICIP 2024
| [Accelerating Learned Video Compression via Low-Resolution Representation Learning](https://arxiv.org/pdf/2407.16418) | arXiv 2024
| [Standard compliant video coding using low complexity, switchable neural wrappers](https://arxiv.org/pdf/2407.07395) | arXiv 2024



### <span id="motion"> Motion & Prediction
| Title | Pub. & Date
|:-----|:-----|
|[Perceptual Video Compression with Neural Wrapping](https://openaccess.thecvf.com/content/CVPR2025/papers/Khan_Perceptual_Video_Compression_with_Neural_Wrapping_CVPR_2025_paper.pdf)| CVPR 2025
|[FLAVC: Learned Video Compression with Feature Level Attention](https://openaccess.thecvf.com/content/CVPR2025/papers/Zhang_FLAVC_Learned_Video_Compression_with_Feature_Level_Attention_CVPR_2025_paper.pdf)| CVPR 2025
|[CodingHomo: Bootstrapping Deep Homography With Video Coding](https://ieeexplore.ieee.org/document/10570492) | TCSVT 2024
| [Deep Video Compression with Scaled Hierarchical Bi-directional Motion Model](https://dl.acm.org/doi/pdf/10.1145/3664647.3685524?casa_token=YML8Fy3tKDwAAAAA:3rMIk_MV86yMzc_U6FV7cl3mXydMFhWiQTFl5qetd2czGsGPHvXlhlmxXxNyLshHPn_Ui0dVk0U) | ACMMM 2024
| [Multi-Scale Motion Alignment and Frame Reconstruction for Efficient Deep Video Compression](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10636206) | SPL 2024
| [High-Efficiency Neural Video Compression via Hierarchical Predictive Learning](https://arxiv.org/pdf/2410.02598) | arXiv 2024
| [Spatial Neighbor Information Assisted Motion Compensated Temporal Filter for Video Coding](https://ieeexplore.ieee.org/document/10566466) | PCS 2024



### <span id="bmk"> Benchmark & Dataset & Survey
| Title | Pub. & Date
|:-----|:-----|
|[Human-Machine Collaborative Image and Video Compression: A Survey](https://www.vmsci.com/en/articles/5959385__HumanMachine_Collaborative_Image_and_Video_Compression_A_Survey) | ATSIP 2024
|[USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s](https://arxiv.org/pdf/2409.08481) | arXiv 2024


## <span id="2024">✔2024 </span> [       «🎯Back To Top»       ](#)


- (CVPR 2024) **Deep Video Codec Control for Vision Models** Reich C, Debnath B, Patel D, et al. [paper](https://openaccess.thecvf.com/content/CVPR2024W/AI4Streaming/papers/Reich_Deep_Video_Codec_Control_for_Vision_Models_CVPRW_2024_paper.pdf)


- (ToMM 2024) **Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement** Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Zhai, Yongqi and Wang, Ronggang[paper](https://dl.acm.org/doi/pdf/10.1145/3661824)

- (Trans Broadcasting 2024) **Depth Video Inter Coding Based on Deep Frame Generationl**Li, Ge and Lei, Jianjun and Pan, Zhaoqing and Peng, Bo and Ling, Nam[paper](https://ieeexplore.ieee.org/abstract/document/10485621)

- (ICASSP 2024) **Rate-Quality Based Rate Control Model for Neural Video Compression**Liao, Shuhong and Jia, Chuanmin and Fan, Hongfei and Yan, Jingwen and Ma, Siwei[paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10447777) 
- (ICASSP 2024) **Learned Video Compression with Spatial-Temporal Optimization** Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qian[paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10446198) 
- (ICASSP 2024) **Region-Adaptive Video Sharpening Via Rate-Perception Optimization** Pang, Yingxue and Zhao, Shijie and Guo, Mengxi and Li, Junlin and Zhang, Li [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10446929&casa_token=maZ7QZiRtLIAAAAA:fY0Ate5C0-QcGVKcfgigdZor4FuBS-RY2l5XWgEP_EIKoNU9VkDKqyJ-3vUmPvDdtV3NBOwOCQ) 
- (ICASSP 2024) **Leveraging Redundancy in Feature for Efficient Learned Image Compression** Qin, Peng and Bao, Youneng and Meng, Fanyang and Tan, Wen and Li, Chao and Wang, Genhong and Liang, Yongsheng [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10447424&casa_token=LvyB8nyNgq4AAAAA:P8Bl52gCJRLfhbJZeqx77XpBpjW59ptF5lBbU3jRFQnr8MuhqrIuwsS7mtd63Hcz7iDayZK2Kw) 
- (ICASSP 2024) **A Tri-Dynamic Preprocessing Framework for UGC Video Compression** Zhao, Fei and Guo, Mengxi and Zhao, Shijie and Li, Junlin and Zhang, Li and Xie, Xiaodong [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10445837&casa_token=1Gg25Sz6eJkAAAAA:rQRn-jysJ7-nDKslGAHIzSUJCeHKN-90xegmQsv5o-HGRqkAiVuEE9nhWV-qzlOICuKL17vCHQ) 
- (ICASSP 2024) **Improving Learned Video Compression by Exploring Spatial Redundancy** Yang, Jiayu and Yang, Chunhui and Zhai, Yongqi and Wang, Qi and Pan, Xinghao and Wang, Ronggang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10448496&casa_token=ladLsmyfXJMAAAAA:wPVZPmeZ260DtCgmKXy9smwYmZR4x6yhKpMnRHgkTVuZZoMqhO-cbvktMJvFdnWXW5vxak1BXA) 
- (ICASSP 2024) **Learned Video Compression with Spatial-Temporal Optimization** Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qian [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10446198&casa_token=yXWdEXN4tkMAAAAA:o-lFjdjZZ2mSXCTXzEuhN7z6w37arL3vMauGcMJwRftbBqu_XnunaByqAcmf3VXpqsXePQJowQ) 


- (WCACV 2024) **MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device** van Rozendaal, Ties and others [paper](https://arxiv.org/pdf/2310.01258.pdf) 

- (TPAMI 2024) **VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision** Sheng, Xihua and Li, Li and Liu, Dong and Li, Houqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10411051&casa_token=SATXNmT8_RUAAAAA:lIcbJH933NSAAoZP_IGqEPJy6dkK3J9soKCjCWGLZa-qRS8m5oJR4Tcy9XADF2ssEzSfsdvM)
- (TPAMI 2024) **A Coding Framework and Benchmark towards Low-Bitrate Video Understanding** Tian, Yuan and Lu, Guo and Yan, Yichao and Zhai, Guangtao and Chen, Li and Gao, Zhiyong [paper](https://ieeexplore.ieee.org/iel7/34/4359286/10440520.pdf)


- (TIP 2024) **Cross-Component Prediction Boosted With Local and Non-Local Information in Video Coding** Zhang, Kai and Deng, Zhipin and Zhang, Li [paper](https://ieeexplore.ieee.org/document/10413275)

- (TCSVT 2024) **Exploiting Bidirectional Quality Impulse for Reference Picture Resampled Gaming Video Coding** Fang, Xiaohan and Chen, Peilin and Wang, Meng and Xie, Xi and Wang, Shiqi and Wang, Shanshe and Ma, Siwei [paper](https://ieeexplore.ieee.org/document/10477392/)

- (TCSVT 2024) **Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression** Becking, Daniel and M{\"u}ller, Karsten and Haase, Paul and Kirchhoffer, Heiner and Tech, Gerhard and Samek, Wojciech and Schwarz, Heiko and Marpe, Detlev and Wiegand, Thomas [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10412190)


- (Arxiv 2024) **Efficient Learned Wavelet Image and Video Coding**Meyer, Anna and Prativadibhayankaram, Srivatsa and Kaup, Andre[paper](https://arxiv.org/pdf/2405.12631)
- (Arxiv 2024) **Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression** Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dong[paper](https://arxiv.org/pdf/2405.04274)
- (Arxiv 2024) **Parameter-Efficient Instance-Adaptive Neural Video Compression** Yang, Hyunmo and Oh, Seungjun and Park, Eunbyung[paper](https://arxiv.org/pdf/2405.08530)

- (Arxiv 2024) **Task-Aware Encoder Control for Deep Video Compression**Ge, Xingtong and Luo, Jixiang and Zhang, Xinjie and Xu, Tongda and Lu, Guo and He, Dailan and Geng, Jing and Wang, Yan and Zhang, Jun and Qin, Hongwei[paper](https://arxiv.org/pdf/2404.04848.pdf)
- (Arxiv 2024) **Image and Video Compression using Generative Sparse Representation with Fidelity Controls**Jiang, Wei and Wang, Wei[paper](https://arxiv.org/pdf/2404.06076.pdf)
- (Arxiv 2024) **A Perspective on Deep Vision Performance with Standard Image and Video Codecs**Reich, Christoph and Hahn, Oliver and Cremers, Daniel and Roth, Stefan and Debnath, Biplob[paper](https://arxiv.org/pdf/2404.12330)

- (Arxiv 2024) **Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression** Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dong[paper](https://arxiv.org/pdf/2405.04274)
- (Arxiv 2024) **CGVC-T: Contextual Generative Video Compression with Transformers** Du, Pengli and Liu, Ying and Ling, Nam[paper](https://ieeexplore.ieee.org/abstract/document/10496072?casa_token=W5EwodaNwdMAAAAA:Ki0F2KsNW7G3tB-_Qp3r92XBObDsMoHn5gQszGarpIfsrs57pHMu9Cx7rrl3nyh-Mu_YbLdDWg)
- (Arxiv 2024) **Low-Latency Neural Stereo Streaming** Hou, Qiqi and Farhadzadeh, Farzad and Said, Amir and Sautiere, Guillaume and Le, Hoang[paper](https://arxiv.org/pdf/2403.17879.pdf)

- (Arxiv 2024) **Analysis of Neural Video Compression Networks for 360-Degree Video Coding** Regensky, Andy and Brand, Fabian and Kaup, Andr{\'e}[paper](https://arxiv.org/pdf/2402.10257.pdf)


- (Arxiv 2024) **Extreme Video Compression with Pre-trained Diffusion Models** Li, Bohan and Liu, Yiming and Niu, Xueyan and Bai, Bo and Deng, Lei and G{\"u}nd{\"u}z, Deniz [paper](https://arxiv.org/pdf/2402.08934v1.pdf)

- (Arxiv 2024) **Boosting Neural Representations for Videos with a Conditional Decoder** Zhang, Xinjie and Yang, Ren and He, Dailan and Ge, Xingtong and Xu, Tongda and Wang, Yan and Qin, Hongwei and Zhang, Jun [paper](https://arxiv.org/pdf/2402.18152v1.pdf)

- (Arxiv 2024) **Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding** Menon, Vignesh V and Zhu, Jingwen and Rajendran, Prajit T and Afzal, Samira and Schoeffmann, Klaus and Callet, Patrick Le and Timmerer, Christian [paper](https://arxiv.org/pdf/2401.15343.pdf)

- (Arxiv 2024) **VQ-NeRV: A Vector Quantized Neural Representation for Videos** Xu, Yunjie and Feng, Xiang and Qin, Feiwei and Ge, Ruiquan and Peng, Yong and Wang, Changmiao[paper](https://arxiv.org/pdf/2403.12401v1.pdf)
- (Arxiv 2024) **Low-Rate, Low-Distortion Compression with Wasserstein Distortion** Qiu, Yang and Wagner, Aaron B [paper](https://arxiv.org/pdf/2401.16858.pdf)
- (Arxiv 2024) **LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression** Jiang, Wei and Li, Junru and Zhang, Kai and Zhang, Li[paper](https://arxiv.org/pdf/2402.00680.pdf)
- (Arxiv 2024) **Immersive Video Compression using Implicit Neural Representations** Kwan, Ho Man and Zhang, Fan and Gower, Andrew and Bull, David[paper](https://arxiv.org/pdf/2402.01596.pdf)
- (Arxiv 2024) **Cool-chic video: Learned video coding with 800 parameters** Leguay, Thomas and Ladune, Th{\'e}o and Philippe, Pierrick and D{\'e}forges, Olivier[paper](https://arxiv.org/pdf/2402.03179.pdf)
- (Arxiv 2024) **A Neural-network Enhanced Video Coding Framework beyond ECM** Zhao, Yanchen and He, Wenxuan and Jia, Chuanmin and Wang, Qizhe and Li, Junru and Li, Yue and Lin, Chaoyi and Zhang, Kai and Zhang, Li and Ma, Siwei [paper](https://arxiv.org/pdf/2402.08397.pdf)
- (Arxiv 2024) **Motion-Adaptive Inference for Flexible Learned B-Frame Compression** Akin Yilmaz, M and Ugur Ulas, O and Bilican, Ahmet and Murat Tekalp, A [paper](https://arxiv.org/pdf/2402.08550.pdf)
- (Arxiv 2024) **Analysis of Neural Video Compression Networks for 360-Degree Video Coding** Regensky, Andy and Brand, Fabian and Kaup, Andr{\'e} [paper](https://arxiv.org/pdf/2402.10257.pdf)




- (VICP 2024) **High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference** Zhang, Zhiyu and Tang, Anni and Zhu, Chen and Lu, Guo and Xie, Rong and Song, Li [paper](https://arxiv.org/pdf/2401.16858.pdf)

- (MMM 2024) **Hierarchical Bi-directional Temporal Context Mining for Improved Video Compression** Lin, Zijian and Luo, Jianping [paper](https://link.springer.com/chapter/10.1007/978-3-031-53305-1_31)



---


## <span id="2023">✔2023 </span> [       «🎯Back To Top»       ](#)

- (NeurIPS 2023) **HiNeRV: Video Compression with Hierarchical Encoding based Neural Representation** Kwan, Ho Man and Gao, Ge and Zhang, Fan and Gower, Andrew and Bull, David [paper](https://arxiv.org/pdf/2306.09818.pdf) [code](https://hmkx.github.io/hinerv/)


- (TPAMI 2023) **Compressed-SDR to HDR Video Reconstruction** Wang, Hu and Ye, Mao and Zhu, Xiatian and Li, Shuai and Li, Xue and Zhu, Ce [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10373884&casa_token=0YwYzQr-mtIAAAAA:fRzSWMZHeYm4f4lBNIZFowtNF9ZZxh5Lm7B36KGipe_6I1WgOWB50iqbNvOWdEP4tSU5DH8JIw)
- (TIP 2023) **Sur-driven video Bitrate control for jointly optimizing perceptual quality and buffer control** Yang, Zetao and Gao, Wei and Li, Ge and Yan, Yiqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10266980&casa_token=AgyJU8S3bVUAAAAA:y7CRQnfjtPHN4HBuayf_xeY5QlAMdhhNlCO6CnNUqhMuwMuqyzhuBG52CxQAEphl8_0nfzXx)
- (Trans BROADCASTING 2023) **Virtual-Competitors-Based Rate Control for 360-Degree Video Coding** Lin, Jielian and Lin, Hongbin and Xu, Yiwen and Kang, Yuanxun and Zhao, Tiesong [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10341540)

- (Neurocomputing 2023) **Multiple Hypotheses Based Motion Compensation for Learned Video Compression** Lin, Rongqun and Wang, Meng and Zhang, Pingping and Wang, Shiqi and Kwong, Sam [paper](https://www.sciencedirect.com/science/article/pii/S0925231223005192?casa_token=4V7fGASs-pYAAAAA:8Lk-HCwftOicqBzj2F6i3YVJCOd9MUnokVmDANZRU1D7mwIPauX_pAAcCaMqiVWKCNzkwFSp)


- (ACMMM 2023) **High Visual-Fidelity Learned Video Compression** Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi [paper](https://arxiv.org/pdf/2310.04679.pdf)
- (ACMMM 2023) **DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision** Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi [paper](https://dl.acm.org/doi/10.1145/3581783.3612500)
- (ACMMM 2023) **Neural Video Compression with Spatio-Temporal Cross-Covariance Transformers** Chen, Zhenghao and Relic, Lucas and Azevedo, Roberto and Zhang, Yang and Gross, Markus and Xu, Dong and Zhou, Luping and Schroers, Christopher [paper](https://studios.disneyresearch.com/app/uploads/2023/09/Neural-Video-Compression-with-Spatio-Temporal-Cross-Covariance-Transformers-Paper.pdf)
- (ACMMM 2023) **Peering into The Sketch: Ultra-Low Bitrate Face Compression for Joint Human and Machine Perception** Mao, Yudong and Chen, Peilin and Wang, Shurun and Wang, Shiqi and Wu, Dapeng [paper](https://dl.acm.org/doi/pdf/10.1145/3581783.3613799?casa_token=CE_6kUxeIREAAAAA:xmZFIQJFRKkIZE1VBMyq3npr-gzcJQ4cyJAHDNPivRjQZJ4jcpy5MfJO9WkRIwpFwwBR_11yH7gZkg)




- (TMM 2023) **End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding** Tang, Tong and Yin, Zhiyang and Li, Jie and Wang, Honggang and Wu, Dapeng and Wang, Ruyan [paper](https://ieeexplore.ieee.org/abstract/document/10285532)

- (TMM 2023) **Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression** Zhang, Yun and Lin, Haoqin and Sun, Jing and Zhu, Linwei and Kwong, Sam [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10349945&casa_token=zhkj56wMmBQAAAAA:393OXq2npAAaHU5t6XW2D2o-By7m80ucHcQPUPG9tGAf2D78ibRSD-dnKhFrhilz2CNNc78K8g)

- (TMM 2023) **Enhanced Context Mining and Filtering for Learned Video Compression** Guo, Haifeng and Kwong, Sam and Ye, Dongjie and Wang, Shiqi [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10254316&casa_token=D3_YJB3zg1YAAAAA:8BqE8OyoHJrRtmMj4tshdGApp4sKsOG11-rREhnW-0WhLDWJZ43MFiFsbsKdiNMVr6lX8zz-)
- (TMM 2023) **Content-adaptive Rate-Distortion Modeling for Frame-level Rate Control in Versatile Video Coding** Liao, Junqi and Li, Li and Liu, Dong and Li, Houqiang [paper](https://ieeexplore.ieee.org/iel7/6046/4456689/10413636.pdf?casa_token=TdPkR5FGB4oAAAAA:Nu-0r0eo2oZwYNHrheWWHhY0XciQvsuO9a9lX-DDhpYLGgowapfbLMALVrkFzq7Omro025xV)



- (TOMM 2023) **Principal Component Approximation Network for Image Compression** Zhang, Shupei and Zhao, Chenqiu and Basu, Anup [paper](https://dl.acm.org/doi/abs/10.1145/3637490)

- (ICCV 2023) **Non-Semantics Suppressed Mask Learning for Unsupervised Video Semantic Compression** Abdulmotaleb El{-}Saddik and Tao Mei and Rita Cucchiara and Marco Bertini and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain [paper](https://scholar.googleusercontent.com/scholar.bib?q=info:uv3Zxyq_gOkJ:scholar.google.com/&output=citation&scisdr=ClHXww84EOfu32LPvEI:AFWwaeYAAAAAZcHJpEI-LTSd7LD2FMwPd1sdahA&scisig=AFWwaeYAAAAAZcHJpOVN6Jgku4ecVTjwiEMA-e4&scisf=4&ct=citation&cd=-1&hl=zh-CN)

- (ICIP 2023) **FGC-VC: Flow-Guided Context Video Compression** Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Guo, Xiaotong [paper](https://ieeexplore.ieee.org/abstract/document/10222501) 
- (ICIP 2023) **Block-Based Motion Estimation for Deep-Learned Video Coding** S. Pientka, M. Schäfer, J. Pfaff, H. Schwarz, D. Marpe and T. Wiegand [paper](https://ieeexplore.ieee.org/document/10222411) 
- (ICIP 2023) **Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation** Meng, Xiandong and Zhu, Shuyuan and Ma, Siwei and Zeng, Bing [paper](https://ieeexplore.ieee.org/abstract/document/10222381)
- (ICIP 2023) **Multi-scale deformable alignment and content-adaptive inference for flexible-rate bi-directional video compression** Y{\i}lmaz, M Ak{\i}n and Ulas, O Ugur and Tekalp, A Murat [paper](https://arxiv.org/pdf/2306.16544.pdf) 
- (ICIP 2023) **Machine-Attention-based Video Coding for Machines** Lee, Yegi and Kim, Shin and Yoon, Kyoungro and Lim, Hanshin and Kwak, Sangwoon and Choo, Hyon-Gon [paper](https://ieeexplore.ieee.org/abstract/document/10222037) 
- (ICIP 2023) **Predictive Coding for Animation-Based Video Compression** Konuko, Goluck and Lathuili{\`e}re, St{\'e}phane and Valenzise, Giuseppe [paper](https://arxiv.org/pdf/2307.04187.pdf) 
- (ICIP 2023) **Blurry Video Compression: A Trade-Off Between Visual Enhancement and Data Compression** Argaw, Dawit Mureja and Kim, Junsik and Kweon, In So [paper](https://openaccess.thecvf.com/content/WACV2024/papers/Argaw_Blurry_Video_Compression_A_Trade-Off_Between_Visual_Enhancement_and_Data_WACV_2024_paper.pdf) 
- (TCSVT 2023) **End-to-end learnable multi-scale feature compression for vcm** Kim, Yeongwoong and Jeong, Hyewon and Yu, Janghyun and Kim, Younhee and Lee, Jooyoung and Jeong, Se Yoon and Kim, Hui Yong [paper](https://arxiv.org/pdf/2306.16670.pdf)
- (TCSVT 2023) **Camera Pose-Based Background Modeling for Video Coding in Moving Cameras** Fang, Zheng and Zheng, Mingkui and Chen, Pingping and Chen, Zhifeng and Wu, Dapeng Oliver [paper](https://ieeexplore.ieee.org/abstract/document/10261273)
- (TCSVT 2023) **Sparse-to-Dense: High Efficiency Rate Control for End-to-end Scale-Adaptive Video Coding** Chen, Jiancong and Wang, Meng and Zhang, Pingping and Wang, Shurun and Wang, Shiqi [paper](https://ieeexplore.ieee.org/abstract/document/10246313)
- (TCSVT 2023) **MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding** Jia, Chuanmin and Ye, Feng and Dong, Fanke and Lin, Kai and Chiariglione, Leonardo and Ma, Siwei and Sun, Huifang and Gao, Wen [paper](https://arxiv.org/pdf/2309.07589.pdf)
- (TCSVT 2023) **DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework** Xue, Dongmei and Ma, Haichuan and Li, Li and Liu, Dong and Xiong, Zhiwei and Li, Houqiang [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=10210614)

- (CVPR 2023) **Towards Scalable Neural Representation for Diverse Videos** He, Bo and Yang, Xitong and Wang, Hanyu and Wu, Zuxuan and Chen, Hao and Huang, Shuaiyi and Ren, Yixuan and Lim, Ser-Nam and Shrivastava, Abhinav [paper](https://arxiv.org/abs/2303.14124)
- (CVPR 2023) **DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos** Zhao, Qi and Asif, M Salman and Ma, Zhan [paper](https://openaccess.thecvf.com/content/CVPR2023/papers/Zhao_DNeRV_Modeling_Inherent_Dynamics_via_Difference_Neural_Representation_for_Videos_CVPR_2023_paper.pdf)
- (CVPR 2023) **HNeRV: A Hybrid Neural Representation for Videos** Chen, Hao and Gwilliam, Matt and Lim, Ser-Nam and Shrivastava, Abhinav [paper](https://arxiv.org/pdf/2304.02633.pdf)
- (CVPR 2023) **Motion Information Propagation for Neural Video Compression** Qi, Linfeng and Li, Jiahao and Li, Bin and Li, Houqiang and Lu, Yan [paper](https://openaccess.thecvf.com/content/CVPR2023/papers/Qi_Motion_Information_Propagation_for_Neural_Video_Compression_CVPR_2023_paper.pdf)

- (ICASSP 2023) **LCCM-VC: LEARNED CONDITIONAL CODING MODES FOR VIDEO CODING** Hadi Hadizadeh and Ivan V. Bajic [paper](https://arxiv.org/pdf/2210.15883.pdf)



- (Arxiv 2023) **Implicit-explicit Integrated Representations for Multi-view Video Compression** Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Li[paper](https://arxiv.org/pdf/2311.13846.pdf)
- (Arxiv 2023) **Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation** Peng, Tianhao and Gao, Ge and Sun, Heming and Zhang, Fan and Bull, David[paper](https://arxiv.org/pdf/2312.02605.pdf)
- (Arxiv 2023) **Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data** Le, Hieu and Santos, Hernan and Tao, Jian[paper](https://arxiv.org/pdf/2307.04216.pdf)
- (Arxiv 2023) **Offline and Online Optical Flow Enhancement for Deep Video Compression** Tang, Chuanbo and Sheng, Xihua and Li, Zhuoyuan and Zhang, Haotian and Li, Li and Liu, Dong[paper](https://arxiv.org/pdf/2307.05092.pdf)
- (Arxiv 2023) **CANF-VC++: Enhancing Conditional Augmented Normalizing Flows for Video Compression with Advanced Techniques** Chen, Peng-Yu and Peng, Wen-Hsiao [paper](https://arxiv.org/pdf/2309.05382.pdf)
- (Arxiv 2023) **Implicit-explicit Integrated Representations for Multi-view Video Compression** Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Li [paper](https://arxiv.org/pdf/2311.17350.pdf)
- (Arxiv 2023) **C3: High-performance and low-complexity neural compression from a single image or video** Kim, Hyunjik and Bauer, Matthias and Theis, Lucas and Schwarz, Jonathan Richard and Dupont, Emilien [paper](https://arxiv.org/pdf/2312.02753.pdf)

- (Arxiv 2023) **Interactive Face Video Coding: A Generative
Compression Framework** Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2302.09919.pdf)
- (Arxiv 2023) **MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression** Chen, Yi-Hsin and Xie, Hong-Sheng and Chen, Cheng-Wei and Gao, Zong-Lin and Peng, Wen-Hsiao and Benjak, Martin and Ostermann, J{\"o}rn[paper](https://arxiv.org/pdf/2312.15829.pdf)
- (Arxiv 2023) **Interactive Face Video Coding: A Generative
Compression Framework** Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2302.09919.pdf)
- (Arxiv 2023) **Butterfly: Multiple Reference Frames Feature Propagation Mechanism for Neural Video Compression** Wang, Feng and Ruan, Haihang and Xiong, Fei and Yang, Jiayu and Li, Litian and Wang, Ronggang [paper](https://arxiv.org/pdf/2303.02959.pdf)
- (Arxiv 2023) **IBVC: Interpolation-driven B-frame Video Compression** Liu, Meiqin and Xu, Chenming and Yao, Chao and Lin, Weisi and Zhao, Yao [paper](https://arxiv.org/pdf/2309.13835.pdf)
- (Arxiv 2023) **Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression** Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Li, Xing [paper](https://arxiv.org/pdf/2310.12733.pdf)
- (Arxiv 2023) **Effortless Cross-Platform Video Codec: A Codebook-Based Method** Tian, Kuan and Guan, Yonghang and Xiang, Jinxi and Zhang, Jun and Han, Xiao and Yang, Wei [paper](https://arxiv.org/pdf/2310.10292.pdf)

- (Arxiv 2023) **Generative Face Video Coding Techniques and Standardization Efforts: A Review** Chen, Bolin and Chen, Jie and Wang, Shiqi and Ye, Yan [paper](https://arxiv.org/pdf/2311.02649.pdf)
- (Arxiv 2023) **Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs** Said, Amir and Le, Hoang and Farhadzadeh, Farzad [paper](https://arxiv.org/pdf/2312.00921.pdf)
- (Arxiv 2023) **Hyperspectral Image Compression Using Sampling and Implicit Neural Representations** Rezasoltani, Shima and Qureshi, Faisal Z [paper](https://arxiv.org/pdf/2312.01558.pdf)
- (Arxiv 2023) **Deep Hierarchical Video Compression** Lu, Ming and Duan, Zhihao and Zhu, Fengqing and Ma, Zhan [paper](https://arxiv.org/pdf/2312.07126.pdf)
- (Arxiv 2023) **VCD: A Video Conferencing Dataset for Video Compression** Naderi, Babak and Cutler, Ross and Khongbantabam, Nabakumar Singh and Hosseinkashi, Yasaman [paper](https://arxiv.org/pdf/2309.07376.pdf)


---

## <span id="2022">✔2022 </span> [       «🎯Back To Top»       ](#)

---

- (Arxiv 2022) **VCT: A Video Compression Transformer** Mentzer, Fabian and Toderici, George and Minnen, David and Hwang, Sung-Jin and Caelles, Sergi and Lucic, Mario and Agustsson, Eirikur [paper](https://arxiv.org/pdf/2206.07307.pdf)


- (ECCV 2022) **Neural Video Compression Using GANs for Detail Synthesis and Propagation** Mentzer, Fabian and Agustsson, Eirikur and Ball{\'e}, Johannes and Minnen, David and Johnston, Nick and Toderici, George [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19809-0_32.pdf)
- (ECCV 2022) **Canf-vc: Conditional augmented normalizing flows for video compression** Ho, Yung-Han and Chang, Chih-Peng and Chen, Peng-Yu and Gnutti, Alessandro and Peng, Wen-Hsiao [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19787-1_12.pdf)
- (ECCV 2022) **AlphaVC: High-Performance and Efficient Learned Video Compression** Shi, Yibo and Ge, Yunying and Wang, Jing and Mao, Jue [paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19800-7_36.pdf)
- (ECCV 2022) **E-nerv: Expedite neural video representation with disentangled spatial-temporal context** Li, Zizhang and Wang, Mengmeng and Pi, Huaijin and Xu, Kechun and Mei, Jianbiao and Liu, Yong [paper](https://arxiv.org/pdf/2207.08132.pdf)
- (ACM MM 2022) **Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression** Li, Jiahao and Li, Bin and Lu, Yan. [paper](https://arxiv.org/pdf/2207.05894.pdf)

- (TMM 2022) **Temporal Context Mining for Learned Video Compression** Sheng, Xihua and Li, Jiahao and Li, Bin and Li, Li and Liu, Dong and Lu, Yan [paper](https://arxiv.org/pdf/2111.13850.pdf)

- (TCSVT 2022) **HMFVC: A Human-Machine Friendly Video Compression Scheme** Huang, Zhimeng and Jia, Chuanmin and Wang, Shanshe and Ma, Siwei [paper](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9894405)

- (arXiv preprint 2022) **CONTENT-ADAPTIVE MOTION RATE ADAPTION FOR LEARNED VIDEO COMPRESSION** Chen, Chih-Hsuan Lin Yi-Hsin and Peng, Wen-Hsiao [[paper](http://mapl.nctu.edu.tw/content/pages/research/PCS_2022.pdf)]

- (CVPRW 2022) **Learned Low Bitrate Video Compression with Space-Time Super-Resolution** Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Wang, Feng and Wang, Ronggang [[paper](https://openaccess.thecvf.com/content/CVPR2022W/CLIC/papers/Yang_Learned_Low_Bitrate_Video_Compression_With_Space-Time_Super-Resolution_CVPRW_2022_paper.pdf)]
- (CVPRW 2022) **Learned Low Bitrate Video Compression With Space-Time Super-Resolution** Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Wang, Feng and Wang, Ronggang [[paper](https://openaccess.thecvf.com/content/CVPR2022W/CLIC/papers/Yang_Learned_Low_Bitrate_Video_Compression_With_Space-Time_Super-Resolution_CVPRW_2022_paper.pdf)]
- (CVPR 2022) **Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction** Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu [[paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Hu_Coarse-To-Fine_Deep_Video_Coding_With_Hyperprior-Guided_Mode_Prediction_CVPR_2022_paper.pdf)]
- (CVPR 2022) **Learning Based Multi-Modality Image and Video Compression**, Lu, Guo and Zhong, Tianxiong and Geng, Jing and Hu, Qiang and Xu, Dong [[paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Lu_Learning_Based_Multi-Modality_Image_and_Video_Compression_CVPR_2022_paper.pdf)]
- (CVPR 2022) **LSVC: A Learning-based Stereo Video Compression Framework**, Chen, Zhenghao and Lu, Guo and Hu, Zhihao and Liu, Shan and Jiang, Wei and Xu, Dong [[paper]](https://openaccess.thecvf.com/content/CVPR2022/papers/Chen_LSVC_A_Learning-Based_Stereo_Video_Compression_Framework_CVPR_2022_paper.pdf) 

- (TPAMI 2022) **Multi-modality deep restoration of extremely compressed face videos**, Zhang, Xi and Wu, Xiaolin. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9730053)
  
- (arXiv preprint 2022) **A Coding Framework and Benchmark towards Compressed Video Understanding**, Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao. [[paper]](https://arxiv.org/pdf/2202.02813.pdf)

- (Under review ICLR 2022) **Learning Perceptual Compression of Facial Video**, Shukor, Mustafa and Xu, YAO and Damodaran, Bharath Bhushan and Hellier, Pierre. [[paper]](https://openreview.net/pdf?id=4ZEJ_Z18NH)

- (Under review ICLR 2022) **Uncertainty-Aware Deep Video Compression with Ensembles**, Ma, Wufei and Li, Jiahao and Li, Bin and Lu, Yan. [[paper]](https://openreview.net/pdf?id=vkZtFD0zga8)

- (Signal Processing: Image Communication 2022) **Learning to compress videos without computing motion**, Chen, Meixu and Goodall, Todd and Patney, Anjul and Bovik, Alan C. [[paper]](https://reader.elsevier.com/reader/sd/pii/S0923596522000029?token=0DD9114AD904612721941553941BA62D7D7F1FCC292AF6C26D121372C2E69C81B4ACDCBD040F51AA44EEF35A1038DE80&originRegion=us-east-1&originCreation=20220419084824)

- (arXiv preprint 2022) **Multi-View Video Coding with GAN Latent Learning**, Lan, Chengdong and Luo, Cheng and Yan, Hao and Zhao, Tiesong and Kwong, Sam. [[paper]](https://arxiv.org/pdf/2205.03599.pdf)

- (ICASSP 2022) **Rate Control for Learned Video Compression**, Li, Yanghao and Chen, Xinyao and Li, Jisheng and Wen, Jiangtao and Han, Yuxing and Liu, Shan and Xu, Xiaozhong. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9746080)

- (TCSVT 2022) **Edge-Based Video Compression Texture Synthesis using Generative Adversarial Network**, Zhu, Chen and Xu, Jun and Feng, Donghui and Xie, Rong and Song, Li. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9762281)



---

## <span id="2021">✔2021 </span> [       «🎯Back To Top»       ](#)

---
- (NeurIPS 2021) **Nerv: Neural representations for videos**Chen, Hao and He, Bo and Wang, Hanyu and Ren, Yixuan and Lim, Ser Nam and Shrivastava, Abhinav [[paper]](https://proceedings.neurips.cc/paper_files/paper/2021/file/b44182379bf9fae976e6ae5996e13cd8-Paper.pdf) 

- (ICLR 2021) **Hierarchical autoregressive modeling for neural video compression**, Yang, Ruihan and Yang, Yibo and Marino, Joseph and Mandt, Stephan. [[paper]](https://arxiv.org/pdf/2010.10258.pdf) 

- (TPAMI 2021) **An end-to-end learning framework for video compression**, Lu, Guo and Zhang, Xiaoyun and Ouyang, Wanli and Chen, Li and Gao, Zhiyong and Xu, Dong. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9072487) 

- (TIP 2021) **End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression**, Y{\i}lmaz, M Ak{\i}n and Tekalp, A Murat. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9667275)

- (CVPR 21) **Online-trained Upsampler for Deep Low Complexity Video Compression**, Klopp, Jan P and Liu, Keng-Chi and Chien, Shao-Yi and Chen, Liang-Gee. [[paper]](https://openaccess.thecvf.com/content/ICCV2021/papers/Klopp_Online-Trained_Upsampler_for_Deep_Low_Complexity_Video_Compression_ICCV_2021_paper.pdf)

- (NIPS 21) **Deep Contextual Video Compression**, Li, Jiahao and Li, Bin and Lu, Yan. [[paper]](https://proceedings.neurips.cc/paper/2021/file/96b250a90d3cf0868c83f8c965142d2a-Paper.pdf)

- (CVPR 21) **ELF-VC: Efficient Learned Flexible-Rate Video Coding**, Rippel, Oren and Anderson, Alexander G and Tatwawadi, Kedar and Nair, Sanjay and Lytle, Craig and Bourdev, Lubomir. [[paper]](https://openaccess.thecvf.com/content/ICCV2021/papers/Rippel_ELF-VC_Efficient_Learned_Flexible-Rate_Video_Coding_ICCV_2021_paper.pdf)

- (CVPR 21) **FVC: A New Framework towards Deep Video Compression in Feature Space**, Hu, Zhihao and Lu, Guo and Xu, Dong. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Hu_FVC_A_New_Framework_Towards_Deep_Video_Compression_in_Feature_CVPR_2021_paper.pdf)

- (CVPR 21) **Deep Perceptual Preprocessing for Video Coding**, Aaron Chadha, Yiannis Andreopoulos. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Chadha_Deep_Perceptual_Preprocessing_for_Video_Coding_CVPR_2021_paper.pdf)

- (CVPR 21) **Deep learning in latent space for video prediction and compression**, Liu, Bowen and Chen, Yu and Liu, Shiyu and Kim, Hun-Seok. [[paper]](https://openaccess.thecvf.com/content/CVPR2021/papers/Liu_Deep_Learning_in_Latent_Space_for_Video_Prediction_and_Compression_CVPR_2021_paper.pdf)

- (ICIP 21) **Variable-Rate Video Compression[C]//2021 IEEE International Conference on Image Processing**, Lin, Jianping and Liu, Dong and Liang, Jie and Li, Houqiang and Wu, Feng. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9506269) VR

- (VCIP 21) **DVC-P: Deep Video Compression with Perceptual Optimizations**, Zhang, Saiping and Mrak, Marta and Herranz, Luis and Blanch, Marc G{\'o}rriz and Wan, Shuai and Yang, Fuzheng. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9675350)

- (MTICTI 2021) **Review and Evaluation of End-to-End Video Compression with Deep-Learning**, Yasin, Hajar Maseeh and Ameen, Siddeeq Yosef. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9664790)

- (arXiv preprint 2021) **Deep Video Coding with Dual-Path Generative Adversarial Network**, Zhao, Tiesong and Feng, Weize and Zeng, Hongji and Niu, Yuzhen and Liu, Jiaying. [[paper]](https://arxiv.org/pdf/2111.14474.pdf)
  
- (arXiv preprint 2021) **Versatile Learned Video Compression**, Feng, Runsen and Guo, Zongyu and Zhang, Zhizheng and Chen, Zhibo. [[paper]](https://arxiv.org/pdf/2111.03386.pdf)
  
- (arXiv preprint 2021) **A. Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression**, Brand, Fabian and Seiler, J{\"u}rgen and Kaup, Andr{\'e}. [[paper]](https://arxiv.org/pdf/2112.08011.pdf)


- (arXiv preprint 2021) **Implicit Neural Video Compression**, Zhang, Yunfan and van Rozendaal, Ties and Brehmer, Johann and Nagel, Markus and Cohen, Taco. [[paper]](https://arxiv.org/pdf/2112.11312.pdf)

- (arXiv preprint 2021) **Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression**, Guo, Zongyu and Feng, Runsen and Zhang, Zhizheng and Jin, Xin and Chen, Zhibo. [[paper]](https://arxiv.org/pdf/2110.01805.pdf) MV

- (arXiv preprint 2021) **Learning Cross-Scale Prediction for Efficient Neural Video Compression**, Paul, Somdyuti and Norkin, Andrey and Bovik, Alan C. [[paper]](https://arxiv.org/pdf/2112.13309.pdf) MV
  
- (arXiv preprint 2021) **Neural Video Compression using GANs for Detail Synthesis and Propagation**, Mentzer, Fabian and Agustsson, Eirikur and Ball{\'e}, Johannes and Minnen, David and Johnston, Nick and Toderici, George. [[paper]](https://arxiv.org/pdf/2107.12038.pdf) 

- (arXiv preprint 2021) **Neural weight step video compression**, Czerkawski, Mikolaj and Cardona, Javier and Atkinson, Robert and Michie, Craig and Andonovic, Ivan and Clemente, Carmine and Tachtatzis, Christos. [[paper]](https://arxiv.org/pdf/2112.01504.pdf) 

- (arXiv preprint 2021) **Perceptual Learned Video Compression with Recurrent Conditional GAN**, Yang, Ren and Van Gool, Luc and Timofte, Radu. [[paper]](https://arxiv.org/pdf/2109.03082.pdf) 


---

## <span id="2020">✔2020 </span> [       «🎯Back To Top»       ](#)

---

- (AAAI 20) **Learned video compression via joint spatial-temporal correlation exploration**, Yang, Ren and Mentzer, Fabian and Gool, Luc Van and Timofte, Radu. [[paper]](https://ojs.aaai.org/index.php/AAAI/article/view/6825/6679) 


- (CVPR 20) **Learning for video compression with hierarchical quality and recurrent enhancement**, Liu, Haojie and Shen, Han and Huang, Lichao and Lu, Ming and Chen, Tong and Ma, Zhan. [[paper]](https://openaccess.thecvf.com/content_CVPR_2020/papers/Yang_Learning_for_Video_Compression_With_Hierarchical_Quality_and_Recurrent_Enhancement_CVPR_2020_paper.pdf) 
  
- (CVPR 20) **M-LVC: Multiple frames prediction for learned video compression**, Lin, Jianping and Liu, Dong and Li, Houqiang and Wu, Feng. [[paper]](https://openaccess.thecvf.com/content_CVPR_2020/papers/Lin_M-LVC_Multiple_Frames_Prediction_for_Learned_Video_Compression_CVPR_2020_paper.pdf)

- (CVPR 20) **Learned video compression with feature-level residuals**, Feng R, Wu Y, Guo Z, et al. [[paper]](https://openaccess.thecvf.com/content_CVPRW_2020/papers/w7/Feng_Learned_Video_Compression_With_Feature-Level_Residuals_CVPRW_2020_paper.pdf)

- (ACCV 20) **Feedback recurrent autoencoder for video compression**, Lin, Golinski, Adam and Pourreza, Reza and Yang, Yang and Sautiere, Guillaume and Cohen, Taco S. [[paper]](https://openaccess.thecvf.com/content/ACCV2020/papers/Golinski_Feedback_Recurrent_Autoencoder_for_Video_Compression_ACCV_2020_paper.pdf)

- (CSUR 20) **Deep learning-based video coding: A review and a case study**, Liu, Dong and Li, Yue and Lin, Jianping and Li, Houqiang and Wu, Feng. [[paper]](https://dl.acm.org/doi/pdf/10.1145/3368405)
  
---

## <span id="2019">✔2019 </span> [       «🎯Back To Top»       ](#)

---

- (ICCV 19) **Dvc: An end-to-end deep video compression framework**, Lu, Guo and Ouyang, Wanli and Xu, Dong and Zhang, Xiaoyun and Cai, Chunlei and Gao, Zhiyong. [[paper]](https://openaccess.thecvf.com/content_CVPR_2019/papers/Lu_DVC_An_End-To-End_Deep_Video_Compression_Framework_CVPR_2019_paper.pdf) 
  
- (ICCV 19) **Learned video compression**, Rippel, Oren and Nair, Sanjay and Lew, Carissa and Branson, Steve and Anderson, Alexander G and Bourdev, Lubomir. [[paper]](https://openaccess.thecvf.com/content_ICCV_2019/papers/Rippel_Learned_Video_Compression_ICCV_2019_paper.pdf) 

- (NIPS 19) **Deep generative video compression**, Lombardo, Salvator and Han, Jun and Schroers, Christopher and Mandt, Stephan. [[paper]](https://proceedings.neurips.cc/paper/2019/file/f1ea154c843f7cf3677db7ce922a2d17-Paper.pdf) 




  
- (TCSVT 19) **Image and video compression with neural networks: A review**, Ma, Siwei and Zhang, Xinfeng and Jia, Chuanmin and Zhao, Zhenghui and Wang, Shiqi and Wang, Shanshe. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8693636) 



---

## <span id="2018">✔2018 </span> [       «🎯Back To Top»       ](#)

---

- (ECCV 18) **Video compression through image interpolation**, Wu, Chao-Yuan and Singhal, Nayan and Krahenbuhl, Philipp. [[paper]](https://openaccess.thecvf.com/content_ECCV_2018/papers/Chao-Yuan_Wu_Video_Compression_through_ECCV_2018_paper.pdf) 



---

## <span id="2017">✔2017 </span> [       «🎯Back To Top»       ](#)

---

- (VCIP 17) **Video compression based on spatio-temporal resolution adaptation**, Afonso, Mariana and Zhang, Fan and Bull, David R. [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=8517114) 

---
Download .txt
gitextract_de1i0u7m/

└── README.md
Condensed preview — 1 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (50K chars).
[
  {
    "path": "README.md",
    "chars": 49669,
    "preview": "\n# <p align=center> Awesome 🎉Deep Learning Based Video Compression </p>\n<!--# <p align=center>`# Awesome 🎉Deep Learning "
  }
]

About this extraction

This page contains the full source code of the ppingzhang/Awesome-Deep-Learning-Based-Video-Compression GitHub repository, extracted and formatted as plain text for AI agents and large language models (LLMs). The extraction includes 1 files (48.5 KB), approximately 16.0k tokens. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.

Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.

Copied to clipboard!