Repository: JianzeLi-114/FluxSR
Branch: master
Commit: 31a234d5e708
Files: 1
Total size: 4.1 KB

Directory structure:
gitextract_aauqd5qw/

└── README.md

================================================
FILE CONTENTS
================================================

================================================
FILE: README.md
================================================
# One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation

[Jianze Li], [Jiezhang Cao](https://www.jiezhangcao.com/), [Yong Guo](https://www.guoyongcs.com/), [Wenbo Li](https://fenglinglwb.github.io/), and [Yulun Zhang*](http://yulunzhang.com/), "One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation", ICML, 2025

[[project]] [[arXiv](https://arxiv.org/abs/2502.01993)] [supplementary material] [pretrained models]


#### 🔥🔥🔥 News

- **2025-02-03:** This repo is released.

---

> **Abstract:** Diffusion models (DMs) have significantly advanced the development of real-world image super-resolution (Real-ISR), but the computational cost of multi-step diffusion models limits their application. One-step diffusion models generate high-quality images in a one sampling step, greatly reducing computational overhead and inference latency. However, most existing one-step diffusion methods are constrained by the performance of the teacher model, where poor teacher performance results in image artifacts. To address this limitation, we propose FluxSR, a novel one-step diffusion Real-ISR technique based on flow matching models. We use the state-of-the-art diffusion model FLUX.1-dev as both the teacher model and the base model. First, we introduce Flow Trajectory Distillation (FTD) to distill a multi-step flow matching model into a one-step Real-ISR. Second, to improve image realism and address high-frequency artifact issues in generated images, we propose TV-LPIPS as a perceptual loss and introduce Attention Diversification Loss (ADL) as a regularization term to reduce token similarity in transformer, thereby eliminating high-frequency artifacts. Comprehensive experiments demonstrate that our method outperforms existing one-step diffusion-based Real-ISR methods.

![](figs/teaser.png)

---

### Pipeline

![](figs/pipeline.png)

---

## 🔖 TODO

- [ ] Release testing code and pre-trained models.
- [ ] Release training code. 
- [ ] Release pre-trained models.
- [ ] Provide HuggingFace demo.

## 🔗 Contents

1. Models
1. Training
1. Testing
1. [Results](#results)
1. [Citation](#citation)
1. [Acknowledgements](#acknowledgements)

## <a name="results"></a>🔎 Results

We achieve impressive performance on Real-world Image Super-Resolution. The full results could be downloaded here: [Google Drive](https://drive.google.com/drive/folders/1olqumLOpazfSF4TGTFplO6mF0xKIWdBI?usp=drive_link)

<details>
<summary>Quantitative Results (click to expand)</summary>

- Results in Table 1 of the main paper

<p align="center">
  <img width="900" src="figs/table1.png">
</p>

- Results in Table 2 (RealSet65 testset) of the main paper

<p align="center">
  <img width="450" src="figs/table2.png">
</p>

- Quantitative results (×4) on the Real-ISR testset with ground truth.

| Datasets | PSNR ↑ | SSIM ↑ | LPIPS ↓ | DISTS ↓ | MUSIQ ↑ | MANIQA ↑ | TOPIQ ↑ | QAlign ↑ |
|----------|-------|-------|--------|--------|--------|---------|--------|---------|
| RealSR   | 24.83  | 0.7175 | 0.3200  | 0.1910  | 68.95   | 0.5335   | 0.6699  | 4.3781   |
| DRealSR  | 25.92  | 0.7592 | 0.3418  | 0.1628  | 37.82   | 0.5310   | -       | 4.3356   |

- Quantitative results (×4) on the Real-ISR testset without ground truth.

| Datasets  | MUSIQ ↑ | MANIQA ↑ | TOPIQ ↑ | QAlign ↑ |
|-----------|--------|---------|--------|---------|
| RealLR200 |  71.60  |  0.5588  | 0.6814  |  4.4004  |
| RealLQ250 |  72.65  |  0.5490  | 0.6848  |  4.4077  |


</details>

<details>
<summary>Qualitative Results (click to expand)</summary>

- Results in Figure 5 of the main paper

<p align="center">
  <img width="900" src="figs/visual.png">
</p>


</details>

## <a name="citation"></a>📎 Citation

If you find the code helpful in your research or work, please cite the following paper(s).

```
@inproceedings{li2025one,
  title={One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation},
  author={Li, Jianze and Cao, Jiezhang and Guo, Yong and Li, Wenbo and Zhang, Yulun},
  booktitle={ICML},
  year={2025}
}
```

## <a name="acknowledgements"></a>💡 Acknowledgements

This project is based on [FLUX](https://github.com/black-forest-labs/flux).