Repository: JianzeLi-114/FluxSR
Branch: master
Commit: 31a234d5e708
Files: 1
Total size: 4.1 KB
Directory structure:
gitextract_aauqd5qw/
└── README.md
================================================
FILE CONTENTS
================================================
================================================
FILE: README.md
================================================
# One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
[Jianze Li], [Jiezhang Cao](https://www.jiezhangcao.com/), [Yong Guo](https://www.guoyongcs.com/), [Wenbo Li](https://fenglinglwb.github.io/), and [Yulun Zhang*](http://yulunzhang.com/), "One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation", ICML, 2025
[[project]] [[arXiv](https://arxiv.org/abs/2502.01993)] [supplementary material] [pretrained models]
#### 🔥🔥🔥 News
- **2025-02-03:** This repo is released.
---
> **Abstract:** Diffusion models (DMs) have significantly advanced the development of real-world image super-resolution (Real-ISR), but the computational cost of multi-step diffusion models limits their application. One-step diffusion models generate high-quality images in a one sampling step, greatly reducing computational overhead and inference latency. However, most existing one-step diffusion methods are constrained by the performance of the teacher model, where poor teacher performance results in image artifacts. To address this limitation, we propose FluxSR, a novel one-step diffusion Real-ISR technique based on flow matching models. We use the state-of-the-art diffusion model FLUX.1-dev as both the teacher model and the base model. First, we introduce Flow Trajectory Distillation (FTD) to distill a multi-step flow matching model into a one-step Real-ISR. Second, to improve image realism and address high-frequency artifact issues in generated images, we propose TV-LPIPS as a perceptual loss and introduce Attention Diversification Loss (ADL) as a regularization term to reduce token similarity in transformer, thereby eliminating high-frequency artifacts. Comprehensive experiments demonstrate that our method outperforms existing one-step diffusion-based Real-ISR methods.

---
### Pipeline

---
## 🔖 TODO
- [ ] Release testing code and pre-trained models.
- [ ] Release training code.
- [ ] Release pre-trained models.
- [ ] Provide HuggingFace demo.
## 🔗 Contents
1. Models
1. Training
1. Testing
1. [Results](#results)
1. [Citation](#citation)
1. [Acknowledgements](#acknowledgements)
## <a name="results"></a>🔎 Results
We achieve impressive performance on Real-world Image Super-Resolution. The full results could be downloaded here: [Google Drive](https://drive.google.com/drive/folders/1olqumLOpazfSF4TGTFplO6mF0xKIWdBI?usp=drive_link)
<details>
<summary>Quantitative Results (click to expand)</summary>
- Results in Table 1 of the main paper
<p align="center">
<img width="900" src="figs/table1.png">
</p>
- Results in Table 2 (RealSet65 testset) of the main paper
<p align="center">
<img width="450" src="figs/table2.png">
</p>
- Quantitative results (×4) on the Real-ISR testset with ground truth.
| Datasets | PSNR ↑ | SSIM ↑ | LPIPS ↓ | DISTS ↓ | MUSIQ ↑ | MANIQA ↑ | TOPIQ ↑ | QAlign ↑ |
|----------|-------|-------|--------|--------|--------|---------|--------|---------|
| RealSR | 24.83 | 0.7175 | 0.3200 | 0.1910 | 68.95 | 0.5335 | 0.6699 | 4.3781 |
| DRealSR | 25.92 | 0.7592 | 0.3418 | 0.1628 | 37.82 | 0.5310 | - | 4.3356 |
- Quantitative results (×4) on the Real-ISR testset without ground truth.
| Datasets | MUSIQ ↑ | MANIQA ↑ | TOPIQ ↑ | QAlign ↑ |
|-----------|--------|---------|--------|---------|
| RealLR200 | 71.60 | 0.5588 | 0.6814 | 4.4004 |
| RealLQ250 | 72.65 | 0.5490 | 0.6848 | 4.4077 |
</details>
<details>
<summary>Qualitative Results (click to expand)</summary>
- Results in Figure 5 of the main paper
<p align="center">
<img width="900" src="figs/visual.png">
</p>
</details>
## <a name="citation"></a>📎 Citation
If you find the code helpful in your research or work, please cite the following paper(s).
```
@inproceedings{li2025one,
title={One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation},
author={Li, Jianze and Cao, Jiezhang and Guo, Yong and Li, Wenbo and Zhang, Yulun},
booktitle={ICML},
year={2025}
}
```
## <a name="acknowledgements"></a>💡 Acknowledgements
This project is based on [FLUX](https://github.com/black-forest-labs/flux).
gitextract_aauqd5qw/ └── README.md
Condensed preview — 1 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (4K chars).
[
{
"path": "README.md",
"chars": 4205,
"preview": "# One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation\n\n[Jianze Li], [Jiezhang Cao](https:"
}
]
About this extraction
This page contains the full source code of the JianzeLi-114/FluxSR GitHub repository, extracted and formatted as plain text for AI agents and large language models (LLMs). The extraction includes 1 files (4.1 KB), approximately 1.3k tokens. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.
Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.