PromptRR: Diffusion Models as Prompt Generators for
Single Image Reflection Removal
Tao Wang
Wanglong Lu
Kaihao Zhang
Wenhan Luo
Tae-Kyun Kim
Tong Lu
Hongdong Li
Ming-Hsuan Yang
[Paper]
[GitHub]


PromptRR is a diffusion-based model that uses frequency information as new visual prompts for single-image reflection removal.

Abstract

Current single image reflection removal (SIRR) methods using deep learning tend to miss key low-frequency (LF) and high-frequency (HF) differences in images, affecting their effectiveness in removing reflections. To address this problem, this paper proposes a novel prompt-guided reflection removal (PromptRR) framework that uses frequency information as new visual prompts for better performance. Specifically, the proposed framework decouples the reflection removal process into the prompt generation and subsequent prompt-guided restoration. For the prompt generation, we first propose a prompt pre-training strategy to train a frequency prompt encoder that encodes the ground-truth image into LF and HF prompts. Then, we adopt diffusion models (DMs) as prompt generators to generate the LF and HF prompts estimated by the pre-trained frequency prompt encoder. For the prompt-guided restoration, we integrate specially generated prompts into the PromptFormer network, employing a novel Transformer-based prompt block to effectively steer the model toward enhanced reflection removal. The results on commonly used benchmarks show that our method outperforms state-of-the-art approaches.


Framework

An overview of our PromptRR. It includes two main stages: (a) prompt pre-training stage and (b) prompt generation and restoration stage. Notably, we do not use the ground-truth image (GT) in the inference stage.


The overview of FPE and PromptFormer for the prompt pre-training. FPE includes a wavelet transform and a dual-branch encoder. PromptFormer is mainly built by a transformer-based prompt block consisting of the prompt multi-head self-attention (PMSA) and the prompt feed-forward network (PFFN). WT and IWT refer to the wavelet transform and inverse wavelet transform respectively.


More results

Comparison of quantitative results on commonly used real-world datasets in terms of PSNR and SSIM. Bold and underline indicate the best and second-best results.


Visual comparison on Nature dataset. Compared with other methods, our PromptRR effectively removes reflections while preserving the fine details in the restored images.


Visual comparison on Real dataset. The reflection results of comparison methods still contain significant reflection effects. Our PromptRR removes the reflection areas more accurately and thoroughly.



 [GitHub]


Paper and Supplementary Material

Tao Wang, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yang.
PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal (hosted on ArXiv)


[Bibtex]


Acknowledgements

This template was originally made by Phillip Isola and Richard Zhang for a colorful ECCV project; Thanks for their awesome work!