PromptRR

PromptRR is a diffusion-based model that uses frequency information as new visual prompts for single-image reflection removal.

Current single image reflection removal (SIRR) methods using deep learning tend to miss key low-frequency (LF) and high-frequency (HF) differences in images, affecting their effectiveness in removing reflections. To address this problem, this paper proposes a novel prompt-guided reflection removal (PromptRR) framework that uses frequency information as new visual prompts for better performance. Specifically, the proposed framework decouples the reflection removal process into the prompt generation and subsequent prompt-guided restoration. For the prompt generation, we first propose a prompt pre-training strategy to train a frequency prompt encoder that encodes the ground-truth image into LF and HF prompts. Then, we adopt diffusion models (DMs) as prompt generators to generate the LF and HF prompts estimated by the pre-trained frequency prompt encoder. For the prompt-guided restoration, we integrate specially generated prompts into the PromptFormer network, employing a novel Transformer-based prompt block to effectively steer the model toward enhanced reflection removal. The results on commonly used benchmarks show that our method outperforms state-of-the-art approaches.

An overview of our PromptRR. It includes two main stages: (a) prompt pre-training stage and (b) prompt generation and restoration stage. Notably, we do not use the ground-truth image (GT) in the inference stage.

The overview of FPE and PromptFormer for the prompt pre-training. FPE includes a wavelet transform and a dual-branch encoder. PromptFormer is mainly built by a transformer-based prompt block consisting of the prompt multi-head self-attention (PMSA) and the prompt feed-forward network (PFFN). WT and IWT refer to the wavelet transform and inverse wavelet transform respectively.

Comparison of quantitative results on commonly used real-world datasets in terms of PSNR and SSIM. Bold and underline indicate the best and second-best results.

Visual comparison on Nature dataset. Compared with other methods, our PromptRR effectively removes reflections while preserving the fine details in the restored images.

Visual comparison on Real dataset. The reflection results of comparison methods still contain significant reflection effects. Our PromptRR removes the reflection areas more accurately and thoroughly.

Abstract

Framework

More results

Paper and Supplementary Material

Acknowledgements