NVIDIA Technical Blog

Fast Inversion for Real-Time Image Editing with Text

thumbnail
  • Introduction
    • RNRI enables real-time image editing by balancing rapid convergence with superior accuracy, execution time, and memory efficiency.
  • Inversion as Solving an Implicit Equation
    • DDIM inversion involves searching for a seed that reconstructs a given image using a diffusion model.
  • DDIM Inversion
    • It denoises a latent noise vector using an implicit equation that cannot be solved in closed form.
  • Regularized Newton-Raphson Inversion Method
    • NR method is faster and more accurate for solving implicit functions.
    • A regularization term is added to improve reconstruction of images.
  • Performance Comparison
    • RNRI demonstrates improved PSNR, run time, and image fidelity over existing methods.
  • Image Editing with RNRI
    • RNRI performs well in accurately editing images based on text prompts.
  • Conclusion
    • RNRI outperforms existing approaches, achieving state-of-the-art image editing with text in real images.