This Space demonstrates DocScanner, a deep learning model that automatically corrects geometric distortions in document images.

If your photo of a document is warped, skewed, or curled at the edges, this tool can transform it into a flat, top-down, scanner-like image.

This application implements the method from the research paper "DocScanner: Robust Document Image Rectification with Progressive Learning" (https://arxiv.org/abs/2110.14968).

How to Use

Upload an Image: Drag and drop a distorted document image into the input box, or click to browse your files.
Submit: Click the "Submit" button to begin the rectification process.
View the Result: The corrected, flattened document will appear in the output box on the right.
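
For readers curious how an upload, submit, and view flow like the one above is typically wired up in Gradio, here is a minimal sketch. It is not this Space's actual source: `rectify` is a hypothetical placeholder that returns its input unchanged, and `build_demo` and the component labels are illustrative names chosen for this example.

```python
def rectify(image):
    """Hypothetical stand-in for the model call.

    With gr.Image(type="numpy"), Gradio passes the upload as an
    H x W x 3 uint8 NumPy array. A real implementation would run the
    rectification model here; this placeholder returns the input as-is.
    """
    return image


def build_demo():
    # Imported lazily so that `rectify` can be exercised without Gradio installed.
    import gradio as gr

    # gr.Interface lays out the input on the left, the output on the right,
    # and adds a "Submit" button that calls `fn` with the uploaded image.
    return gr.Interface(
        fn=rectify,
        inputs=gr.Image(type="numpy", label="Distorted document"),
        outputs=gr.Image(type="numpy", label="Rectified document"),
        title="DocScanner: Document Image Rectification",
    )


if __name__ == "__main__":
    build_demo().launch()
```

Running the script starts a local web server; swapping the body of `rectify` for a real model call is the only change needed to turn the placeholder into a working demo.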

Technical Details

Model: This demo uses the DocScanner-L model, as described in the paper.
Technology: The application is built with Python, PyTorch, and the Gradio library.
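
As an illustration of the PyTorch side, the sketch below shows one common way a rectification network's output is applied to an image. It assumes the model predicts a backward mapping, that is, for every output pixel, the normalized coordinates in the distorted input to sample from. That is an assumption made for this example, not something stated above. The helper names `identity_backward_map` and `warp` are hypothetical, and an identity grid stands in for a real model prediction.

```python
import torch
import torch.nn.functional as F


def identity_backward_map(height: int, width: int) -> torch.Tensor:
    """Return a (1, H, W, 2) sampling grid in [-1, 1] that leaves an image unchanged.

    Stand-in for a model prediction (assumption: the model outputs such a grid).
    """
    ys = torch.linspace(-1.0, 1.0, height)
    xs = torch.linspace(-1.0, 1.0, width)
    grid_y, grid_x = torch.meshgrid(ys, xs, indexing="ij")
    # grid_sample expects (x, y) order in the last dimension.
    return torch.stack((grid_x, grid_y), dim=-1).unsqueeze(0)


def warp(image: torch.Tensor, backward_map: torch.Tensor) -> torch.Tensor:
    """Resample a (1, C, H, W) image at the locations given by a (1, H_out, W_out, 2) grid."""
    return F.grid_sample(image, backward_map, mode="bilinear", align_corners=True)


if __name__ == "__main__":
    image = torch.rand(1, 3, 64, 48)          # dummy distorted image
    grid = identity_backward_map(64, 48)      # a real model would predict this
    rectified = warp(image, grid)
    print(rectified.shape)
```

With `align_corners=True`, the grid values -1 and 1 refer to the centers of the corner pixels, so the identity grid reproduces the input up to floating-point error. A predicted grid would instead pull each output pixel from wherever the corresponding point of the page sits in the distorted photo.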
