Problem: Photo of document from angle
Solution: Perspective transform to frontal view
Steps: detect the document corners → order them (TL, TR, BR, BL) → compute the homography → warp
Result: Rectangular, readable document
# Detect the four corners of a document photographed at an angle.
# Assumes `img` is the input BGR photo, loaded elsewhere (e.g. cv2.imread).
import cv2
import numpy as np

# Grayscale + fixed threshold so the (bright) document separates from
# the background as a single binary region.
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
_, binary = cv2.threshold(gray, 127, 255, cv2.THRESH_BINARY)

# External contours only; the document should be the largest one.
contours, _ = cv2.findContours(binary, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
largest = max(contours, key=cv2.contourArea)

# Approximate the contour by a polygon; epsilon at 2% of the perimeter
# is the usual tolerance for collapsing a noisy outline to its corners.
epsilon = 0.02 * cv2.arcLength(largest, True)
approx = cv2.approxPolyDP(largest, epsilon, True)
# If len(approx) == 4, the polygon is a quadrilateral -> the document.
def order_points(pts):
    """Order 4 corner points as top-left, top-right, bottom-right, bottom-left.

    Uses two scalar signatures of each (x, y) point:
    - the sum x + y is smallest at the top-left, largest at the bottom-right;
    - the difference y - x is smallest at the top-right, largest at the
      bottom-left.

    Parameters:
        pts: array of shape (4, 2) with corner coordinates in any order.

    Returns:
        float32 array of shape (4, 2) ordered TL, TR, BR, BL.
    """
    rect = np.zeros((4, 2), dtype='float32')

    s = pts.sum(axis=1)
    rect[0] = pts[np.argmin(s)]   # top-left
    rect[2] = pts[np.argmax(s)]   # bottom-right

    diff = np.diff(pts, axis=1)   # y - x per row
    rect[1] = pts[np.argmin(diff)]  # top-right
    rect[3] = pts[np.argmax(diff)]  # bottom-left
    return rect


# usage: ordered_pts = order_points(approx.reshape(4, 2))
# Compute the output rectangle size from the ordered corner points
# (produced by order_points: TL, TR, BR, BL).
(tl, tr, br, bl) = ordered_pts

# Output width: the larger of the bottom and top edge lengths.
widthA = np.sqrt((br[0] - bl[0]) ** 2 + (br[1] - bl[1]) ** 2)
widthB = np.sqrt((tr[0] - tl[0]) ** 2 + (tr[1] - tl[1]) ** 2)
maxWidth = int(max(widthA, widthB))

# Output height: the larger of the right and left edge lengths.
heightA = np.sqrt((tr[0] - br[0]) ** 2 + (tr[1] - br[1]) ** 2)
heightB = np.sqrt((tl[0] - bl[0]) ** 2 + (tl[1] - bl[1]) ** 2)
maxHeight = int(max(heightA, heightB))
# Map the detected document corners onto an axis-aligned rectangle.
src = ordered_pts  # detected corners, ordered TL, TR, BR, BL

# Destination corners in the same TL, TR, BR, BL order; -1 because pixel
# indices run 0 .. size-1.
dst = np.array([[0, 0],
                [maxWidth - 1, 0],
                [maxWidth - 1, maxHeight - 1],
                [0, maxHeight - 1]], dtype='float32')

# Homography from the 4 point pairs, then warp to the frontal view.
M = cv2.getPerspectiveTransform(src, dst)
warped = cv2.warpPerspective(img, M, (maxWidth, maxHeight))
Purpose: Top-down view of scene
Use case: Lane detection, parking assistance, sports analysis
Method: Same as document scan
# Bird's-eye (top-down) view of the road region ahead of the camera.
h, w = img.shape[:2]

# Source: a trapezoid over the road — narrow near the horizon (60% down
# the frame), full-width at the bottom. Fractions are scene-dependent
# and typically tuned per camera mounting.
src = np.float32([[w * 0.45, h * 0.6],
                  [w * 0.55, h * 0.6],
                  [w * 0.9, h],
                  [w * 0.1, h]])

# Destination: the full output rectangle.
dst = np.float32([[0, 0],
                  [w, 0],
                  [w, h],
                  [0, h]])

M = cv2.getPerspectiveTransform(src, dst)
birds_eye = cv2.warpPerspective(img, M, (w, h))
Purpose: Map bird's eye view back to original perspective
Method: Use inverse of homography matrix
# Forward transform: original view -> bird's-eye view.
M = cv2.getPerspectiveTransform(src, dst)
birds_eye = cv2.warpPerspective(img, M, (w, h))

# Inverse transform: bird's-eye view -> original perspective.
# Swapping the point sets yields the inverse homography directly.
M_inv = cv2.getPerspectiveTransform(dst, src)
original = cv2.warpPerspective(birds_eye, M_inv, (w, h))

# Equivalently, invert the 3x3 homography matrix itself:
M_inv = np.linalg.inv(M)
# Q: What is a perspective transform used for?
# A: Document scanning, bird's-eye view, image rectification.
# Q: How many points are needed?
# A: 4 point pairs (source and destination).
# Q: What is a homography?
# A: A 3×3 perspective transformation matrix.
# Q: Why order the corner points?
# A: To ensure the correct TL, TR, BR, BL correspondence between source and destination.
# Q: What is a bird's-eye view?
# A: A top-down perspective of the scene.
# Q: How is the output rectangle size computed?
# A: Measure the edge lengths of the source quadrilateral and take the max width/height.
# Q: What is an inverse perspective transform?
# A: Transforming back from the warped view to the original view.
# Q: How is the inverse homography obtained?
# A: cv2.getPerspectiveTransform(dst, src), or np.linalg.inv(M).
# Q: What are the document-scan steps?
# A: Detect corners, order the points, compute the homography, warp.
# Minimal perspective-transform example: map 4 explicit source points
# onto the corners of a 300x300 output image.
pts1 = np.float32([[56, 65], [368, 52], [28, 387], [389, 390]])
pts2 = np.float32([[0, 0], [300, 0], [0, 300], [300, 300]])

M = cv2.getPerspectiveTransform(pts1, pts2)
result = cv2.warpPerspective(img, M, (300, 300))
def order_points(pts):
    """Sort 4 corner points into TL, TR, BR, BL order.

    x + y is minimal at the top-left and maximal at the bottom-right;
    y - x is minimal at the top-right and maximal at the bottom-left.

    Parameters:
        pts: (4, 2) array of corner coordinates in arbitrary order.

    Returns:
        (4, 2) float32 array ordered TL, TR, BR, BL.
    """
    rect = np.zeros((4, 2), dtype='float32')

    s = pts.sum(axis=1)
    rect[0] = pts[np.argmin(s)]     # TL: smallest x + y
    rect[2] = pts[np.argmax(s)]     # BR: largest x + y

    diff = np.diff(pts, axis=1)     # y - x per row
    rect[1] = pts[np.argmin(diff)]  # TR: smallest y - x
    rect[3] = pts[np.argmax(diff)]  # BL: largest y - x
    return rect
# Bird's-eye view transform and its inverse, end to end.
h, w = img.shape[:2]

# Road trapezoid (source) -> full output rectangle (destination).
src = np.float32([[w * 0.45, h * 0.6], [w * 0.55, h * 0.6],
                  [w * 0.9, h], [w * 0.1, h]])
dst = np.float32([[0, 0], [w, 0], [w, h], [0, h]])

M = cv2.getPerspectiveTransform(src, dst)
birds_eye = cv2.warpPerspective(img, M, (w, h))

# Inverse: swap the point sets to map the bird's-eye image back to the
# original camera perspective.
M_inv = cv2.getPerspectiveTransform(dst, src)
original = cv2.warpPerspective(birds_eye, M_inv, (w, h))
Google tag (gtag.js)