歪み補正・台形補正

歪み補正・台形補正#

まっすぐ撮影できずに台形になっているような状態はperspective歪みなどと呼ばれる

射影変換 （projective transformation）によって台形補正を行う

非線形な歪みの補正はできない。例えば書類をスキャンするときに隙間が空いていて、たわんでいる状態でスキャンした文書の補正はできない。

方法#

例えば画像内に長方形になるべきものが写っている場合、4点を検出して、まっすぐな歪みのない長方形にアフィン変換してやればよい。

点はkeypoints検出や直線検出の結果をもとに推定する

局所対応点（keypoints）を検出して、4点にまとめる
- 例えばコーナー検出（corner detection）によって「角っぽい点」を全部検知してから適切にフィルタリング・統合する
線が直交する点が角なので、線を検出して直交点を推測する
- 例えばハフ変換。ただし、太い線だと並行に検出するとは限らない。候補の線を全部出してしまう。Cannyなどで細線化する方法があるが、細線化が失敗すると後の工程が全部失敗するリスクもある
- 例えばLSD（Line Segment Detection）

画像の例#

../_images/9ae62f1feca0ea64d747521ffb6151ff97e644c4ddf1227893829e6730a61e25.png

前処理：台形の輪郭線を取り出す#

# 前処理：台形の輪郭線を取り出す

# グレースケール画像の取得
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# 二値化
_, binary = cv2.threshold(gray, 128, 255, cv2.THRESH_BINARY_INV)

# モルフォロジー変換：膨張と収縮を繰り返して線を抽出
kernel_length = max(1, img.shape[1] // 80)  # カーネルのサイズを画像の幅に基づいて決定
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (10, 1))
morph = cv2.morphologyEx(binary, cv2.MORPH_CLOSE, kernel, iterations=2)

plt.imshow(morph, cmap="gray")
plt.show()

../_images/f0172c4ae6b68f97814986a8a13859863236c22a8b12f25a92dfb409bebce1f5.png

contours, _ = cv2.findContours(morph, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)

# 輪郭を描画
contour_img = np.zeros_like(gray)
cv2.drawContours(contour_img, contours, -1, (255, 255, 255), 2)

plt.imshow(contour_img, cmap="gray")
plt.show()

../_images/53d017db7cec5527b8c9eddee606273bf87da9ddc735178e794f8126cc542acb.png

前処理：平滑化#

# 前処理2

# 白黒を反転
contour_img = 255 - contour_img

# 線がギザギザしているのでガウシアンブラーで平滑化
contour_img = cv2.GaussianBlur(contour_img, (5, 5), 0)

plt.imshow(contour_img, cmap="gray")
plt.show()

../_images/075809c30ea58ba61cacd5700706b213d681b8f7f9b5594e0c31a1815a1b77e6.png

局所対応点を検出する方法#

Harrisによるコーナー検出#

# Harris corner detection
dst = cv2.cornerHarris(np.float32(contour_img), blockSize=2, ksize=3, k=0.04)
# Dilate the result to mark the corners
dst = cv2.dilate(dst, None)
# Threshold for an optimal value, marking the corners in the original image
img_harris = img.copy()
img_harris[dst > 0.01 * dst.max()] = [0, 0, 255]

plt.imshow(img_harris)

<matplotlib.image.AxesImage at 0x7f3403248400>

../_images/80a066aada3cb0da769e3e1e073d9a1e5e0f850c50695d8f3679f989cd5ace6d.png

FASTによるコーナー検出#

FASTは16x16ピクセルの範囲ごとにグレースケール画像の濃淡の領域を分割するように決定木を構築してコーナーを検出する手法

OpenCV: FAST Algorithm for Corner Detection

fast = cv2.FastFeatureDetector_create(threshold=100)
keypoints = fast.detect(contour_img, None)

# 検出されたコーナーを画像に描画
image_with_keypoints = cv2.drawKeypoints(img, keypoints, None, color=(255, 0, 0))

plt.imshow(image_with_keypoints)
plt.title("Corners Detected by FAST")
plt.show()

../_images/a30f31a3500c532209512201ce3c8d8f4dfe2efb4c33a7e7d8db2858b89d1bb9.png

検出したコーナーたちを4点にまとめる#

# コーナーの座標を取得
corners = cv2.KeyPoint_convert(keypoints)
corners

array([[101.,  99.],
       [403.,  99.],
       [104., 102.],
       [400., 102.],
       [ 55., 498.],
       [449., 498.],
       [ 51., 501.],
       [453., 501.]], dtype=float32)

from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=4)
kmeans.fit(corners)
centroids = kmeans.cluster_centers_
centroids

array([[401.5, 100.5],
       [ 53. , 499.5],
       [451. , 499.5],
       [102.5, 100.5]], dtype=float32)

values = centroids[:, 0] + centroids[:, 1]
idx = np.argsort(values)
centroids = centroids[idx]
centroids

array([[102.5, 100.5],
       [401.5, 100.5],
       [ 53. , 499.5],
       [451. , 499.5]], dtype=float32)

img_centroids = image_with_keypoints.copy()
for centroid in centroids:
    cv2.circle(img_centroids, tuple(centroid.astype("uint32")), radius=5, color=(0, 0, 255), thickness=2)

plt.imshow(img_centroids)
plt.title("Corners Detected by FAST")
plt.show()

../_images/b1b261acd9c44a801cb0cdbaca6d01e3a8aca2528f9b127fef8f0edc3ae5ba0a.png

Perspective Transform#

pts_src = centroids

# 変換後の座標（出力画像のサイズを決定）
# width, height = 300, 400  # 出力画像の幅と高さを指定
margin = 50  # 余白
pts_dst = np.array([
    [margin, margin],
    [width - margin - 1, margin],
    [margin, height - margin - 1],
    [width - margin - 1, height - margin - 1]
], dtype='float32')
pts_dst

array([[ 50.,  50.],
       [429.,  50.],
       [ 50., 589.],
       [429., 589.]], dtype=float32)

# 変換行列を計算
matrix = cv2.getPerspectiveTransform(pts_src, pts_dst)
# 変換を適用
transformed_img = cv2.warpPerspective(img, matrix, (width, height))

plt.imshow(transformed_img, cmap="gray")

<matplotlib.image.AxesImage at 0x7f33dc4bd8a0>

../_images/32a65c40f5748dbb1eaa680e6e10f38f3ca493e22ef70e49ea67e62c61a1caa9.png

直線を検出する場合#

LSDはライセンスの問題でOpenCVからは削除されたようで、使いたい場合は pylsd をインストールする必要がある。

代わりに FastLineDetector というものがある。（Python使用例：Python+OpenCVによる線分検出 - 社会人研究者が色々頑張るブログ）

直線を検出する#

ぴったり4つではなく8つくらい出てる

length_threshold = 10
distance_threshold = 1.41421356
canny_th1 = 50
canny_th2 = 50
canny_aperture_size = 3
do_merge = False

fld = cv2.ximgproc.createFastLineDetector(
    length_threshold,
    distance_threshold,
    canny_th1,
    canny_th2,
    canny_aperture_size,
    do_merge
)

lines_raw = fld.detect(contour_img)
# 形状を整える
lines = [line[0] for line in lines_raw]

# drawSegmentsで描く場合
# img_lines = fld.drawSegments(img, lines)
# plt.imshow(img_lines)

# cv2.lineで描く場合
print(f"{len(lines)} lines are detected")
out = img.copy()
for line in lines:
    x1, y1, x2, y2 = map(int, line)
    cv2.line(out, (x1, y1), (x2, y2), (0, 0, 255), 1)
plt.imshow(out)

8 lines are detected

<matplotlib.image.AxesImage at 0x7f33dc4f5cf0>

../_images/9df865d48adae5c5758c4472e11054b655fb42fedb69410be80e81a566ae4e76.png

交点を出すために直線を延長する#

\begin{array}{r} \begin{aligned} diff & = P_{1} - P_{2} \\ = (\begin{array}{c} x_{1} \\ y_{1} \end{array}) - (\begin{array}{c} x_{2} \\ y_{2} \end{array}) \\ direction & = \frac{diff}{‖ diff ‖} \\ P_{1}^{n e w} & = (\begin{array}{c} x_{1} \\ y_{1} \end{array}) - direction \times length \\ P_{2}^{n e w} & = (\begin{array}{c} x_{2} \\ y_{2} \end{array}) + direction \times length \end{aligned} \end{array}

def extend_line(p1, p2, length):
    diff = np.array(p2) - np.array(p1)
    direction = diff / np.linalg.norm(diff)
    new_p1 = np.array(p1) - direction * length
    new_p2 = np.array(p2) + direction * length
    new_p1 = tuple(map(int, new_p1))
    new_p2 = tuple(map(int, new_p2))
    return new_p1, new_p2

out = img.copy()
for line in lines:
    x1, y1, x2, y2 = map(int, line)
    p1_, p2_ = extend_line(p1=(x1, y1), p2=(x2, y2), length=50)
    cv2.line(out, p1_, p2_, (0, 255, 0), 1)
    # break
plt.imshow(out)

<matplotlib.image.AxesImage at 0x7f33dc56cac0>

../_images/2d93c2a00242e7e256448f6aa46fcef967cbb58fbd6fe7767f8bc0022b9070ac.png

2つの線の交点を出す#

4つの点

P_{1} = (x_{1}, y_{1}), P_{2} = (x_{2}, y_{2}), P_{3} = (x_{3}, y_{3}), P_{4} = (x_{4}, y_{4})

からなる2つの線分

L_{1} (t) = (1 - t) P_{1} + t P_{2}, (0 \leq t \leq 1) L_{2} (u) = (1 - u) P_{3} + u P_{4}, (0 \leq u \leq 1)

を用いて、交点 $(x, y)$ を求める連立方程式を組む

\begin{array}{r} {\begin{cases} x = (1 - t) x_{1} + t x_{2} \\ x = (1 - u) x_{3} + u x_{4} \\ y = (1 - t) y_{1} + t y_{2} \\ y = (1 - u) y_{3} + u y_{4} \end{cases} \end{array}

整理すると

\begin{array}{r} {\begin{cases} (x_{2} - x_{1}) t - (x_{4} - x_{3}) u = x_{3} - x_{1} \\ (y_{2} - y_{1}) t - (y_{4} - y_{3}) u = y_{3} - y_{1} \end{cases} \end{array}

となる。行列表記にすると以下のようになる。

\begin{array}{r} (\begin{array}{ll} x_{2} - x_{1} & - (x_{4} - x_{3}) \\ y_{2} - y_{1} & - (y_{4} - y_{3}) \end{array}) (\binom{t}{u}) = (\binom{x_{3} - x_{1}}{y_{3} - y_{1}}) \end{array}

行列式

\begin{array}{r} \det = | \begin{array}{ll} x_{2} - x_{1} & - (x_{4} - x_{3}) \\ y_{2} - y_{1} & - (y_{4} - y_{3}) \end{array} | = (x_{2} - x_{1}) (y_{4} - y_{3}) - (y_{2} - y_{1}) (x_{4} - x_{3}) \end{array}

がゼロでなければ、クラメルの公式により

\begin{array}{r} \begin{aligned} t = \frac{(x_{3} - x_{1}) (y_{4} - y_{3}) - (y_{3} - y_{1}) (x_{4} - x_{3})}{\det} \\ u = \frac{(x_{2} - x_{1}) (y_{3} - y_{1}) - (y_{2} - y_{1}) (x_{3} - x_{1})}{\det} \end{aligned} \end{array}

として解が求まる

def get_line_intersection(line1, line2):
    x1, y1, x2, y2 = line1
    x3, y3, x4, y4 = line2
    # Calculate the determinant
    det = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    if det == 0:
        return None  # Lines are parallel or coincident
    # Calculate the intersection point
    px = ((x1*y2 - y1*x2) * (x3 - x4) - (x1 - x2) * (x3*y4 - y3*x4)) / det
    py = ((x1*y2 - y1*x2) * (y3 - y4) - (y1 - y2) * (x3*y4 - y3*x4)) / det
    return int(px), int(py)

out = img.copy()
intersects = []
for i in range(len(lines)):
    for j in range(i, len(lines)):
        x1, y1, x2, y2 = map(int, lines[i])
        p1_, p2_ = extend_line(p1=(x1, y1), p2=(x2, y2), length=50)
        cv2.line(out, p1_, p2_, (0, 255, 0), 1)

        intersect = get_line_intersection(lines[i], lines[j])
        if intersect:
            x_, y_ = intersect
            if abs(x_) > out.shape[1] or abs(y_) > out.shape[0]:
                # 異常値は無視
                continue
            cv2.circle(out, intersect, radius=5, color=(0, 0, 255), thickness=1)
            intersects.append(intersect)
plt.figure(figsize=(10,10))
plt.imshow(out)

<matplotlib.image.AxesImage at 0x7f33da355ea0>

../_images/1784e83854e0bc03ca6948acf6dec3e115f3a2ab30c77b257c6d3518f80c90d6.png

交点の中心点をとる#

from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=4)
kmeans.fit(intersects)
centroids = kmeans.cluster_centers_
centroids

array([[100.75,  98.  ],
       [452.5 , 500.5 ],
       [ 50.5 , 500.5 ],
       [402.25,  99.  ]])

values = centroids[:, 0] + centroids[:, 1]
idx = np.argsort(values)
centroids = centroids[idx]
centroids

array([[100.75,  98.  ],
       [402.25,  99.  ],
       [ 50.5 , 500.5 ],
       [452.5 , 500.5 ]])

out = img.copy()
for centroid in centroids:
    cv2.circle(out, tuple(centroid.astype("uint32")), radius=5, color=(0, 0, 255), thickness=2)

plt.imshow(out)
plt.title("Corners Detected by Intersects of lines")
plt.show()

../_images/f39718aed24f0bcb8e887947f221b18b7928866c04a56295b78d4dd38c2f69b1.png

Perspective Transform#

pts_src = centroids.astype("float32")

# 変換後の座標（出力画像のサイズを決定）
# width, height = 300, 400  # 出力画像の幅と高さを指定
margin = 50  # 余白
pts_dst = np.array([
    [margin, margin],
    [width - margin - 1, margin],
    [margin, height - margin - 1],
    [width - margin - 1, height - margin - 1]
], dtype='float32')
pts_dst

array([[ 50.,  50.],
       [429.,  50.],
       [ 50., 589.],
       [429., 589.]], dtype=float32)

# 変換行列を計算
matrix = cv2.getPerspectiveTransform(pts_src, pts_dst)
# 変換を適用
transformed_img = cv2.warpPerspective(img, matrix, (width, height))

plt.imshow(transformed_img, cmap="gray")

<matplotlib.image.AxesImage at 0x7f33d911c070>

../_images/317f302712508f74613a67c5df8a3e6589394dfa4671d3f0015313ea787369b3.png

歪み補正・台形補正

Contents

歪み補正・台形補正#

方法#

画像の例#

前処理：台形の輪郭線を取り出す#

前処理：平滑化#

局所対応点を検出する方法#

Harrisによるコーナー検出#

FASTによるコーナー検出#

検出したコーナーたちを4点にまとめる#

Perspective Transform#

直線を検出する場合#

直線を検出する#

交点を出すために直線を延長する#

2つの線の交点を出す#

交点の中心点をとる#

Perspective Transform#

参考#