python PILを使用してRGB画像を純粋な白黒画像に変換する

Question

Pythonイメージングライブラリを使用して、非常に単純な画像操作を行っていますが、グレースケール画像をモノクロ（白黒）画像に変換するのに問題があります。 image to greyscale（convert（ 'L'））その後、イメージは期待どおりにレンダリングされますが、イメージをモノクロのシングルバンドイメージに変換すると、下のイメージに示すようにノイズが発生します。 PIL/pythonを使用してカラーpng画像を純粋な白黒画像にする簡単な方法はありますか？

from PIL import Image 
import ImageEnhance
import ImageFilter
from scipy.misc import imsave
image_file = Image.open("convert_image.png") # open colour image
image_file= image_file.convert('L') # convert image to monochrome - this works
image_file= image_file.convert('1') # convert image to black and white
imsave('result_col.png', image_file)

Original Image Converted Image

from PIL import Image import ImageEnhance import ImageFilter from scipy.misc import imsave image_file = Image.open("convert_image.png") # open colour image image_file= image_file.convert('L') # convert image to monochrome - this works image_file= image_file.convert('1') # convert image to black and white imsave('result_col.png', image_file)

Original Image Converted Image

unutbu · Accepted Answer

from PIL import Image image_file = Image.open("convert_image.png") # open colour image image_file = image_file.convert('1') # convert image to black and white image_file.save('result.png')

利回り

enter image description here

Martin Thoma · Answer

別のオプション（セグメンテーションマスクを使用する必要がある場合の科学的な目的などに便利です）は、単にしきい値を適用することです：

#!/usr/bin/env python # -*- coding: utf-8 -*- """Binarize (make it black and white) an image with Python.""" from PIL import Image from scipy.misc import imsave import numpy def binarize_image(img_path, target_path, threshold): """Binarize an image.""" image_file = Image.open(img_path) image = image_file.convert('L') # convert image to monochrome image = numpy.array(image) image = binarize_array(image, threshold) imsave(target_path, image) def binarize_array(numpy_array, threshold=200): """Binarize a numpy array.""" for i in range(len(numpy_array)): for j in range(len(numpy_array[0])): if numpy_array[i][j] > threshold: numpy_array[i][j] = 255 else: numpy_array[i][j] = 0 return numpy_array def get_parser(): """Get parser object for script xy.py.""" from argparse import ArgumentParser, ArgumentDefaultsHelpFormatter parser = ArgumentParser(description=__doc__, formatter_class=ArgumentDefaultsHelpFormatter) parser.add_argument("-i", "--input", dest="input", help="read this file", metavar="FILE", required=True) parser.add_argument("-o", "--output", dest="output", help="write binarized file hre", metavar="FILE", required=True) parser.add_argument("--threshold", dest="threshold", default=200, type=int, help="Threshold when to show white") return parser if __== "__main__": args = get_parser().parse_args() binarize_image(args.input, args.output, args.threshold)

./binarize.py -i convert_image.png -o result_bin.png --threshold 200の場合は次のようになります。

maxschlepzig · Answer

カスタムしきい値を使用して2レベル（白黒）画像を作成するためのPIL専用ソリューション：

from PIL import Image img = Image.open('mB96s.png') thresh = 200 fn = lambda x : 255 if x > thresh else 0 r = img.convert('L').point(fn, mode='1') r.save('foo.png')

とだけ

r = img.convert('1') r.save('foo.png')

ディザリングされた画像が得られます。

入力画像の左から右、白黒変換結果、ディザリング結果：

画像をクリックして、拡大縮小されていないバージョンを表示できます。

RajV · Answer

マーティントーマが言ったように、通常はしきい値を適用する必要があります。しかし、その答えで使用されているforループよりもはるかに高速に実行される単純なベクトル化を使用してこれを行うことができます。

以下のコードは、画像のピクセルを0（黒）と1（白）に変換します。

from PIL import Image import numpy as np import matplotlib.pyplot as plt #Pixels higher than this will be 1. Otherwise 0. THRESHOLD_VALUE = 200 #Load image and convert to greyscale img = Image.open("photo.png") img = img.convert("L") imgData = np.asarray(img) thresholdedData = (imgData > THRESHOLD_VALUE) * 1.0 plt.imshow(thresholdedData) plt.show()

Mark Ransom · Answer

nutb によって得られた結果から判断すると、scipyのimsaveはモノクロ（モード1）イメージを理解していないと結論付けます。

M.Bore · Answer

pythonを使用してそれを行う簡単な方法：

Python import numpy as np import imageio image = imageio.imread(r'[image-path]', as_gray=True) # getting the threshold value thresholdValue = np.mean(image) # getting the dimensions of the image xDim, yDim = image.shape # turn the image into a black and white image for i in range(xDim): for j in range(yDim): if (image[i][j] > thresholdValue): image[i][j] = 255 else: image[i][j] = 0