如何使用python刪除最低概率值

Question 1

我有一個(gè)圖像，我想在其中檢測文本，我使用easyocr檢測文本。OCR給出輸出邊界框值和概率，如輸出圖像所示。我想去掉檢測到的任何文本中小于0.4的概率，我如何改變它？

Image1

Image 2

結(jié)果元素給出了第一個(gè)文本和第二個(gè)文本“AA”的輸出概率，如圖所示。我想刪除檢測到的概率最低的文本。

圖像1的輸出

圖像2的輸出

Requirements

pip安裝pytesseract

pip安裝easyocr

使用pythonmain.py-iimage1.jpg運(yùn)行代碼

# import the necessary packages
from pytesseract import Output
import pytesseract
import argparse
import cv2
from matplotlib import pyplot as plt
import numpy as np
import os
import easyocr
from PIL import ImageDraw, Image



def remove_lines(image):
    result = image.copy()
    gray = cv2.cvtColor(image,cv2.COLOR_BGR2GRAY)
    thresh = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

    # Remove horizontal lines
    horizontal_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (40,1))
    remove_horizontal = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, horizontal_kernel, iterations=2)
    cnts = cv2.findContours(remove_horizontal, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    cnts = cnts[0] if len(cnts) == 2 else cnts[1]
    for c in cnts:
        cv2.drawContours(result, [c], -1, (255,255,255), 5)


    # Remove vertical lines
    vertical_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (1,40))
    remove_vertical = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, vertical_kernel, iterations=2)
    cnts = cv2.findContours(remove_vertical, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    cnts = cnts[0] if len(cnts) == 2 else cnts[1]
    for c in cnts:
        cv2.drawContours(result, [c], -1, (255,255,255), 5)

    plt.imshow(result)
    plt.show()

    return result



# construct the argument parser and parse the arguments
ap = argparse.ArgumentParser()
ap.add_argument("-i", "--image", required=True,
    help="path to input image to be OCR'd")
ap.add_argument("-c", "--min-conf", type=int, default=0,
    help="mininum confidence value to filter weak text detection")
args = vars(ap.parse_args())


reader = easyocr.Reader(['ch_sim','en']) # need to run only once to load model into memory



# load the input image, convert it from BGR to RGB channel ordering,
# and use Tesseract to localize each area of text in the input image
image = cv2.imread(args["image"])
# image = remove_lines(image)

results = reader.readtext(image)


print(results)

Question 2

results=[([[5, 0],[233, 0],[233, 15],[5, 15]],' ? ]TC T III3 U?CU 3', 0.011015821004953916),
 ([[241, 0], [390, 0], [390, 15], [241, 15] ] , '?A[ [ C 0l?', 0.0023567583563770737),
 ([[2, 16], [46, 16], [46, 42], [2, 42]], 'MM', 0.9965182566504686),
 ([[98, 16], [140, 16], [140, 46], [98, 46]], 'D', 0.9973547096148511),
 ([[182, 16], [220, 16],[220, 44], [182, 44]], 'Y', 0.9971791823074896),
 ([[24, 46], [62, 46], [62, 74], [24, 74]], '62', 0.9999828941291119),
 ([[94, 46], [130, 46], [130, 74], [94, 74]], '26', 0.9997197349619524),
 ([[180, 46], [242, 46], [242, 74], [180, 74]],'1970', 0.999931275844574)]

low_precision = []
for text in results:
    if text[2]<0.5: # precision here
        low_precision.append(text)
for i in low_precision:
    results.remove(i) # remove low precision
print(results)

result:

[([[2, 16], [46, 16], [46, 42], [2, 42]], 'MM', 0.9965182566504686),
 ([[98, 16], [140, 16], [140, 46], [98, 46]], 'D', 0.9973547096148511),
 ([[182, 16], [220, 16], [220, 44], [182, 44]], 'Y', 0.9971791823074896),
 ([[24, 46], [62, 46], [62, 74], [24, 74]], '62', 0.9999828941291119),
 ([[94, 46], [130, 46], [130, 74], [94, 74]], '26', 0.9997197349619524),
 ([[180, 46], [242, 46], [242, 74], [180, 74]], '1970', 0.999931275844574)]

Answer 1

results=[([[5, 0],[233, 0],[233, 15],[5, 15]],' ? ]TC T III3 U?CU 3', 0.011015821004953916),
 ([[241, 0], [390, 0], [390, 15], [241, 15] ] , '?A[ [ C 0l?', 0.0023567583563770737),
 ([[2, 16], [46, 16], [46, 42], [2, 42]], 'MM', 0.9965182566504686),
 ([[98, 16], [140, 16], [140, 46], [98, 46]], 'D', 0.9973547096148511),
 ([[182, 16], [220, 16],[220, 44], [182, 44]], 'Y', 0.9971791823074896),
 ([[24, 46], [62, 46], [62, 74], [24, 74]], '62', 0.9999828941291119),
 ([[94, 46], [130, 46], [130, 74], [94, 74]], '26', 0.9997197349619524),
 ([[180, 46], [242, 46], [242, 74], [180, 74]],'1970', 0.999931275844574)]

low_precision = []
for text in results:
    if text[2]<0.5: # precision here
        low_precision.append(text)
for i in low_precision:
    results.remove(i) # remove low precision
print(results)

result:

[([[2, 16], [46, 16], [46, 42], [2, 42]], 'MM', 0.9965182566504686),
 ([[98, 16], [140, 16], [140, 46], [98, 46]], 'D', 0.9973547096148511),
 ([[182, 16], [220, 16], [220, 44], [182, 44]], 'Y', 0.9971791823074896),
 ([[24, 46], [62, 46], [62, 74], [24, 74]], '62', 0.9999828941291119),
 ([[94, 46], [130, 46], [130, 74], [94, 74]], '26', 0.9997197349619524),
 ([[180, 46], [242, 46], [242, 74], [180, 74]], '1970', 0.999931275844574)]

如何使用python刪除最低概率值

熱門問答

遞歸查找網(wǎng)格中的所有路徑

如何在同一個(gè)表上使用select和where子句更新sql中的特定列？

如何從PoS應(yīng)用程序繼承OrderReceipt文件

使用./，../，..//解析url等

如何比較file1的列和file2的列，選擇匹配的值，并使用grep或unix命令輸出到新文件

WSL上的VSCode沒有右鍵單擊選項(xiàng)功能，例如查找代碼引用

如何在Codeigniter中下載轉(zhuǎn)換后的HTML文件

必須為可空字段定義mutator以避免類型錯(cuò)誤

如何隱藏多個(gè)表單并使用事件單擊jquery顯示表單之一

如何在php中獲取確認(rèn)框值

vuecli搭建3.0項(xiàng)目跑不起來一直報(bào)錯(cuò) 求助！

excel大小異常

如何使用python刪除最低概率值

熱門問答

遞歸查找網(wǎng)格中的所有路徑

如何在同一個(gè)表上使用select和where子句更新sql中的特定列？

如何從PoS應(yīng)用程序繼承OrderReceipt文件

使用./，../，..//解析url等

如何比較file1的列和file2的列，選擇匹配的值，并使用grep或unix命令輸出到新文件

WSL上的VSCode沒有右鍵單擊選項(xiàng)功能，例如查找代碼引用

如何在Codeigniter中下載轉(zhuǎn)換后的HTML文件

必須為可空字段定義mutator以避免類型錯(cuò)誤

如何隱藏多個(gè)表單并使用事件單擊jquery顯示表單之一

如何在php中獲取確認(rèn)框值

vuecli搭建3.0項(xiàng)目 跑不起來 一直報(bào)錯(cuò) 求助！

excel大小異常

如何在同一個(gè)表上使用select和where子句更新sql中的特定列？

使用./，../，..//解析url等

如何比較file1的列和file2的列，選擇匹配的值，并使用grep或unix命令輸出到新文件

vuecli搭建3.0項(xiàng)目跑不起來一直報(bào)錯(cuò) 求助！