
Tesseract is free OCR software for text recognition.

# Extract text from a PNG (Tesseract appends .txt to the output base, so this writes text.txt)
tesseract image.png text
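
The command-line call can also be scripted with Python's standard library. The following is a minimal sketch, assuming `tesseract` is on the PATH; the helper names `build_tesseract_command` and `run_tesseract` are made up for illustration and are not part of Tesseract.

```python
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang=None):
    """Return the argument list for `tesseract IMAGE OUTPUTBASE [-l LANG]`."""
    base = Path(output_base)
    # Tesseract adds ".txt" to the output base itself, so drop a trailing ".txt".
    if base.suffix == ".txt":
        base = base.with_suffix("")
    cmd = ["tesseract", str(image_path), str(base)]
    if lang:
        # Language codes are three letters, e.g. "deu" for German.
        cmd += ["-l", lang]
    return cmd


def run_tesseract(image_path, output_base, lang=None):
    """Run Tesseract and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
```

Calling `run_tesseract("image.png", "text")` would then write `text.txt` next to the script, provided the image exists and Tesseract is installed.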

Training

Todo

Python

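# pip install pillow
# pip install pytesseract

from PIL import Image
import pytesseract

# Point pytesseract at the Tesseract executable (only needed if it is not on the PATH)
pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

# The input must be an image, not a text file. Tesseract uses three-letter
# language codes: 'deu' is German and requires the deu language data.
print(pytesseract.image_to_string(Image.open('image.png'), lang='deu'))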
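
A hard-coded executable path only works on machines that use that install location. As a hedged sketch, the path could be resolved at runtime instead; `find_tesseract` and `WINDOWS_DEFAULT` are hypothetical names introduced here, not part of pytesseract.

```python
import shutil

# Install location used in the snippet above (Windows); adjust if yours differs.
WINDOWS_DEFAULT = r"C:\Program Files\Tesseract-OCR\tesseract.exe"


def find_tesseract(fallback=WINDOWS_DEFAULT):
    """Return the tesseract executable found on the PATH, else the fallback path."""
    return shutil.which("tesseract") or fallback
```

The result can then be assigned with `pytesseract.pytesseract.tesseract_cmd = find_tesseract()`.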

EasyOCR

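# pip install torch torchvision torchaudio
# pip install easyocr

import easyocr

# Load the English recognition model
reader = easyocr.Reader(['en'])
# detail=0 returns only the recognized strings, without boxes or confidences
result = reader.readtext('test.jpg', detail=0)
print(result)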
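
With `detail=0` the confidence values are discarded. EasyOCR's default output (`detail=1`) is a list of `(bounding_box, text, confidence)` entries, which makes it possible to drop uncertain detections. The sketch below shows one way to do that; the helper name `texts_above_confidence`, the 0.5 threshold, and the sample data are assumptions for illustration only.

```python
def texts_above_confidence(results, min_confidence=0.5):
    """Keep the text of detections whose confidence is at least min_confidence.

    Expects entries shaped like EasyOCR's detail=1 output:
    (bounding_box, text, confidence).
    """
    return [text for _bbox, text, confidence in results if confidence >= min_confidence]


# Made-up sample shaped like readtext() output, not real OCR results.
sample = [
    ([[0, 0], [50, 0], [50, 20], [0, 20]], "Hello", 0.98),
    ([[0, 30], [50, 30], [50, 50], [0, 50]], "w0rld", 0.21),
]
print(texts_above_confidence(sample))  # prints ['Hello']
```

With a real image, the input would come from `reader.readtext('test.jpg')` without the `detail=0` argument.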