How to use the autocorrect.correct_text_block function in autocorrect

To help you get started, we’ve selected a few autocorrect examples, based on popular ways it is used in public projects.

Secure your code as it's written. Use Snyk Code to scan source code in minutes - no build needed - and fix issues immediately.

github jasalt / kuittiskanneri / app / modules / ocr_utils / ocr.py View on Github external
im_proc.communicate()

    # Read receipt with Tesseract
    print "Running OCR"
    image_text = ""
    proc = subprocess.Popen([OCR_SCRIPT, processed_imagepath],
                            stdout=subprocess.PIPE)

    for line in iter(proc.stdout.readline, ''):
        image_text += line.rstrip() + '\n'

    image_text = image_text.decode('utf-8')

    # Autocorrect
    print "Autocorrecting text"
    corrected_text = autocorrect.correct_text_block(image_text)

    if corrected_text is unicode:
        corrected_text = corrected_text.encode('utf-8')

    print "Parsing text"
    parsed_text = receiptparser.parse_receipt(corrected_text)

    return (image_text,
            corrected_text,
            parsed_text)
github jasalt / kuittiskanneri / web-server / server.py View on Github external
# Process image with ImageMagick
    tempimagepath = os.path.join(app.config['UPLOAD_FOLDER'], 'temp.png')
    im_proc = subprocess.Popen(['convert',imagepath,'-resize','600x800',
                                '-blur','2','-lat','8x8-2%',tempimagepath],
                                stdout=subprocess.PIPE)
    im_proc.communicate()

    image_text = ""
    proc = subprocess.Popen([OCR_SCRIPT, tempimagepath],
                            stdout=subprocess.PIPE)
    for line in iter(proc.stdout.readline, ''):
        image_text += line.rstrip() + '\n'

    image_text = image_text.decode('utf-8')

    corrected_text = autocorrect.correct_text_block(image_text)

    if corrected_text is unicode:
        corrected_text = corrected_text.encode('utf-8')

    return (image_text,
            corrected_text,
            receiptparser.parse_receipt(corrected_text))