Process a PDF file with OCR

This endpoint runs OCR (Optical Character Recognition) on a PDF file. Users can specify the languages, sidecar, deskew, clean, cleanFinal, ocrType, ocrRenderType, and removeImagesAfter options. It uses OCRmyPDF if available and falls back to Tesseract otherwise. Input: PDF. Output: PDF. Type: SI-Conditional.
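The fallback described above can be pictured with a short Python sketch. It is illustrative only: it assumes that "available" means the `ocrmypdf` or `tesseract` executable is found on the PATH, which the description does not confirm, and the function name `pick_ocr_backend` is hypothetical.

```python
import shutil


def pick_ocr_backend(which=shutil.which):
    """Return the first OCR tool found, preferring OCRmyPDF over Tesseract.

    Assumption: availability is decided by looking up the executable name
    on the PATH. `which` is injectable so the lookup can be stubbed out.
    """
    for name in ("ocrmypdf", "tesseract"):
        if which(name):
            return name
    return None  # neither tool is installed
```

Calling `pick_ocr_backend()` with no arguments checks the local machine; passing a stub function makes the preference order easy to verify without installing either tool.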

fileInput: The input PDF file

File ID for server-side files (can be used instead of fileInput)

languages: List of languages to use in OCR processing, e.g., 'eng', 'deu'

sidecar: Include the OCR text in a sidecar text file if set to true

deskew: Deskew the input file if set to true

clean: Clean the input file if set to true

cleanFinal: Clean the final output if set to true

ocrType: The OCR type, e.g., 'skip-text', 'force-ocr', or 'Normal'

ocrRenderType: The OCR render type, either 'hocr' or 'sandwich'

removeImagesAfter: Remove images from the output PDF if set to true
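As a rough illustration of how these options might be assembled into a request, the following Python sketch builds the form fields. Several details are assumptions rather than documented behavior: the helper name `build_ocr_fields` is hypothetical, the default values are arbitrary, booleans are encoded as the strings "true" and "false", and each language is sent as a repeated `languages` field. The endpoint URL and the file upload itself are deliberately left out.

```python
# The two render types named in the parameter description.
RENDER_TYPES = ("hocr", "sandwich")


def build_ocr_fields(languages, sidecar=False, deskew=False, clean=False,
                     clean_final=False, ocr_type="skip-text",
                     ocr_render_type="hocr", remove_images_after=False):
    """Return a list of (name, value) form fields for the OCR options.

    Defaults and the string encoding of booleans are assumptions made
    for this sketch, not values taken from the endpoint description.
    """
    if not languages:
        raise ValueError("at least one language is required, e.g. 'eng'")
    if ocr_render_type not in RENDER_TYPES:
        raise ValueError(f"ocrRenderType must be one of {RENDER_TYPES}")

    def flag(value):
        return "true" if value else "false"

    # One 'languages' field per language (assumed list encoding).
    fields = [("languages", lang) for lang in languages]
    fields += [
        ("sidecar", flag(sidecar)),
        ("deskew", flag(deskew)),
        ("clean", flag(clean)),
        ("cleanFinal", flag(clean_final)),
        ("ocrType", ocr_type),  # passed through unchecked; listed values are examples
        ("ocrRenderType", ocr_render_type),
        ("removeImagesAfter", flag(remove_images_after)),
    ]
    return fields
```

For example, `build_ocr_fields(["eng", "deu"], deskew=True)` yields two `languages` entries followed by the remaining options; the list could then be sent as form data, together with the PDF as `fileInput`, using whichever HTTP client is at hand.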
