pdf.utils

Functions

extract_pdf_info(→ tuple[int, str])

Extracts the number of pages and text from a PDF.

Module Contents

pdf.utils.extract_pdf_info(content: SupportsRead[bytes], remove: str = '\x00') tuple[int, str][source]

Extracts the number of pages and text from a PDF.

Requires poppler.