Text data extractor: PDF to Text

Hello community,
I made this PDF to text extractor app that takes a pdf as input, displays the document on the page and returns, based on the user option, either a txt file that contains all of the PDF’s text or a ZIP folder that has txt files containing the text from the pages, such as every file represents a page from the pdf.

I will be adding text extraction from scanned PDF next.
Hope it helps!

5 Likes

Update :balloon:

You can now enable OCR for scanned documents and extract your text data:

  • Upload your PDF
  • Enable OCR
  • Select the PDF language (English, French, Spanish or Arabic)
  • Download your output file (zip/txt) :tada:

App → PDF text extractor with OCR
Code → Repository