Extracting text pulls the readable words out of a PDF into a plain .txt file. This lets you reuse content in another document, feed it into a translator or analyser, or quickly copy a long passage without manual retyping.
ColaPDF uses the PDF.js engine to read the text layer of your document directly in the browser. Note that scanned PDFs with no text layer (just images of pages) will not yield text unless they have already been run through OCR.
Scanned PDFs contain images of text, not actual text. This tool will return no text for those. Use an OCR tool for scanned documents.
The output is plain text — formatting like bold, tables and columns is not preserved.
No. Text extraction runs in your browser using PDF.js.