/home/telatin


a web notebook to store some memos…

Tag: ghostscript

linux

Extract text from PDF, from the command line

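pdftotext is a command-line tool for converting PDF files to plain text. It is included by default with many Linux distributions. With no output name given, it writes the text to a file named after the input, so the command below produces file.txt.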

$ pdftotext file.pdf
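To convert a whole folder of PDFs in one go, the same command can be wrapped in a small loop. This is only a sketch: convert_all_pdfs is a made-up helper name (not something shipped with pdftotext), and it assumes pdftotext is on the PATH.

```shell
# Convert every PDF in a directory to a .txt file next to it.
# NOTE: convert_all_pdfs is a hypothetical helper name, not part of pdftotext.
convert_all_pdfs() {
    dir="${1:-.}"   # default to the current directory
    for pdf in "$dir"/*.pdf; do
        # Skip the unexpanded pattern when the directory has no PDFs
        [ -e "$pdf" ] || continue
        # Explicit output name: same path with .pdf replaced by .txt
        pdftotext "$pdf" "${pdf%.pdf}.txt"
    done
}
```

Calling it as `convert_all_pdfs ~/papers` should leave one .txt file beside each .pdf in that folder.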

The gs (Ghostscript) program can also handle the process:
$ gs -sDEVICE=txtwrite -o extractedText.txt input.pdf
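Since not every machine has both tools, a small wrapper can try pdftotext first and fall back to Ghostscript. Again a rough sketch rather than a polished script: pdf_to_text is a hypothetical function name, and the only addition to the gs command above is -q, which quiets the startup banner.

```shell
# Extract text from a PDF with whichever tool is installed.
# NOTE: pdf_to_text is a hypothetical helper name used for illustration.
pdf_to_text() {
    in="$1"
    # Default output: input name with .pdf replaced by .txt
    out="${2:-${in%.pdf}.txt}"
    if command -v pdftotext >/dev/null 2>&1; then
        pdftotext "$in" "$out"
    elif command -v gs >/dev/null 2>&1; then
        # -q suppresses Ghostscript's startup banner
        gs -q -sDEVICE=txtwrite -o "$out" "$in"
    else
        echo "pdf_to_text: neither pdftotext nor gs found" >&2
        return 1
    fi
}
```

For example, `pdf_to_text input.pdf extractedText.txt` writes to the named file, while `pdf_to_text input.pdf` writes to input.txt.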

Posted on February 14, 2017. Tags: ghostscript, linux, pdf, pdftotext, shell, terminal