Categories
Tech Tips

How to extract image and text from PDF


1. Install poppler-utils

2. To extract original embedded images:

$ pdfimages -j <file.pdf> <to_dir>

3. To extract text:

$ pdftotext -j <file.pdf>

Leave a Reply

Your email address will not be published. Required fields are marked *