Just extract text?

Brought to you by: tobias-elze

#13 Just extract text?

Milestone: v1.0 (example)

Status: open

Owner: nobody

Labels: None

Priority: 5

Updated: 2016-07-19

Created: 2016-07-19

Creator: hmijail

Private: No

It would be very useful to have an option to only dump the text contained in the PDF. Looks like one of the files created by the Tesseract processing (as seen by using -debug, at least with Tesseract 3.04.00) is a .txt dump, but of course this is only page-by-page, so they would have to be recovered, concatenated and saved.

Just extract text?

Group

Searches

Help

#13 Just extract text?

Discussion