  1. #1
    Just Joined!
    Join Date
    May 2005
    Posts
    2

    Convert PDF to Text files


    I have a large number of PDFs produced nightly which must be converted to text files for our Optical Solution. Any suggestions on applications which can solve this problem? I need to maintain the format with respect to columns, page breaks, etc. I would want to set this up as a nightly job to take all PDF files in a directory, convert them to text files, and store them in another directory.

  2. #2
    Linux Engineer
    Join Date
    Aug 2004
    Posts
    826
    You're going to lose a lot of the formatting in the PDF files by converting them to ASCII text. The best solution is probably to use pdftohtml (http://pdftohtml.sourceforge.net/). It keeps most of the formatting, and you can always run its output through an HTML-to-text converter if you absolutely need plain text (http://jsoftco.8m.com/download.html).

    To have it run nightly, set up a cron job. Here's one of many tutorials on cron: http://www.unixgeeks.org/security/ne...ix/cron-1.html

    I hope this helps. These programs are just ones I know off the top of my head and there may be much better ones out there.
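
For the "take every PDF in one directory and write the results to another" part, here is a rough sketch of a wrapper script that cron could call. It is only an illustration, not a tested production job: it assumes pdftohtml is installed and on the PATH, that it accepts the usual `pdftohtml [options] input.pdf output-base` form, and that `-noframes` (one HTML file per PDF) is the output you want. The names `build_command` and `convert_all` are made up for this example.

```python
#!/usr/bin/env python3
"""Batch-convert every PDF in one directory into another directory."""
import subprocess
import sys
from pathlib import Path

# Assumption: pdftohtml is installed and on PATH. Swap in another
# converter command here if you use a different tool.
CONVERTER = ["pdftohtml", "-noframes"]


def build_command(pdf, out_dir, converter=CONVERTER):
    """Return the argument list for converting one PDF.

    The last argument is the target path without an extension; the
    converter is expected to append its own (.html for pdftohtml).
    """
    return [*converter, str(pdf), str(Path(out_dir) / Path(pdf).stem)]


def convert_all(src_dir, dst_dir, converter=CONVERTER):
    """Convert each *.pdf in src_dir; return the PDFs that failed."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    failed = []
    for pdf in sorted(Path(src_dir).glob("*.pdf")):
        result = subprocess.run(build_command(pdf, dst, converter))
        if result.returncode != 0:
            failed.append(pdf)
    return failed


# Usage: script.py SOURCE_DIR TARGET_DIR
if __name__ == "__main__" and len(sys.argv) == 3:
    failures = convert_all(sys.argv[1], sys.argv[2])
    for pdf in failures:
        print(f"conversion failed: {pdf}", file=sys.stderr)
    sys.exit(1 if failures else 0)
```

A crontab line along the lines of `0 2 * * * /usr/bin/python3 /path/to/convert_pdfs.py /path/to/pdfs /path/to/output` would run it at 02:00 every night; the script name and all three paths are placeholders. The non-zero exit status on failure is there so that cron's mail (or whatever monitoring you use) can flag a bad run.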
