  1. #1
    Just Joined!
    Join Date
    Feb 2009
    Posts
    4

    Another problem with Abbyy OCR

    Hi,
    Now I have a much bigger problem. I don't know how to set things up so that a scanned document, output as HTML, XML, or PDF, gives me only the data I want, such as the bill, supplier, date of the bill, and so on. I don't know how to define types for the document, because the data is always in a different place. I think this is a problem with the position of the data, and I don't know how to solve it.


    Any ideas?

  2. #2
    Super Moderator devils casper's Avatar
    Join Date
    Jun 2006
    Location
    Chandigarh, India
    Posts
    24,316
    There must be instructions for setting the output type of scanned docs in the Abbyy documentation. Have you checked it thoroughly?
    It is amazing what you can accomplish if you do not care who gets the credit.

  3. #3
    Just Joined!
    Join Date
    Feb 2009
    Posts
    4
    I think we misunderstood each other. I know how to set output types, but I don't know how to select only the metadata I need from a scanned document. I found that it is related to setting block types, but the rest is still a mystery to me.

  4. #4
    Linux Guru gogalthorp's Avatar
    Join Date
    Oct 2006
    Location
    West (by God) Virginia
    Posts
    3,105
    Let us try to understand what you want.

    You want to scan invoices from multiple sources which are formatted differently. You wish to pick out only certain data (date, amounts, etc.) which are located in different places on the different documents.

    Do you know of any software that will do this on any OS?

    It seems to me that the software would first need to recognize a given format, and you would need to associate that format with the locations of the wanted data. Each new invoice source would need to be taught to the system. That also assumes that each format could be recognized in the first place by some unique feature.
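
To make that idea concrete, here is a rough Python sketch of template matching. It is not Abbyy's API. It assumes the OCR step can hand you each recognized word together with its position on the page, and every name in it (`Word`, `Region`, `Template`, `extract_fields`, the suppliers, and the coordinates) is made up for illustration. Each template holds a marker string that identifies one supplier's layout and the page regions where the wanted fields sit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    x: float  # left edge of the word on the page
    y: float  # top edge of the word on the page


@dataclass(frozen=True)
class Region:
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, word):
        return (self.left <= word.x <= self.right
                and self.top <= word.y <= self.bottom)


@dataclass
class Template:
    name: str     # supplier this layout belongs to
    marker: str   # unique text that identifies the layout
    fields: dict  # field name -> Region where that field is printed


def find_template(words, templates):
    """Return the first template whose marker text appears on the page."""
    page_text = " ".join(w.text for w in words).lower()
    for template in templates:
        if template.marker.lower() in page_text:
            return template
    return None


def extract_fields(words, templates):
    """Recognize the layout, then read each field from its known region."""
    template = find_template(words, templates)
    if template is None:
        # A new invoice source has to be taught to the system first.
        raise LookupError("unknown layout: no template matches this page")
    result = {"supplier": template.name}
    for field_name, region in template.fields.items():
        inside = sorted((w for w in words if region.contains(w)),
                        key=lambda w: (w.y, w.x))
        result[field_name] = " ".join(w.text for w in inside)
    return result


# Hypothetical templates: two suppliers with the same fields in different places.
TEMPLATES = [
    Template("Acme Ltd", "ACME LTD",
             {"date": Region(400, 40, 600, 60),
              "total": Region(400, 700, 600, 720)}),
    Template("Globex", "Globex GmbH",
             {"date": Region(50, 100, 250, 120),
              "total": Region(50, 650, 250, 670)}),
]

# Hypothetical OCR output for one scanned page.
page = [
    Word("Globex", 50, 20), Word("GmbH", 120, 20),
    Word("2009-02-17", 60, 105),
    Word("149.90", 60, 655), Word("EUR", 130, 655),
]

print(extract_fields(page, TEMPLATES))
```

A page that matches no marker raises `LookupError`, which is the point where a new invoice source would have to be taught to the system. Matching on a single marker string is the simplest possible "unique feature"; a real setup would likely need something sturdier, such as a logo position or a combination of keywords.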
