PDF Read OCR

Hi All,

I have requirement to read content of PDF having Table and Paragraphs which is structured and unstructured and store it into TABLES without any manual intervention.

I have used the Appian inbuilt feature doc extract but its not providing me accurate reading capability like some fields are readable but some are not. 

https://docs.appian.com/suite/help/23.2/evaluate-doc-extraction.html

We would like to stick with Appian as it keeps the doc in Appian rather than using Google 

Any suggestion will be helpful

Thanks

  Discussion posts and replies are publicly visible

Parents Reply
  • 0
    Certified Lead Developer
    in reply to bihitakdass

    As I already tried to explain, the training for extract ONLY happens in the reconciliation step. You will need to repeat that at least a few times with various documents, so that the machine learning model understands what you want it to do.

    Then, you add a data validation step after the extraction to decide whether the extraction was good or needs a manual reconciliation.

    And if you have the feeling that the OOTB extraction is not powerful enough, I suggest to contact Appian and discuss your specific use case.

Children
No Data