Hi,
We are planning to implement the below use case via Appian RPA. Read the contents of a PDF file and based on the contents from PDF file, create task and assign to various groups in Appian. We are assuming this PDF to be in a predefined format.Do we have any examples to refer to for extracting the PDF content via Appian RPA?
Discussion posts and replies are publicly visible
Hi, Thanks for your question. We should first consider if the PDF is readable or not. A readable PDF means that you are able to select and copy (ctrl + c) the desired piece of text to be processed by the robot. In that case, the Java library Apache PDFBox would be the best approach to cover the text extraction. https://pdfbox.apache.org/By the other hand, if the process deals with text as image in the PDF file, Appian IDP covers a complete solution to extract and classify information from the PDFs. I highly recommend you to have a look to the documentation of Appian IDP here: https://docs.appian.com/suite/help/20.2/idp-1/idp-landing-page.html
Hi Javier Advani,
Thanks for the info. I will have a look at the pdfbox java library for the implementation.