Google Vision Cloud Integration

Abbas Khan over 6 years ago

How do I pass pdf docs for OCR? Although google supports pdf documents but Appian's integration object doesn't seem to support pdf docs. Is there a way to get sail code for integration object and enhance it to accept pdf docs?

Discussion posts and replies are publicly visible

Parents

0 Jorge Sanchez
Appian Employee
over 6 years ago

matthew.shutt What limitation are you referring to from Google? I see they do accept PDF files, but limit the documents to 2000 pages.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Reply

0 Jorge Sanchez
Appian Employee
over 6 years ago

matthew.shutt What limitation are you referring to from Google? I see they do accept PDF files, but limit the documents to 2000 pages.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel

Children

0 matthew.shutt
Appian Employee
over 6 years ago in reply to Jorge Sanchez

cloud.google.com/.../pdf

The limitation is that this is only available for files stored in Google Cloud Storage and using asyncBatchAnnotate function.
"The Vision API can detect and transcribe text from PDF and TIFF files stored in Google Cloud Storage.

Document text detection from PDF and TIFF must be requested using the asyncBatchAnnotate function, which performs an asynchronous request and provides its status using the operations resources."

The Google Vision Connected System currently does not require the designer to store files in Google Cloud Storage (enabling Appian documents to be sent for analysis) and does not use the asyncBatchAnnotate.

This could be a potential enhancement for our connected system.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel
0 Jorge Sanchez
Appian Employee
over 6 years ago in reply to matthew.shutt

I see. So if I understand correctly, in order to use this functionality, we would have to create another integration to upload files to GC Storage, get the doc path, and use it in the async call to begin the OCR process. Using the id of the request, check the status, and once it is done, retrieve results.
Cancel
Vote Up 0 Vote Down

Sign in to reply

Verify Answer

Cancel