<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="https://community.appian.com/cfs-file/__key/system/syndication/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Google OCR retrieve the JSON result file</title><link>https://community.appian.com/discussions/f/rules/17599/google-ocr-retrieve-the-json-result-file</link><description>Hi, I was trying the connected system ( Google Cloud Document OCR Connected System ) and its services. I am able to perform the OCR operation on the PDF and get the JSON file. Now if I try to use the Format OCR Results service and retrieve the data out</description><dc:language>en-US</dc:language><generator>Telligent Community 12</generator><item><title>RE: Google OCR retrieve the JSON result file</title><link>https://community.appian.com/thread/69360?ContentTypeID=1</link><pubDate>Thu, 12 Sep 2019 15:01:39 GMT</pubDate><guid isPermaLink="false">d3a83456-d57b-489c-a84c-4e8267bb592a:c5088ddf-48c6-4b2f-a3fa-de1a588ad1e1</guid><dc:creator>Ben Edwards</dc:creator><description>&lt;p&gt;Hi there, my team recently ran into the same issue. I informed Appian support through a support case and was told that the engineering team is investigating an update.&amp;nbsp;&lt;/p&gt;
&lt;p&gt;We developed a pretty decent workaround that involves more work. We add a pdf to our Google Cloud Storage bucket. Then, we use an integration&amp;nbsp;that performs the&amp;nbsp;&amp;#39;Start OCR&amp;#39; operation via the Google OCR connected system that Appian provides. After that, we use the &amp;#39;Get Signed URL&amp;#39; an integration that performs the &amp;#39;Get Signed URL&amp;#39; operation via the Google Storage connected system that Appian provides to get the results file that gets created in the Storage bucket from the &amp;#39;Start OCR&amp;#39; operation. Using that URL, we perform a GET request to get the contents of that results file and parse the file, which includes more than just the first page results. It&amp;#39;s more work than just using the &amp;#39;Format OCR Results&amp;#39; operation, but it&amp;#39;s working for a solution we&amp;#39;ve developed. The key is that the &amp;#39;Start OCR&amp;#39; operation creates a result file in your Storage bucket and you should be able to retrieve it.&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item></channel></rss>