<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="https://community.appian.com/cfs-file/__key/system/syndication/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Regarding IDP document extraction of undefined PDF</title><link>https://community.appian.com/discussions/f/rules/28758/regarding-idp-document-extraction-of-undefined-pdf</link><description>I have a requirement where User wants to extract any Pdf value whose format is not fixed. Means invoice or any other Pdf will be different ?. if yes then how can we build dynamic CDT value in IDP for document extraction of different extracted PDF.</description><dc:language>en-US</dc:language><generator>Telligent Community 12</generator><item><title>RE: Regarding IDP document extraction of undefined PDF</title><link>https://community.appian.com/thread/113186?ContentTypeID=1</link><pubDate>Wed, 24 May 2023 06:43:40 GMT</pubDate><guid isPermaLink="false">d3a83456-d57b-489c-a84c-4e8267bb592a:1db4f944-81a4-4a13-b908-c3e55bcf5d19</guid><dc:creator>himanshus286423</dc:creator><description>&lt;p&gt;Thanks for the detailed information stefan. It will really help.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>RE: Regarding IDP document extraction of undefined PDF</title><link>https://community.appian.com/thread/113185?ContentTypeID=1</link><pubDate>Wed, 24 May 2023 06:38:36 GMT</pubDate><guid isPermaLink="false">d3a83456-d57b-489c-a84c-4e8267bb592a:1d8ca43e-10f1-42c4-85e1-f6b2b4deec53</guid><dc:creator>Stefan Helzle</dc:creator><description>&lt;p&gt;I just had a look at the new version, and you still have to predefine all fields.&lt;/p&gt;
&lt;p&gt;But again, even machine learning is not very good in guessing and trying to translate random data into a structure.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>RE: Regarding IDP document extraction of undefined PDF</title><link>https://community.appian.com/thread/113173?ContentTypeID=1</link><pubDate>Wed, 24 May 2023 00:46:34 GMT</pubDate><guid isPermaLink="false">d3a83456-d57b-489c-a84c-4e8267bb592a:81fc61b4-34a5-4093-a668-76211c2f6f01</guid><dc:creator>aditya007</dc:creator><description>&lt;p&gt;Create the maximum number of columns in the tables that you can see near future can be utilized and train the IDP model using all possible files you having. In backend Appian uses google ML AI tool.&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>RE: Regarding IDP document extraction of undefined PDF</title><link>https://community.appian.com/thread/113151?ContentTypeID=1</link><pubDate>Tue, 23 May 2023 12:56:56 GMT</pubDate><guid isPermaLink="false">d3a83456-d57b-489c-a84c-4e8267bb592a:503a9039-ea3b-4a65-ac4c-699e0683ff1a</guid><dc:creator>Stefan Helzle</dc:creator><description>&lt;p&gt;Trying to extract &amp;quot;any&amp;quot; data is always a bit difficult. At the end the purpose of doing this is to get that data into a structure, and that structure needs to be pre-defined.&lt;/p&gt;
&lt;p&gt;I suggest to try this with the new AI capabilities in Appian 23.2. It seems to be superior to the &amp;quot;old&amp;quot; IDP implementation.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item></channel></rss>