Hi Appian Community, hope you are doing well.

I have the following use case: the solution should capture payslip documents from the end user and extract the data from them. With the Appian 23.4 release, the AI skill has become much smarter and works like a charm for documents that share a similar structure. In this use case, however, each user may upload a payslip in a different format. All payslips will be structured PDFs, but their layouts can vary.

What is the best way to train the model so it can capture data from these payslips? If that is not possible, can you please suggest a workaround?

Regards
Documents with larger deviations in their structure are always a challenge. I assume that not every document is different. Can you group the payslips into classes and then use a classification model to identify the variant? Then build an extraction for each class. That might cover at least a larger share of the documents.
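
To make the classify-then-extract idea concrete, here is a rough sketch of the routing logic in plain Python. It is not Appian code and does not use any Appian API. The keyword scoring only stands in for a trained classification model, and the regex extractors stand in for one extraction model per class. The class names (`vendor_a`, `vendor_b`), marker phrases, and field names are all made up for illustration, and the input is assumed to be text already pulled out of the PDF.

```python
import re
from typing import Callable, Dict, List, Optional

# Hypothetical layout classes and marker phrases (assumptions, not real vendors).
LAYOUT_MARKERS: Dict[str, List[str]] = {
    "vendor_a": ["net pay", "employee id"],
    "vendor_b": ["take home", "staff no"],
}


def classify_layout(text: str) -> Optional[str]:
    """Stand-in for a classification model: pick the class with most marker hits."""
    lowered = text.lower()
    best_label, best_score = None, 0
    for label, markers in LAYOUT_MARKERS.items():
        score = sum(1 for marker in markers if marker in lowered)
        if score > best_score:
            best_label, best_score = label, score
    return best_label  # None when no marker matched at all


def _search(pattern: str, text: str) -> Optional[str]:
    """Return the first capture group of the pattern, or None if it is absent."""
    match = re.search(pattern, text, flags=re.IGNORECASE)
    return match.group(1) if match else None


def extract_vendor_a(text: str) -> Dict[str, Optional[str]]:
    return {
        "employee_id": _search(r"Employee ID:\s*(\S+)", text),
        "net_pay": _search(r"Net Pay:\s*([\d.,]+)", text),
    }


def extract_vendor_b(text: str) -> Dict[str, Optional[str]]:
    return {
        "employee_id": _search(r"Staff No\.?\s*(\S+)", text),
        "net_pay": _search(r"Take Home\s*([\d.,]+)", text),
    }


# One extractor per class, mirroring "build an extraction for each class".
EXTRACTORS: Dict[str, Callable[[str], Dict[str, Optional[str]]]] = {
    "vendor_a": extract_vendor_a,
    "vendor_b": extract_vendor_b,
}


def process_payslip(text: str) -> dict:
    """Classify first, then run the matching extractor; flag gaps for a human."""
    label = classify_layout(text)
    if label is None:
        return {"layout": None, "fields": {}, "needs_review": True}
    fields = EXTRACTORS[label](text)
    needs_review = any(value is None for value in fields.values())
    return {"layout": label, "fields": fields, "needs_review": needs_review}
```

Payslips that match no known class, or that come back with missing fields, are flagged with `needs_review` so they can go to a manual step instead of failing silently. That fallback is my own addition to the sketch, but it is probably what covers the remaining documents that no class fits.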