Hi Community,
I have a multiple question on the Document classification AI Skill. I have trained the AI Skill classification with 2 types of document. Both type does not have fixed structure (few text change based on the conditions).
When we upload the document of these 2 type the confident score is always fall between 55 to 60 .
Another behaviour i observed when i upload different document type which is totally different , the confident score is 50 which i do not expect.
Questions:
Does anyone know how the confident score is determine?
The document structure is not same every time is that because the confident score not going beyond 60? I expect to be more.
Why confident score is still 50 when non trained document uploaded?
Discussion posts and replies are publicly visible
Hi Melvin,
Because machine learning is not deterministic, a high variance in your confidence score or accuracy scores may mean that your dataset is not representative enough. Please refer to our documentation on how to create a representative dataset: https://docs.appian.com/suite/help/23.4/collect-data.html as well as how to evaluate model performance: https://docs.appian.com/suite/help/23.4/evaluate-ai.html
Thank you gabby. I will check and revert if i have questions
The confidence score is internal to Appian and it represents how close the document classification and extraction is.
Thanks Abhay but my curiosity is Why confident score is still 50 when non trained document uploaded? confident score is crucial to decided on the flow of the process.
I think this happens when the identified patterns in the documents are not strong enough and you upload a document that does not deviate enough from the training material.
May be Appian is still able to identify some of the fields from the non-trained document.
Thanks Stefan. i upload a picture in the PDF which is possible scenario in our process. we decide the flow based on the confident score but now it is different .. will check further
To address the issue of low confidence scores for untrained document types, it's essential to: