Appian is integration agnostic and has the ability to connect with any machine learning offering that exposes itself with a web API. The purpose of this article is to provide information that will empower your general understanding of machine learning technology regardless of the specific tool being used. For a list of the machine learning integrations that have been written about in detail in Appian's documentation, refer to the articles below:
Machine learning is a type of artificial intelligence that uses mathematical models to generate probabilistic predictions by finding patterns in historical data. Machine learning models can be thought of as black boxes that are created by processing many observations with known outcomes. These models are then able to take in one or many observations without a known outcome and produce possible outcomes and their probabilities.
There are many different uses and applications for machine learning, but this article currently focuses on machine learning technology that analyzes structured data—such as rows of an Excel spreadsheet or an Appian CDT—and delivers a prediction for a specific field or column in the data. This feature, value or attribute that is being predicted for is often referred to as the target.
Other uses for machine learning include natural language analysis and translation and the ability to decipher image contents, done using tools such as IBM's Watson and Google's AutoML Vision, respectively.
There are two major categories of model types that are used for making machine learning predictions on structured data:
Which type you utilize is dependent on the target attribute you want to predict for and your overall objective in creating the model. Read the sections below to learn more about the purpose of each model type and see examples describing appropriate uses of each one.
Regression
Regression models can be used to predict:
Binary Classification
Binary classification models can be used to predict:
Multiclassification
Multiclass classification models can be used to predict:
Model Types Summary
Root Mean Square Error (RMSE)
Mean Absolute Error (MSE)
F1 Score
Log Loss
To create a model, you must supply the machine learning tool with training data that it will use to learn about associations between different attribute values and the target attribute. This training data is the means by which the model understands and recognizes patterns about the data for which you ask it to make predictions. Below is an example of a data structure that might be used for training data for a model designed to predict the sale price of a used car. In this use case, the column marked "Sale Price" would be identified to the model as the target attribute to predict for.
The details about how data should be ordered, formatted and uploaded to a machine learning tool for training vary depending on the specific tool being used, so refer to your tool's documentation for specific information about appropriately presenting data.
Best Practices and Tips for Training Data
See Also
Websites:
Videos: