The capabilities described on this page are included in Appian's advanced and premium capability tiers. Usage limits may apply. |
This topic covers how to create an AI skill to classify documents using machine learning (ML) models.
A document classification AI skill takes documents as inputs via the Classify Documents smart service, analyzes the documents, and returns predictions about a document's type.
Here's a high-level breakdown of how to create and use a document classification AI skill:
Note: This AI skill type is only available in select regions.
In the Build view, click NEW > AI Skill.
In the Classification section, select Documents.
Configure the following properties:
Property | Description |
---|---|
Name | Enter a name that follows the recommended naming standard. |
Description | (Optional) Enter a brief description of the AI skill. |
Document types are the categories of documents you want this skill to classify. For example, document types might be purchase orders or invoices. You'll define a document type for each category of document, and then provide training documents that represent the documents you expect to classify in production.
To create a document type:
On the Configure Training page, click NEW DOCUMENT TYPE.
The model needs training documents to learn about the patterns or traits the documents have in common. The model also uses some of these training documents to test its knowledge after training.
The model can only learn from the documents you provide, so be sure to build a comprehensive and diverse set of documents that represent what you expect to classify in your process.
Training documents must meet the following requirements:
You can add training documents to a new or existing document type.
To add training documents to an existing document type:
After you've added training documents, you're ready to start training the model.
Keep in mind the following requirements when training this AI skill type:
If any of these guardrails are limiting, please reach out to your Appian contact.
You'll manually prompt the model to start training based on the training documents you provided. When you've created the relevant document types and provided training documents for each one, click TRAIN MODEL on the Configure Training page.
During training, the model analyzes a portion of your training documents to learn about the traits or patterns that could identify them as the document type.
After learning about your documents, the model uses the remaining portion of the training documents to test itself. The model training will display metrics to show how it performed.
Tip: Training can take a few minutes. You can close the Training Summary page and return later to view the results.
After the model is finished training and testing, you're shown a number of metrics measuring its performance. You'll use this information to determine if this model is ready to publish, or if you'd like to provide more documents for the model to continue training.
Some of these metrics might be more or less relevant based on your use case and the type of documents you expect to classify.
Learn more about evaluating model performance.
Before you integrate the skill into a process, you want to be confident the published model performs as you expect. You can test the model using a few sample files to verify it works as you expected directly within the AI skill object.
To test the model:
Click TEST MODEL.
Browse for and select the documents you want to add. You can add up to five files.
In the banner showing you the test status, click REFRESH to see when results are available.
Click START OVER to run another test with different files, or CLOSE to exit the results window.
If the model's training metrics don't yet meet your requirements, you can continue training to try to boost the model's performance. Each time you retrain, you're creating a new model. Provide the new model with additional training documents to help it learn more about the document type. Be sure your training documents represent the documents you expect to see in production.
Tip: Training a new model doesn't make the AI skill unavailable or otherwise impact a published model. The AI skill uses the existing published model until you publish a new one.
To continue training by creating a new model:
As you expand how you use machine learning models throughout your applications, you might wonder: when should I train a new model vs. creating an entirely new skill?
Keep in mind you can only publish one model for a skill. In practice, this means that a Classify Documents smart service will only be able to access one model for the skill the smart service is configured to use.
Also keep in mind that ML models are designed for a specific purpose. To help the model specialize and hone in on key traits in your document types, consider creating one at the lowest level possible for your process. That is, if a single step in a process requires you to classify invoices and purchase orders, don't bother including a third document type for inventory reports. Adding this extraneous information will only serve to distract the model, so it's best to exclude. Instead, create document types specific to the documents you expect to classify in your specific use case.
To help demonstrate these ideas, let's use the example of studying for a test. Imagine you're taking a driving exam to get your license. You'll study laws and general rules of the road, as well as practice identifying street signs. You may even test your knowledge by navigating a practice test that contains word problems and multiple choice questions. Your family is moving to a neighboring state around the same time you're planning to take the test.
With these two things in mind, you can begin to consider when to create a new AI skill instead of using an existing one. In short:
When the model's training metrics meet your requirements, you're ready to put it to use. Publish the model to make it available for use in your process, through the Classify Documents smart service.
You'll configure the Classify Documents smart service to use your AI skill. When the process reaches the smart service node, it uses the published model within the skill to analyze and classify documents.
Now you're ready to use your document classification skill in a process.
Add the Classify Documents smart service and configure it to call your new skill.
Review the feature's compliance to ensure it aligns with your organization's security requirements.
Create a Document Classification AI Skill