SphereGen

spheregen logo on transparent background

UiPath’s New Generative AI Features for Document Understanding 

UiPath recently released new features in their Document Understanding tool that includes Generative AI document classification and data extraction. The Generative AI components leverage machine learning (ML) and natural language processing (NLP) to understand and interpret documents in a more human-like manner by paying attention to context and meaning. This allows the tool to handle a wide range of document formats and structures, making it a versatile solution for businesses dealing with large volumes of data.

These AI features not only greatly simplify the process of automating document translation and data extraction; they increase the probability of accuracy and reduce document training time. Document Understanding was always a great tool, however the incorporation of Generative AI has made it an incredibly powerful tool.

How Document Understanding has Changed

UiPath has offered their Document Understanding (DU), for many years. DU is a robust tool able to translate digital documents which are typed or originally handwritten. Multiple types of documents, including invoices, receipts, contracts, and forms can be processed.

Prior to the Generative AI release, the DU tool required training to recognize the form type and the data to be extracted from the form. Training took time, as the tool was fed hundreds of forms to develop pattern recognition with a high degree of accuracy and confidence. Once the data was retrieved, it was formatted, if necessary, for further processing within an automation or application.

With Generative AI, two major improvements have been made to the Document Understanding tool. These areas are related to document classification and data extraction.

Under the latest release including Generative AI, Document Understanding can ingest a document and use advanced NLP techniques to analyze documents and identify key data points. It understands the context and semantics of the text, enabling it to accurately extract the required information with no training.

DU is able to classify documents and extract data based on a series of key phrases and prompts, posed to UiPath’s LLMs (Large Language Models) DocPATH and CommPATH. Automations can handle multiple types of document formats with incredibly quick setup, as each format does not need to be defined to the process. The UiPath automation flow can then be designed to take actions based on the document classification and data values.

Why is Document Understanding with GenAI More Effective?

Less Training Time, Better ROI

As stated earlier, prior to the introduction of Generative AI, the Document Understanding tool required a good deal of time for defining document formats and training the bot to recognize those formats. With the AI based document classifier and data extractor, the time required to define a form and train the bot has been greatly reduced – sometimes by as much as 50-80%.

This timing factor is incredibly important because it reduces the overall automation cost, as bot training was a sizeable allocation of Document Understanding projects. Lower implementation costs increase ROI, which makes Document Understanding an important cost-effective option for all organizations.

* It is important to note, that although the training time is reduced, the tool still includes the ability for humans to validate any data which is in question of failing the extraction process.

Easier Setup and Data Retrieval

Data retrieval and formatting rules are created with key phrases and prompts, resulting in a low code setup. AI is used to generate algorithms that can be applied with NLP to translate and understand the forms to extract the desired data.

For instance, the automation can be set up to retrieve order date from a document. Once text is translated, DU can recognize any text within the document which contains “Order Date”. Using prompts, a developer can easily define that the data field associated with Order Date should be retrieved and also specify in which format (YYYYMMDD) the date should be retrieved. This greatly reduces bot training time and the amount of data manipulation which must be programmed in order to format data for processing.

Higher Confidence in Results

Due to the ability of NLP to analyze and define context, the results of data extraction are categorized at a much higher confidence level. The generative AI extractor is built on machine learning models that continuously improve over time. As it processes more documents, it becomes better at understanding and extracting data, enhancing its performance and reliability. As stated earlier, although human validation may still be needed at some levels, the amount of required human interaction has decreased.

Increased Use Case Opportunities

Multiple Document Types

Prior to DU with Generative AI, document types needed to be defined to the tool. With NLP, multiple document types and formats can be processed with classification determined by key phrases. This widens the ability to process even more documents and form types. Every industry can make use of Document Understanding.

Digitized Documents + NLP = More Meaningful Results

With enhanced data retrieval due to NLP, digitized documents can now better support automated chatbots and customer or patient interaction. Applications can obtain more meaningful search results to satisfy customers. In Healthcare, patients can experience answers to questions with information pulled from multiple documents such as lab results, personal medical records and doctor’s notes to present a more informed response.

New Features can be Implemented into Existing Automations

The tool also seamlessly integrates with existing UiPath automation workflows. This means that businesses can easily incorporate the generative extractor into their current processes without significant disruptions or the need for extensive retraining.

In Summary

By automating the document extraction process, organizations can save considerable time and resources. Automation minimizes the need for manual data entry, allowing employees to focus on more strategic tasks.

Data can be extracted from multiple form types including invoices, receipts, contracts, financial statements, shipping and delivery documents, medical records, lab reports and insurance forms – enabling industries across the board to harness the power of this tool.

As businesses continue to embrace digital transformation, tools like Document Understanding will play a crucial role in driving operational excellence and innovation….and with the newest release of GenAI, it just became easier and faster to use!

About SphereGen

SphereGen logo on white backgroundSphereGen is a unique solutions provider that specializes in cloud-based applications, Intelligent Automation, and Extended Reality (AR/VR/MR). We offer full-stack custom application development to help customers employ innovative technology to solve business problems.

Learn more about what we do in Intelligent Automation: https://www.spheregen.com/intelligent-automation

SphereGen logo on white backgroundSphereGen is a unique solutions provider that specializes in cloud-based applications, Intelligent Automation, and Extended Reality (AR/VR/MR). We offer full-stack custom application development to help customers employ innovative technology to solve business problems.

Learn more about what we do in Intelligent Automation: https://www.spheregen.com/intelligent-automation

microsoft partner badge       uipath silver partner badge