Azure-AI-Document-Intelligence
Version: 1
Azure AI Document Intelligence
Document Intelligence is a cloud-based service that enables you to build intelligent document processing solutions. Massive amounts of data, spanning a wide variety of data types, are stored in forms and documents. Document Intelligence enables you to effectively manage the velocity at which data is collected and processed and is key to improved operations, informed data-driven decisions, and enlightened innovation.Core Features
- General extraction models
- Description: General extraction models enable text extraction from forms and documents and return structured business-ready content ready for your organization's action, use, or development.
- Key Features
- Read model allows you to extract written or printed text liens, words, locations, and detected languages.
- Layout model, on top of text extraction, extracts structural information like tables, selection marks, paragraphs, titles, headings, and subheadings. Layout model can also output the extraction results in a Markdown format, enabling you to define your semantic chunking strategy based on provided building blocks, allowing for easier RAG (Retrieval Augmented Generation).
- Prebuilt models
- Description: Prebuilt models enable you to add intelligent document processing to your apps and flows without having to train and build your own models. Prebuilt models extract a pre-defined set of fields depending on the document type.
- Key Features
- Financial Services and Legal Documents: Credit Cards, Bank Statement, Pay Slip, Check, Invoices, Receipts, Contracts.
- US Tax Documents: Unified Tax, W-2, 1099 Combo, 1040 (multiple variations), 1098 (multiple variations), 1099 (multiple variations).
- US Mortgage Documents: 1003, 1004, 1005, 1008, Closing Disclosure.
- Personal Identification Documents: Identity Documents, Health Insurance Cards, Marriage Certificates.
- Custom models
- Description: Custom models are trained using your labeled datasets to extract distinct data from forms and documents, specific to your use cases. Standalone custom models can be combined to create composed models.
- Key Features
- Document field extraction models
- Custom generative: Build a custom extraction model using generative AI for documents with unstructured format and varying templates.
- Custom neural: Extract data from mixed-type documents.
- Custom template: Extract data from static layouts.
- Custom composed: Extract data using a collection of models. Explicitly choose the classifier and enable confidence-based routing based on the threshold you set.
- Custom classification models
- Custom classifier: Identify designated document types (classes) before invoking an extraction model.
- Document field extraction models
- Add-on capabilities
- Description: Use the add-on features to extend the results to include more features extracted from your documents. Some add-on features incur an extra cost. These optional features can be enabled and disabled depending on the scenario of the document extraction.
- Key Features
- High resolution extraction
- Formula extraction
- Font extraction
- Barcode extraction
- Language detection
- Searchable PDF output
Use Cases
- Accounts payable: A company can increase the efficiency of its accounts payable clerks by using the prebuilt invoice model and custom forms to speed up invoice data entry with a human in the loop. The prebuilt invoice model can extract key fields, such as Invoice Total and Shipping Address.
- Insurance form processing: A customer can train a model by using custom forms to extract a key-value pair in insurance forms and then feeds the data to their business flow to improve the accuracy and efficiency of their process. For their unique forms, customers can build their own model that extracts key values by using custom forms. These extracted values then become actionable data for various workflows within their business.
- Bank form processing: A bank can use the prebuilt ID model and custom forms to speed up the data entry for "know your customer" documentation, or to speed up data entry for a mortgage packet. If a bank requires their customers to submit personal identification as part of a process, the prebuilt ID model can extract key values, such as Name and Document Number, speeding up the overall time for data entry.
- Robotic process automation (RPA): Using the custom extraction model, customers can extract specific data needed from distinct types of documents. The key-value pair extracted can then be entered into various systems such as databases, or CRM systems, through RPA, replacing manual data entry. Customers can also use custom classification model to categorize documents based on their content and file them in proper location. As such, an organized set of data extracted from the custom model can be an essential first step to document RPA scenarios for businesses that manage large volumes of documents regularly.
Benefits
- No experience required: Incorporate Document Intelligence features into your projects with no machine learning experience required.
- Effortlessly customize your models: Training your own custom extraction and classification model can be done with as little as one document labeled, making it easy to train your own models.
- State of the art models: ready for use APIs, constantly enhanced models, and flexible deployment options reduce the need for ongoing manual training or extensive customization.
Technical Details:
- Deployment: Deployment options may vary by service, reference the following docs for more information: Use Document Intelligence models and Install and run containers .
- Requirements: Requirements may vary slightly depending on the model you are using to analyze the documents. Reference the following docs for more information: Service quotas and limits .
- Support: Support options for AI Services can be found here: Azure AI services support and help options - Azure AI services | Microsoft Learn .
Pricing
View up-to-date pricing information for the pay-as-you-go pricing model here: Azure AI Document Intelligence pricing .Model Specifications
Last UpdatedSeptember 2024
PublisherMicrosoft