AIMultiple Research

Still using Intelligent Character Recognition? Upgrade in 2024!

Intelligent character recognition (ICR) technology, also called intelligent OCR (iOCR), was developed in the 1990s. It relies on OCR and pattern matching to automate the extraction of machine-readable data from documents, which companies then use to process documents automatically. The aim of iOCR was to help businesses handle paper-based processes without human intervention by automating data extraction from paper documents. With the rise of deep learning (DL) since the 2010s, data extraction solutions relying on DL can achieve much higher rates of automation than iOCR solutions.

This problem is still relevant. Digital transformation is a must for businesses. However, digitalization is not easy, especially for companies whose data is mostly stored in paper-based documents. According to a study conducted by AIIM, 49% of data capture volume is paper, and businesses need to convert their paper documents into digital form to leverage the data they contain.

What is Intelligent Character Recognition (ICR)?

Intelligent character recognition (ICR), or intelligent optical character recognition (iOCR), is an extended version of optical character recognition (OCR) technology that converts scanned images of text into machine-readable data, mostly using rule-based pattern recognition.

This is not the definition you will find in resources like Wikipedia. It seems that most definitions of this topic are written by vendors who exaggerate the capabilities of their solution category. ICR or iOCR is a fragile technology that leads to low levels of automation. As a result, companies are migrating from iOCR solutions to deep learning based data extraction solutions, and interest in ICR is falling, as seen below:

interest in iOCR tech is decreasing

What is the difference between ICR and OCR?

While OCR converts images into text, ICR converts images into machine-readable data.

You can see below Google Cloud Vision OCR converting an image into text:

An illustration of OCR from Vision AI

An iOCR would convert the text into structured data. To be able to do this, an iOCR works with specific types of documents where specific data fields need to be captured. Here you can see an invoice shared as an image converted to machine-readable data:

Hypatos output showing conversion of invoice image into machine readable data
Source: Hypatos

In addition, ICR is a more expensive technology since it goes one step further than OCR and provides machine-readable data.

What are the types of ICR software?

ICR needs to be specific to the type of document it will process. ICR can rely on:

  • Constrained handprint: In these documents, specific areas constrained by bounding boxes are left for humans to fill out, with clear labels next to them. This enables ICR to identify key-value pairs with ease.
Source: Abbyy
  • Document type specific ICR: Using specific text clues, rules can be used to extract data from documents. This is commonly used to process invoices. For example, text like “Total”, “total amount”, “gross amount” or “amount due” tends to be followed by the amount to be paid. Such rules are programmed into ICR to build document-specific ICR solutions. However, these approaches are fragile: suppliers use a myriad of different terms in their documents, and rules are not a robust way to deal with them. For more, please see our article on modern approaches to data extraction involving deep learning.
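
To make the rule-based approach described above concrete, here is a minimal sketch in Python. It is not any vendor's implementation: the label list comes from the examples in the text, while the function name `extract_total` and the amount format it accepts are assumptions made for illustration.

```python
import re
from typing import Optional

# Labels taken from the examples in the text; a real rule set would be
# much longer and maintained per supplier or document type.
TOTAL_LABELS = ["total amount", "gross amount", "amount due", "total"]

# Rule: a known label, an optional ":" or "-", an optional currency
# symbol, then a number such as 1,190.00 (assumed amount format).
_TOTAL_RULE = re.compile(
    r"\b(?:" + "|".join(re.escape(label) for label in TOTAL_LABELS) + r")\b"
    r"\s*[:\-]?\s*[$€£]?\s*"
    r"(\d[\d,]*(?:\.\d+)?)",
    re.IGNORECASE,
)


def extract_total(ocr_text: str) -> Optional[float]:
    """Return the amount following the first known label, or None."""
    match = _TOTAL_RULE.search(ocr_text)
    if match is None:
        return None
    # Drop thousands separators before converting to a number.
    return float(match.group(1).replace(",", ""))


if __name__ == "__main__":
    print(extract_total("Subtotal: 1,000.00\nTotal amount: $1,190.00"))
    # A supplier using wording outside the rule set is silently missed.
    print(extract_total("Balance payable: 1,190.00"))
```

The second call illustrates the fragility: an invoice that says “Balance payable” instead of one of the programmed labels yields no result until someone adds a new rule.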

What are its use cases?

ICR applications focus on capturing documents and entering the captured data into the company’s enterprise content management system. Common documents where ICR can help are:

  • Accounts payable
  • Invoices
  • Offers
  • Purchase orders
  • Employee onboarding forms for payroll
  • Travel expenses
  • Customer surveys

There are some industry-specific use cases as well. For banking:

  • Loan contracts
  • Bank statements

For logistics:

  • Freight bills
  • Bills of lading

For real estate:

  • VA home loan forms
  • Borrower forms
  • Homeowner and contractor agreements
  • FHA applications

For healthcare and insurance:

  • Patient admittance forms
  • Medical records
  • Medical prescriptions
  • Claims

You can read more about these use cases in our document automation guide.

What are the benefits of intelligent character recognition vs manual processes?

Operational benefits

  • Reduced workforce: ICR technology can reduce the human resources required for data entry.
  • Improved efficiency: Automating paper-based document processes such as data extraction saves time and reduces errors, which improves the efficiency of document operations compared to manual operations.

Digitization benefits

  • Improved security: Paper documents can be misplaced, stolen, or destroyed. Storing paper-based documents in digital formats prevents such cases. However, organizations should also leverage information security technologies to prevent data breaches on their servers.
  • Reduced reliance on a physical office: Since all paper-based documents can be converted into digital form, organizations don’t need to rent office space to store stacks of files.
  • Improved customer service: Once documents are scanned and converted into computer-readable text, it is easier for the customer service team to access personal or order-related information to process customers’ requests.
  • Makes documents editable: When a document needs to be updated, companies can edit it rather than rewriting it.
  • Improved compliance: Digital documents enable an audit trail that eases the maintenance of records and other sensitive data.

Though iOCR has significant benefits, deep learning based data extraction approaches build on these benefits and provide data at higher levels of accuracy.

What are the benefits of deep learning based data extraction vs iOCR?

Upgrading to a deep learning based data extraction solution brings the same types of benefits as an iOCR solution, but the positive impact is significantly bigger. Deep learning based approaches improve data extraction accuracy, which significantly reduces operational costs.

For example, consider a company aiming to extract 10 data fields from a document such as an invoice. Typically, iOCR solutions tend to capture each field with 50% accuracy. This leads operators to examine almost all documents and correct 5 fields (10 × 50%) on average. A deep learning based solution with 98% field-level accuracy could lead to ~80% of documents being processed with no human involvement, a significant improvement in automation.
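
The arithmetic behind these figures can be checked with a short Python sketch. It assumes that errors on different fields are independent, which the text does not state and which real documents may not satisfy; the function names are ours.

```python
def expected_corrections(field_accuracy: float, num_fields: int) -> float:
    """Average number of fields an operator must fix per document."""
    return num_fields * (1.0 - field_accuracy)


def touchless_rate(field_accuracy: float, num_fields: int) -> float:
    """Share of documents in which every field is correct.

    Assumes field errors are independent of each other.
    """
    return field_accuracy ** num_fields


if __name__ == "__main__":
    for name, accuracy in (("iOCR", 0.50), ("deep learning", 0.98)):
        fixes = expected_corrections(accuracy, 10)
        rate = touchless_rate(accuracy, 10)
        print(f"{name}: {fixes:.1f} fixes per document, "
              f"{rate:.1%} of documents need no review")
```

Under that assumption, 50% field accuracy leaves roughly one document in a thousand fully correct, which is why operators end up examining almost all of them. At 98% field accuracy, about 82% of documents have all 10 fields correct, in line with the ~80% figure above.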

If you need guidance to choose the right ICR vendor, check out our data-driven list of data extraction tools, and feel free to contact us.


Cem Dilmegani
Principal Analyst

Cem has been the principal analyst at AIMultiple since 2017. AIMultiple informs hundreds of thousands of businesses (as per similarWeb) including 60% of Fortune 500 every month.

Cem's work has been cited by leading global publications including Business Insider, Forbes, Washington Post, global firms like Deloitte, HPE, NGOs like World Economic Forum and supranational organizations like European Commission. You can see more reputable companies and media that referenced AIMultiple.

Throughout his career, Cem served as a tech consultant, tech buyer and tech entrepreneur. He advised businesses on their enterprise software, automation, cloud, AI / ML and other technology related decisions at McKinsey & Company and Altman Solon for more than a decade. He also published a McKinsey report on digitalization.

He led technology strategy and procurement of a telco while reporting to the CEO. He has also led commercial growth of deep tech company Hypatos that reached a 7 digit annual recurring revenue and a 9 digit valuation from 0 within 2 years. Cem's work in Hypatos was covered by leading technology publications like TechCrunch and Business Insider.

Cem regularly speaks at international technology conferences. He graduated from Bogazici University as a computer engineer and holds an MBA from Columbia Business School.

Comments

1 Comment
Don Winant
Nov 08, 2023 at 00:42

8 NOV 2023

Cem Dilmegani,

What software package would be appropriate for extraction of medical research data comprising (a) word recognition, (b) data/numerical recognition, and (c) special character recognition?

I have approximately 5,000 single-page documents that I would like to scan to extract the data rather than inputting the data by hand.

Respectfully,
Don Winant
Curtin University
Perth, Australia

Cem Dilmegani
Nov 12, 2023 at 16:04

Hi Don, why don’t you use your organization’s cloud service provider’s solution? AWS, Google and Microsoft Azure all offer state-of-the-art data extraction solutions.

However, before doing so, I would check with your organization’s tech team and get guidance on how you can process healthcare data involving personal information with your organization’s cloud service provider. They may recommend specific services, data retention policies, etc.
