What is OCR full form ?

 OCR stands for optical character recognition.

Optical character recognition (OCR) is a term that refers to the process of recognizing characters It's also known as a text recognition system or an optical character reader (OCR). Its purpose is to convert scanned paper documents or digital camera photos of documents into readable, editable, and searchable data.

What is OCR full form ?
What is OCR full form ?

A scanned page of a physical document can be displayed on the screen and read, but it is nothing more than a series of black and white dots to the computer. OCR was created to allow a computer to read a scanned document and make a soft copy. OCR scans the text of a scanned document and converts the characters into code that makes the text machine-readable, allowing it to be converted into an electronic format or soft copy that users can edit, format, search, and read, much like a document prepared with a word processor.


Thus, it helps computer recognize words and characters on a scanned page or digital images of physical printed or handwritten documents by using the optical properties of words and characters printed on a scanned page or document.

An optical character recognition (OCR) device is a hardware and software combination that converts physical documents into machine-readable text. OCR hardware (an optical scanner or a circuit board) copies and scans text, while software handles further processing. Artificial intelligence can also be used to apply advanced methods of intelligent character recognition (ICR), such as the capacity to recognize language or handwriting style.

How OCR works:

  • The document's physical shape is processed by the scanner.
  • The software analyzes the structure of the document after it has been scanned and converts it to a coloured (black and white) version.
  • The document is scanned and analyzed for light and dark areas.
  • Characters are detected in dark areas, whereas backdrop is distinguished in bright parts.
  • The dark areas are examined further to see if they contain letters or numeric digits. The lines are broken down into words, and the words are broken down into characters. OCR tries to figure out if the dark regions correspond to a specific letter or number.
  • Once the characters are singled out and identified, they are converted into an ASCII code that can be used by computer systems to handle further manipulations and thereby presents you the recognized text.
OCR software may vary in their techniques, but generally analyse one character, word or block of text at a time and then identify characters using one of the following two algorithms.

1) Pattern recognition: 

OCR software is created by feeding it examples of text in various fonts and formats in order for it to recognize the shape or pattern of characters and accurately identify them.

2) Feature detection: 

OCR applications use the feature of a character or a number in this technique. The number of angled lines, crossing lines, or curves in a character are examples of characteristics. For example, the letter 'A' could be recorded as two lines united at one end and connected in the middle by a horizontal line.

Post a Comment

0 Comments