OCR stands for optical character recognition.
Optical character recognition (OCR) is the process of recognizing printed or handwritten characters in an image. It is also known as text recognition, and a device that performs it is sometimes called an optical character reader. Its purpose is to convert scanned paper documents or digital camera photos of documents into readable, editable, and searchable data.
A scanned page of a physical document can be displayed on the screen and read by a person, but to the computer it is nothing more than a grid of black and white dots. OCR was created to let a computer read a scanned document and produce a soft copy. OCR examines the image of the scanned text and converts the characters into character codes, which makes the text machine-readable. The result is an electronic document that users can edit, format, search, and read, much like one prepared with a word processor.
In short, OCR helps a computer recognize words and characters in scanned pages or digital images of printed or handwritten documents by analyzing their optical properties, that is, the patterns of light and dark that the characters form on the page.
An optical character recognition (OCR) system is a combination of hardware and software that converts physical documents into machine-readable text. The OCR hardware (an optical scanner or a dedicated circuit board) captures an image of the text, while the software handles further processing. Artificial intelligence can also be used to apply more advanced methods, known as intelligent character recognition (ICR), such as identifying the language or the handwriting style.
How OCR works:
- The scanner captures an image of the physical document.
- The software analyzes the structure of the scanned document and converts the image to a two-colour (black and white) version.
- The black and white image is analyzed for light and dark areas.
- Dark areas are treated as possible characters, while light areas are treated as background.
- The dark areas are examined further to see whether they contain letters or numeric digits. Lines are broken down into words, and words are broken down into characters. OCR then tries to work out whether each dark region corresponds to a specific letter or number.
- Once the characters have been singled out and identified, they are converted into character codes such as ASCII, which computer systems can store and manipulate, and the recognized text is presented to the user.
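The steps above can be sketched in a few lines of Python. This is only a toy illustration under strong assumptions, not how any real OCR engine is implemented: the 5x3 glyph templates, the fixed threshold of 128, and the helper names (`binarize`, `split_columns`, `classify`, `recognize`, `render`) are all hypothetical and chosen for this example. It handles a single line of text with at least one blank column between characters, and it identifies each character by simple template matching.

```python
# Hypothetical 5x3 glyph templates ("1" = ink). Real OCR engines use far richer models.
TEMPLATES = {
    "I": ("111", "010", "010", "010", "111"),
    "L": ("100", "100", "100", "100", "111"),
    "T": ("111", "010", "010", "010", "010"),
}


def binarize(gray, threshold=128):
    """Map grayscale pixels (0-255) to 1 for dark (ink) and 0 for light (background)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def split_columns(binary):
    """Return (start, end) column ranges that contain ink, split on blank columns."""
    width = len(binary[0])
    has_ink = [any(row[x] for row in binary) for x in range(width)]
    spans, start = [], None
    for x, ink in enumerate(has_ink):
        if ink and start is None:
            start = x                    # a dark region begins
        elif not ink and start is not None:
            spans.append((start, x))     # a dark region ends at a blank column
            start = None
    if start is not None:
        spans.append((start, width))
    return spans


def classify(glyph):
    """Pick the template with the fewest mismatching pixels."""
    def distance(template):
        return sum(
            g != int(t)
            for glyph_row, template_row in zip(glyph, template)
            for g, t in zip(glyph_row, template_row)
        )
    return min(TEMPLATES, key=lambda ch: distance(TEMPLATES[ch]))


def recognize(gray):
    """Binarize, segment into characters, and classify each segment."""
    binary = binarize(gray)
    text = ""
    for start, end in split_columns(binary):
        glyph = [row[start:end] for row in binary]
        text += classify(glyph)
    return text


def render(text, dark=30, light=220):
    """Build a fake grayscale 'scan' of the text, purely to demonstrate the pipeline."""
    rows = [[] for _ in range(5)]
    for ch in text:
        for y, template_row in enumerate(TEMPLATES[ch]):
            rows[y].extend(dark if bit == "1" else light for bit in template_row)
            rows[y].append(light)        # blank separator column between characters
    return rows


scan = render("LIT")
recognized = recognize(scan)
codes = [ord(ch) for ch in recognized]   # the character codes the computer stores
print(recognized, codes)
```

Real documents need much more than this: skew correction, adaptive thresholds, splitting lines and words, and classifiers that tolerate different fonts and noise. The sketch only shows how light and dark areas become isolated characters and then character codes.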
