OCR stands for Optical Character Recognition.
It is often called text recognition. The technology identifies the individual characters in a document, which makes your digital letters searchable.
Roughly, the process works like this:
- The program analyzes the structure of the document. It divides the page into the different elements (e.g. sender, body, subject line).
- Then the individual lines of text are separated into words, and the words into individual letters.
- Once the program has isolated the individual letters, it compares each one with a set of stored patterns to determine which letter it is.
- The results are reassembled into text, which the smart full-text search can then find.
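
To make the steps above concrete, here is a minimal toy sketch in Python. It is not how any particular OCR product works: the 3x5 letter patterns, the four-letter alphabet, the `WORD_GAP` threshold and all function names are assumptions made up for illustration. It splits a tiny black-and-white "page" into words and letters by looking for blank columns, compares each letter with the stored patterns, and reassembles the result into text that can be searched.

```python
# Toy 3x5 letter patterns ('#' = ink, '.' = blank). Hypothetical data:
# real OCR engines use far richer models than four fixed bitmaps.
PATTERNS = {
    "H": ["#.#", "#.#", "###", "#.#", "#.#"],
    "I": ["###", ".#.", ".#.", ".#.", "###"],
    "L": ["#..", "#..", "#..", "#..", "###"],
    "O": ["###", "#.#", "#.#", "#.#", "###"],
}
WORD_GAP = 2  # assumed: this many blank columns or more separate two words


def render(text):
    """Draw text as a 5-row bitmap: 1 blank column between letters, 3 between words."""
    return [
        "...".join(".".join(PATTERNS[ch][r] for ch in word) for word in text.split())
        for r in range(5)
    ]


def segment(bitmap):
    """Split a one-line bitmap into words, each a list of letter bitmaps."""
    width = len(bitmap[0])
    ink = [any(row[x] == "#" for row in bitmap) for x in range(width)]
    words, current, start, gap = [], [], None, 0
    for x in range(width + 1):  # one extra step to flush the last letter
        if x < width and ink[x]:
            if start is None:
                if gap >= WORD_GAP and current:
                    words.append(current)  # wide gap: the previous word is done
                    current = []
                start = x
            gap = 0
        else:
            if start is not None:
                current.append([row[start:x] for row in bitmap])
                start = None
            gap += 1
    if current:
        words.append(current)
    return words


def classify(glyph):
    """Return the letter whose pattern differs from the glyph in the fewest pixels."""
    def distance(pattern):
        return sum(
            a != b
            for glyph_row, pattern_row in zip(glyph, pattern)
            for a, b in zip(glyph_row, pattern_row)
        )
    return min(PATTERNS, key=lambda ch: distance(PATTERNS[ch]))


def recognize(bitmap):
    """Segment, classify each letter, and reassemble the text."""
    return " ".join("".join(classify(g) for g in word) for word in segment(bitmap))


page = render("HI LO")
text = recognize(page)
print(text)                  # HI LO
print("LO" in text.split())  # True: the recognized text is now searchable
```

Because `classify` picks the closest pattern rather than demanding an exact match, a letter with a stray or missing pixel can still be recognized, as long as the damage does not split the letter in two during segmentation.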