Electronic text recognition

The intelligent character recognition ( ICR ) is an extension of the optical character recognition (OCR). It is - roughly speaking - a handwriting recognition system that can use a computer to learn various forms of the medium during processing. ICR software is often based on adaptive programs that use artificial neural networks . Newly learned manuscripts find direct equivalents in the program's own database. This specially designed, electronic text recognition increases the application range of existing scanning devices for the processing of handwritten documents. OCR has so far only covered the area of processing printed documents in a satisfactory manner. Since the process of recognizing handwriting is much more demanding, the accuracy that has been achieved up to now has been insufficient. With well-structured documents, hit rates of more than 97% are possible with ICR, and the software requires several passes using alternating methodology. Each of them is weighted differently in the evaluation. Even separate methods for number and letter recognition are used. An important step for the ICR was the development of the so-called automated forms processing in 1993. This describes a three-stage process for the recording of a character or an image by the ICR software. In the first step, a recording of the document is created. This recording is processed by the ICR software in the second step and finally evaluated automatically in the last step.

Electronic text recognition increases the efficiency of optical character recognition noticeably. ICR software is used in the business world, for example, to process handwritten forms. ICR solutions are available from the following manufacturers:

company	product	ICR supported languages
ABBYY	ABBYY FlexiCapture ABBYY FlexiCapture Engine ABBYY FineReader Engine	Afrikaans, Albanian, Aymara, Azerbaijani (Latin), Basque, Bemba, Blackfoot, Breton, Bugotu, Bulgarian, Cebuano, Chamorro, Corsican, Crimean Tatar, Croatian, Crow, Czech, Dakota (Sioux), Dutch (Belgium), Dutch (Netherlands), English, Estonian, Straight, Evenk, Fiji, Finnish, French, Frisian, Friulian, Galician, Ganda, German, German (Luxembourg), German (new spelling), Greek, Guarani, Hani, Hausa, Hawaiian, Hungarian , Icelandic, Indonesian, Irish, Italian, Jingpo, Karachai-balkar, Kasub, Kawa, Kazakh, Kyrgyz, Congo, Kpelle, Kumyk, Kurdish, Latin, Latvian, Lithuanian, Luba, Malagasy, Malinke, Maori, Maya, Miao, Minangkabau , Mohawk, Moldavian, Mongol, Mordvin, Nahuatl, Nivkh, Nogay, Nyanja, Ojibwa, OldFrench, OldGerman, OldItalian, OldSpanish, Papiamento, Polish, Quechua, Romansh, Romanian, Romani, Rundi, Russian, Rwanda, Sami (Lapland), Samoa, Scottish Gaelic, Selkupian, Serbian (Latin), Slovak, Slovenian, Somali, Sotho, Spanish, Swahili, Swazi, Tagalog, Tahiti, Tok Pisin, Tonga, Tswana, Tun, Turkish, Uyghur (Latin), Ukrainian, Wolof, Xhosa, Zapotec, Ido, Interlingua
Accusoft	SmartZone ICR / OCR	German, English, Danish, Dutch, Finnish, French, German, Italian, Norwegian, Portuguese, Spanish and Swedish (.NET all, ActiveX only English)
ExperVision	TypeReader OpenRTK	English, French, German, Italian, Spanish, Portuguese, Danish, Dutch, Swedish, Norwegian, Hungarian, Polish, Simplified Chinese, Traditional Chinese, Russian, Finnish, and Polynesian
IRIS Group	IRISCapture Pro for Forms	Latin based languages
LEADTOOLS	LEADTOOLS ICR SDK modules	Catalan, Czech, Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Polish, Portuguese, Swedish, Spanish
reRecognition	Cadmos
Recogniform	Recogniform
CharacTell	SoftWriting

Electronic text recognition

See also