Introduction of TH-OCR text recognition system

1. TH-OCR

TH-OCR is the abbreviation of English OpTIcal Character RecogniTIon, which means optical character recognition, commonly known as text recognition. Its working principle is to obtain text and picture information on paper through optical input devices such as scanners or digital cameras, and use various pattern recognition. The algorithm analyzes the morphological characteristics of the text, determines the standard encoding of Chinese characters, and stores them in a text file in a common format. From this, it can be seen that OCR actually allows the computer to recognize the words and realize automatic text input. It is a fast, labor-saving and efficient text input method.

The outstanding features of TH-OCR:

â—‡ Simultaneous bilingual Chinese and English, with the highest recognition rate, ranking the world's leading level.

â—‡ Can recognize black and white, grayscale and color images, and can read a variety of image formats.

â—‡ The first electronic document layout restoration function for recognition results, what you see is what you get.

â—‡ The first recognition function for Japanese, Korean, Japanese-English mixed arrangement, Korean-English mixed arrangement, the recognition rate is over 98%.

Several advantages of TH-OCR:

1. It is the only multi-body character recognition system that can recognize more than 20,000 Chinese characters. Chinese character recognition is the best in China.

2. Recognize Chinese characters and English, Japanese and English, Korean and English at the same time.

3. The highest recognition rate of Chinese characters. Wentong TH-OCR has passed the 863 "intelligent expert group's evaluation of hundreds of thousands of words and rigorous testing of products by the China Software Evaluation Center. The recognition accuracy rate exceeds 99.5%, which represents the highest level of printed text recognition.

4. Support multiple environment interfaces. Wentong TH-OCR supports WINDOWS environment and various internal codes such as GB, BIG5, GBK, JIS, SHIFT-JIS and KSC. It can be used for WINDOWS NT and WINDOWS 98/2000 / XP, suitable for use in various regions of the world. TH-OCR also has a self-learning function, no matter what unfamiliar characters can be learned through keyboard input, which greatly expands the recognition character set of the OCR system.

2. Hanwang OCR Text King

Hanwang text king. Hanwang Text King is a set of high-quality input and efficient office software system developed by Hanwang Company using the latest printed character recognition technology and integrated text reading proofreading. Hanwang Text King has a high recognition rate and fast recognition speed, and has customized a variety of simple working modes for users: automatic, single-step intelligent working mode and professional batch working mode. I believe it will become a good assistant for your office.

Technical index:

● Recognize characters:

Simplified character set: more than 6800 Chinese characters in the first and second grades of GB2312-80.

Pure English character set.

Simplified and Traditional Character Set: In addition to simplified Chinese characters, you can also mix more than 5400 traditional Chinese characters and traditional Chinese characters in Hong Kong.

● Identify the type of font:

It can recognize more than one hundred fonts such as Song, Imitation Song, Kai, Hei, Wei Bei, Li Shu, Yuan Ting, Xing Kai, etc., and supports multiple font mixing.

● Identify the font size:

Initial number-small sixth font.

● Form identification:

It can automatically judge and recognize various general printed forms. The spreadsheet is accurate and beautifully restored, and the output spreadsheet can be edited at will.

● Correct recognition rate:

Under normal recognition conditions, the recognition accuracy of printed documents can be very high.

● Recognition speed:

On the PII-233 computer, the recognition speed of printed documents reaches 120 words / second.

Features:

1. Intelligent identification, accurate: full intelligent identification core, fast identification speed, high identification efficiency

2. One-key scanning, WORD output: user operation is simple and fast, seamlessly connected with WORD, just simply press a key, the document is automatically output to WORD

3. A variety of modes, you can choose: users can choose automatic, single-step smart work mode or professional batch work mode according to work needs or personal habits

4. Complex layout, automatic analysis: intelligent analysis of various Chinese, English, Chinese, traditional, table, and graph mixed format text without excessive manual intervention

5. Form input, easy to realize: diversified form identification, perfect and accurate form restoration, and can be transformed into an electronic form that can be edited in an instant

6. Batch input, fast and efficient: large batch of file scanning, fully automated text recognition, fast speed and higher efficiency

7. Layout restoration, original text reproduction: the original layout format is accurately retained, and the original appearance of the text is accurately restored

8. File saving, multiple formats: the recognized documents can be saved as files in multiple formats (PDF, HTML, RTF, XLS, TXT), convenient and practical

9. Project management, easier: project files are easy to manage, work progress is saved at any time, open the project file to continue working

10. Text reading and translation, saving time and effort: Hanwang reading elves to avoid eyestrain and let you listen as you like; translation software helps you cross language barriers

CNC Machining Parts

Cnc Machining Parts,Cnc Machined Fitting Parts,Cnc Machining Accessories,Cnc Machining Aluminum Accessories

Dongguan Formal Precision Metal Parts Co,. Ltd , https://www.formalmetal.com

Posted on