Pure Javascript Multilingual OCR
Get Started

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine.

This library supports more than 100 languages, automatic text orientation and script detection, a simple interface for reading paragraph, word, and character bounding boxes. Tesseract.js can run either in a browser and on a server with NodeJS.

Check out the Example code and API docs on GitHub.

English Demo
Chinese Demo
Russian Demo
Drop an English image on this page or
Click here to recognize text in the demo image, or drop an English image anywhere on this page.