
Tesseract was the best open-source OCR for a long time. But I'd argue that docTR is better now, as it's more accurate out of the box and GPU-accelerated. It implements a variety of text detection and recognition model architectures that you can combine in a modular pipeline. And you can train or fine-tune in PyTorch or TensorFlow to get even better accuracy on your domain.
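
To give a rough idea of what the modular pipeline looks like, here is a minimal sketch, assuming docTR is installed with one of its backends. `ocr_predictor` and `DocumentFile` are docTR's own entry points, and `db_resnet50` / `crnn_vgg16_bn` are one detection/recognition pairing among those it ships. The helper names (`build_predictor`, `lines_from_export`, `ocr_image`) and the `min_confidence` parameter are mine, purely for illustration, and the nested layout of the exported dict is what I'd expect from `result.export()`, so check it against your installed version.

```python
from __future__ import annotations

from typing import Any


def build_predictor(det_arch: str = "db_resnet50", reco_arch: str = "crnn_vgg16_bn"):
    """Pair a text detection model with a text recognition model."""
    # Imported lazily so lines_from_export() can be used without docTR installed.
    from doctr.models import ocr_predictor

    return ocr_predictor(det_arch=det_arch, reco_arch=reco_arch, pretrained=True)


def lines_from_export(export: dict[str, Any], min_confidence: float = 0.0) -> list[str]:
    """Flatten an exported result (pages > blocks > lines > words) into text lines.

    min_confidence is a hypothetical filter, not a docTR option: words scoring
    below it are dropped before the line is joined.
    """
    lines: list[str] = []
    for page in export.get("pages", []):
        for block in page.get("blocks", []):
            for line in block.get("lines", []):
                words = [
                    word["value"]
                    for word in line.get("words", [])
                    if word.get("confidence", 1.0) >= min_confidence
                ]
                if words:
                    lines.append(" ".join(words))
    return lines


def ocr_image(path: str) -> list[str]:
    """Run detection and recognition on one image and return its text lines."""
    from doctr.io import DocumentFile

    predictor = build_predictor()
    doc = DocumentFile.from_images(path)
    result = predictor(doc)
    return lines_from_export(result.export())
```

Swapping architectures is then just a matter of passing different `det_arch` / `reco_arch` strings to `build_predictor`, which is the part Tesseract doesn't really have an equivalent for.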

