
As of February 2015 Libcatcode is no longer accepting new questions. This site will stay up as the owner decides the next steps for preserving the content on the site. Thank you all for your support in the past three years!

Hello all,

I've got a few hundred images to run OCR on, and I'm not sure which FOSS option is best. I've been steered toward OCRopus, Tesseract, and DocMorph. In the meantime I've been playing around with this goofy online web service: http://www.onlineocr.net/Default.aspx

It seems to get the "job" done fairly well, though it is obviously a workflow nightmare. I'm not very familiar with how to set up OCRopus and the others. Thoughts?

asked Feb 14 '12 at 15:42 by todrobbins


Hi todrobbins,

Have you seen http://code.google.com/p/tesseract-ocr/ ?
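For a few hundred images, one way to avoid the upload-one-at-a-time workflow is to loop over a folder and call the `tesseract` command-line tool on each file. The following is only a rough sketch, assuming Tesseract is installed and on your PATH and that the English language data (`eng`) is available. The function names, the list of image extensions, and the folder names are made up for illustration.

```python
import subprocess
from pathlib import Path
from typing import List

# Extensions treated as images here; an assumption, adjust to your collection.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def build_command(image: Path, out_dir: Path, lang: str = "eng") -> List[str]:
    """Return the call for one image: tesseract IMAGE OUTPUTBASE -l LANG."""
    # Tesseract adds the ".txt" extension to the output base itself.
    return ["tesseract", str(image), str(out_dir / image.stem), "-l", lang]


def plan_batch(in_dir: Path, out_dir: Path, lang: str = "eng") -> List[List[str]]:
    """Build one command per image file in in_dir, sorted by path."""
    images = sorted(
        p for p in in_dir.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    return [build_command(p, out_dir, lang) for p in images]


def run_batch(in_dir: Path, out_dir: Path, lang: str = "eng") -> List[Path]:
    """Run every planned command and return the images that failed."""
    out_dir.mkdir(parents=True, exist_ok=True)
    failed = []
    for cmd in plan_batch(in_dir, out_dir, lang):
        result = subprocess.run(cmd, capture_output=True, text=True)
        if result.returncode != 0:
            failed.append(Path(cmd[1]))
    return failed
```

Something like `failed = run_batch(Path("scans"), Path("ocr_text"))` would then write one text file per image into `ocr_text` and give you back a list of files to look at by hand. Keeping the command building separate from the running makes it easy to print the plan first and check it before committing to the whole batch.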


answered Nov 09 '12 at 07:51 by padraic


Thanks @padraic! I have looked into Tesseract, as I mentioned in my question, but what do you or others think is the best for cultural-type collections? Also, do you know of any good tutorials for setting Tesseract up/configuring it?

(Nov 09 '12 at 11:51) todrobbins

