2 thoughts on “The effectiveness of OCR on historic newspapers”

  1. enduringlegacy March 26, 2014 / 11:38 am

    Thank you, Liz. This is a question I was considering asking you, but you have addressed it well here. I hope that MdHS’s digitized copies will have better OCR results. I thought of their project to scan from originals when you said: “it’s always ideal to scan the original source materials.” I hope that you are working with them to encourage their digitization to continue. You make a good argument that it is worth it (even if many of their papers cover the same issue dates as the microfilm collection). I have no idea at this point when they might put their digitized images online, but hopefully we will have another announcement from them soon.

    I’m already able to find things by searching page-by-page in Der Deutsche Correspondent that do not appear in OCR searches. So I know that the OCR is not as good as we want it to be.

    Best Regards
