Content-type: text/html ~ Stephen's Web ~ Running OCR against PDFs and images directly in your browser

Stephen Downes

Knowledge, Learning, Community

I tested this and it does work, though with the caveats expressed by Simon Willison in this post. What he has developed, in a nutshell, is a script that will convert a PDF or image to text (using an optical character recognition (OCR) algorithm called Tesseract) right in your browser - no uploading required! Here it is. This post describes how he created the tool, a process that involved working with Claude 3. This, I think, is becoming a new normal. Even if they do nothing more than save typing time, having an AI coding assistant is becoming a powerful developer tool.

Today: 2 Total: 1285 [Direct link] [Share]

Stephen Downes Stephen Downes, Casselman, Canada

Copyright 2024
Last Updated: May 22, 2024 5:27 p.m.

Canadian Flag Creative Commons License.