Some practical results from using GPT4 vision for OCR

Follow the full discussion on Reddit.
Hi there, wasn't really sure of where to post it, so I'll try here. Lots of excitement out there about GPT4 vision, but when trying it out on some real data for a real project, it is lacking. It will hallucinate at times, or refuse to perform the task, and the accuracy isn't substantially better than Tesseract. However, things get very good if the two are combined. More details and source data at the link below. https://pslusarz.github.io/articles/2023/12/22/compare-ocr-tesseract-gpt4-nara-rolls.html

Comments

There's unfortunately not much to read here yet...

Discover the Best of Machine Learning.

Ever having issues keeping up with everything that's going on in Machine Learning? That's where we help. We're sending out a weekly digest, highlighting the Best of Machine Learning.

Join over 900 Machine Learning Engineers receiving our weekly digest.

Best of Machine LearningBest of Machine Learning

Discover the best guides, books, papers and news in Machine Learning, once per week.

Twitter