Show HN: OCR Arena – A playground for OCR models

I built OCR Arena as a free playground for the community to compare leading foundation VLMs and open-source OCR models side-by-side.

Upload any doc, measure accuracy, and (optionally) vote for the models on a public leaderboard.

It currently has Gemini 3, dots.ocr, DeepSeek, GPT5, olmOCR 2, Qwen, and a few others. If there are any others you'd like included, let me know!

50
16
kbyatnal
3 days ago
ocrarena.ai

ArcaneMoose
·
1 hour ago

I've been really impressed with this model specifically because of how insanely cheap it is: https://replicate.com/ibm-granite/granite-vision-3.3-2b

I didn't expect IBM to be making relevant AI models, but this thing is priced at $1 per 4,000,000 output tokens... I'm using it to transcribe handwritten input text, and it works very well and is super fast.
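To put that price in perspective, here is a rough back-of-the-envelope sketch. It uses only the quoted rate of $1 per 4,000,000 output tokens; the 500 output tokens per page and the 10,000-page workload are made-up assumptions for illustration, and any input or image charges are ignored.

```python
# Rate quoted in the comment above: $1 per 4,000,000 output tokens.
PRICE_USD = 1.0
TOKENS_PER_PRICE = 4_000_000


def cost_usd(output_tokens: int) -> float:
    """Cost in USD for a number of output tokens at the quoted rate."""
    return output_tokens * PRICE_USD / TOKENS_PER_PRICE


# Hypothetical workload: both figures below are assumptions, not measurements.
TOKENS_PER_PAGE = 500
PAGES = 10_000

if __name__ == "__main__":
    total = cost_usd(PAGES * TOKENS_PER_PAGE)
    print(f"{PAGES} pages at {TOKENS_PER_PAGE} tokens/page: ${total:.2f}")
```

Swap in your own tokens-per-page estimate; handwriting-heavy pages may produce very different output lengths.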

zzleeper
·
1 hour ago

Love this! I would have liked to see something like Textract as a pre-LLM baseline (but of course that's expensive), and also a distinction between handwritten and printed text.

But still, this is incredibly useful!

krashidov
·
1 hour ago

I would be curious to see how Sonnet does. Their models are pretty solid when it comes to PDFs.

kbyatnal
·
34 minutes ago

Sonnet/Opus is being added shortly!

fzysingularity
·
2 hours ago

FYI, one of the models in the battle was pretty slow to load. Are these also being rated on latency or just quality?

kbyatnal
·
20 minutes ago

Ultimately, there’s some intersection of accuracy x cost x speed that’s ideal, which can be different per use case. We’ll surface all of those metrics shortly so that you can pick the best model for the job along those axes.
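As a minimal sketch of what picking along those axes could look like: filter by a cost and latency budget, then take the most accurate model that remains. The model names and numbers below are placeholders invented for illustration, not OCR Arena results, and `ModelStats` and `pick_model` are hypothetical helpers rather than anything the site exposes.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class ModelStats:
    name: str
    accuracy: float       # fraction in [0, 1]
    cost_per_page: float  # USD per page
    latency_s: float      # seconds per page


def pick_model(
    models: Iterable[ModelStats], max_cost: float, max_latency: float
) -> Optional[ModelStats]:
    """Return the most accurate model within the budgets, or None if none fit."""
    eligible = [
        m for m in models
        if m.cost_per_page <= max_cost and m.latency_s <= max_latency
    ]
    return max(eligible, key=lambda m: m.accuracy, default=None)


# Placeholder data: illustrative values only.
candidates = [
    ModelStats("model_a", accuracy=0.97, cost_per_page=0.010, latency_s=8.0),
    ModelStats("model_b", accuracy=0.94, cost_per_page=0.002, latency_s=2.0),
    ModelStats("model_c", accuracy=0.90, cost_per_page=0.001, latency_s=1.0),
]

if __name__ == "__main__":
    choice = pick_model(candidates, max_cost=0.005, max_latency=3.0)
    print(choice.name if choice else "no model fits the budget")
```

A hard budget plus "best accuracy" is only one way to combine the axes; a weighted score would also work, and the right choice depends on the use case.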

andrewlu0
·
30 minutes ago

ideally we want people to rate based on quality - but i imagine some of the results are biased rn based on loading time

ianhawes
·
2 hours ago

Please add Chandra by Datalab

codeddesign
·
48 minutes ago

Most of these are general LLMs and not specifically OCR models. Where is Google Vision, Mistral, Paddle, Nanonets, or Chandra?

kbyatnal
·
31 minutes ago

We wanted to keep the focus on (1) foundation VLMs and (2) open source OCR models.

We had Mistral previously but had to remove it because, unfortunately, their hosted OCR API was very unstable and returned a lot of garbage results.

Paddle, Nanonets, and Chandra being added shortly!

arathis
·
2 hours ago

Claude would be good!

kbyatnal
·
30 minutes ago

Claude coming shortly (in the next ~1 hour)

dang
·
2 hours ago

[under-the-rug stub]

[see https://news.ycombinator.com/item?id=45988611 for explanation]

ylhert
·
3 days ago

We've got like 10 LLM arenas but nothing for OCR yet, really hope this takes off!

athoscouto
·
3 days ago

Nice! Would love to see Azure Document Intelligence on this.

profburial
·
3 days ago

This is a killer idea!