Genealogy AI Frontier

Chandra OCR 2 Tops HTR/OCR Benchmarks [developing]

Chandra OCR 2 Tops HTR/OCR Benchmarks [developing]

Key Questions

What is Chandra OCR 2?

Chandra OCR 2 is an open-source 4B parameter model specialized in OCR and HTR tasks. It excels at processing handwriting, tables, multilingual text, and faded documents, and can run locally.

How does Chandra OCR 2 compare to GPT-4o?

Chandra OCR 2 outperforms GPT-4o on HTR/OCR benchmarks, particularly for challenging inputs like handwriting and faded documents. This performance highlights a shift toward specialized AI models over general ones.

What are the applications of Chandra OCR 2 in genealogy?

It is ideal for digitizing genealogy records such as census data and letters. This aligns with efforts by FamilySearch using AI for record digitization and partnerships like ParaScript and ABBYY for advanced document processing.

Open-source 4B Chandra OCR 2 crushes GPT-4o on handwriting, tables, multilingual/faded docs—local run for genealogy census/letters. Signals specialized AI shift with new HTR papers reinforcing Scribe/Gemini/IJDAR boom.

Sources (2)
Updated May 11, 2026