Question 1

Does it handle the œ ligature?

Accepted Answer

Yes — Tesseract's French model recognizes œ and Œ correctly. Lower-resolution scans sometimes split it into 'oe', which is acceptable in modern French but technically incorrect. Re-scan at 300 DPI if exact ligature preservation matters.

Question 2

What about French Canadian (Quebec) French?

Accepted Answer

Same model handles both Iberian/European French and Quebec French. Spelling differences are minor enough that Tesseract handles both uniformly.

Question 3

Can I mix French and English?

Accepted Answer

Yes — specify both languages in the picker. Useful for bilingual Canadian documents, France-UK contracts, or academic papers with English abstracts on French content.

OCR French PDF

Frequently asked questions