Supported Languages

The following recognition languages are supported for OCR processing in the Project Client.

For information about setting the recognition language in your project settings, see OCR Settings.

Note: Some languages are not supported in combination. For example, OCR may fail to process some languages when they are combined with Chinese, Japanese, or Korean. If you have selected more than one recognition language in your OCR Settings and receive an error during processing, you may need to select only the primary language for the item in question.

Search term highlighting is not supported for Hebrew, Chinese, Japanese, or Korean.

  • Abkhaz
  • Adyghe
  • Afrikaans
  • Agul
  • Albanian
  • Altaic
  • Armenian (Eastern)
  • Armenian (Grabar)
  • Armenian (Western)
  • Avar
  • Aymara
  • Azerbaijani (Cyrillic)
  • Azerbaijani (Latin)
  • Bashkir
  • Basic programming language
  • Basque
  • Belarusian
  • Bemba
  • Blackfoot
  • Breton
  • Bugotu
  • Bulgarian
  • Buryat
  • Catalan
  • Chamorro
  • Chechen
  • Chemistry (simple chemical formulas)
  • Chinese Simplified
  • Chinese Traditional
  • Chukcha
  • Chuvash
  • Corsican
  • Crimean Tatar
  • Croatian
  • Crow
  • Czech
  • Danish
  • Dargwa
  • Digits (Numbers)
  • Dungan
  • Dutch (Netherlands)
  • Dutch (Belgium)
  • E-13B (MICR text type)
  • English
  • English and Russian
  • Eskimo (Cyrillic)
  • Eskimo (Latin)
  • Esperanto
  • Estonian
  • Even
  • Evenki
  • Faeroese
  • Fijian
  • Finnish
  • Fortran programming language
  • French
  • Frisian
  • Friulian
  • Gagauz
  • Galician
  • Ganda
  • German
  • German (Luxembourg)
  • German (new spelling)
  • Greek
  • Guarani
  • Hani
  • Hausa
  • Hawaiian
  • Hebrew
  • Hungarian
  • Icelandic
  • Ido
  • Indonesian
  • Ingush
  • Interlingua
  • Irish
  • Italian
  • Japanese + English
  • Kabardian
  • Kalmyk
  • Karachay-Balkar
  • Karakalpak
  • Kasub
  • Kawa
  • Kazakh
  • Khakas
  • Khanty
  • Kikuyu
  • Kirghiz
  • Kongo
  • Korean and English
  • Koryak
  • Kpelle
  • Kumyk
  • Kurdish
  • Lak
  • Lappish (Sami)
  • Latin
  • Latvian
  • Lezgin
  • Lithuanian
  • Luba
  • Macedonian
  • Malagasy
  • Malay
  • Malinke
  • Maltese
  • Mansi
  • Maori
  • Mari
  • Maya
  • Miao
  • Minankabaw
  • Mohawk
  • Mongol
  • Mordvin
  • Nahuatl
  • Nenets
  • Nivkh
  • Nogay
  • Norwegian (Bokmal and Nynorsk)
  • Norwegian (Bokmal)
  • Norwegian (Nynorsk)
  • Nyanja
  • Occidental
  • Ojibway
  • Ossetian
  • Papiamento
  • Pascal programming language
  • Pidgin English (Tok Pisin)
  • Polish
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Provencal
  • Quechua
  • Rhaeto-Romanic
  • Romanian
  • Romanian (Moldavia)
  • Romany
  • Ruanda
  • Rundi
  • Russian
  • Russian (old spelling)
  • Samoan
  • Scottish Gaelic
  • Selkup
  • Serbian (Cyrillic)
  • Serbian (Latin)
  • Shona
  • Sioux (Dakota)
  • Slovak
  • Slovenian
  • Somali
  • Sorbian
  • Sotho
  • Spanish
  • Sunda
  • Swahili
  • Swazi
  • Swedish
  • Tabassaran
  • Tagalog
  • Tahitian
  • Tajik
  • Tatar
  • Tinpo
  • Tongan
  • Tswana
  • Tun
  • Turkish
  • Turkmen
  • Tuvan
  • Udmurt
  • Ukrainian
  • Uzbek (Cyrillic)
  • Uzbek (Latin)
  • Visayan
  • Welsh
  • Wolof
  • Xhosa
  • Yakut
  • Yiddish
  • Zapotec
  • Zulu
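
The two restrictions noted at the top of this page (possible conflicts when Chinese, Japanese, or Korean is combined with other languages, and no search term highlighting for Hebrew, Chinese, Japanese, or Korean) can be sketched as a small pre-check. This is only an illustration, not part of the Project Client: the function names and the two sets are hypothetical, the language names are copied from the list above, and the rule "a CJK entry mixed with any non-CJK entry may conflict" is a conservative assumption, since the product does not document exactly which combinations fail.

```python
# Hypothetical pre-check for a recognition language selection.
# None of these names come from the Project Client; they are illustrative only.

# Entries from the list above that involve Chinese, Japanese, or Korean.
CJK_LANGUAGES = {
    "Chinese Simplified",
    "Chinese Traditional",
    "Japanese + English",
    "Korean and English",
}

# Languages for which search term highlighting is not supported.
NO_HIGHLIGHTING = CJK_LANGUAGES | {"Hebrew"}


def may_conflict(selected):
    """Return True if the selection mixes a CJK entry with a non-CJK entry.

    Assumption: any such mix is treated as a possible conflict, because the
    exact unsupported combinations are not documented.
    """
    chosen = set(selected)
    has_cjk = bool(chosen & CJK_LANGUAGES)
    has_other = bool(chosen - CJK_LANGUAGES)
    return has_cjk and has_other


def fallback_selection(selected):
    """Keep only the primary (first) language when a conflict is possible."""
    if may_conflict(selected):
        return list(selected[:1])
    return list(selected)


def supports_highlighting(language):
    """Return True if search term highlighting is expected to work."""
    return language not in NO_HIGHLIGHTING
```

For example, under these assumptions `fallback_selection(["English", "Chinese Simplified"])` keeps only `"English"`, which mirrors the advice to select only the primary language when processing returns an error.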