|
      OCR Results Are Not Satisfactory
      When you create text-searchable PDF/XPS/OOXML files, OCR (Optical Character Recognition) may not produce satisfactory results. This can happen when the machine settings, or the language, character type, or format of the original document, are not suited to OCR processing.
      Checking the Machine Settings and Supported Languages
      You can improve OCR results by adjusting the machine's character recognition settings to match your documents, or by using character types and fonts in the documents that the machine can recognize.
      Settings and Languages for OCR Processing
      Item
      Details
      Language Settings for Character Recognition
      For PDF/XPS/PowerPoint files:
      Characters are recognized based on the language you select in <Switch Language/Keyboard> (<Display Settings> > <Switch Language/Keyboard>).*1
      For Word files:
      Characters are recognized based on the language you select after pressing <Change> when you create a Word file.
      Recognizable Asian Languages
      Japanese, Chinese (Simplified), Chinese (Traditional), Korean
      See "Recognizable Character Types and Fonts (Asian Languages)."
      Recognizable European Languages and Language Groups
      Languages:
      English, French, Italian, German, Spanish, Dutch, Portuguese, Albanian, Catalan, Danish, Finnish, Icelandic, Norwegian, Swedish, Croatian, Czech, Hungarian, Polish, Slovak, Estonian, Latvian, Lithuanian, Russian, Greek, Turkish
      Language Groups:
      Western European (ISO)*2, Central European (ISO)*3, Baltic (ISO)*4
      See "Recognizable Character Types and Fonts (European Languages)."
      *1 The languages displayed in the list may vary. If you select English, French, Italian, German, Spanish, Thai, or Vietnamese, the selected language is recognized as Western European (ISO).
      *2 Including English, French, Italian, German, Spanish, Dutch, Portuguese, Albanian, Catalan, Danish, Finnish, Icelandic, Norwegian, and Swedish.
      *3 Including Croatian, Czech, Hungarian, Polish, and Slovak.
      *4 Including Estonian, Latvian, and Lithuanian.
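The language groups in footnotes *1 to *4 can be read as a simple lookup. The following is a minimal sketch of that lookup, not the machine's actual logic; the names `LANGUAGE_GROUPS`, `DISPLAY_AS_WESTERN`, `language_group`, and `group_for_display_language` are hypothetical and chosen only for illustration.

```python
from typing import Optional

# Language groups as listed in footnotes *2, *3, and *4.
LANGUAGE_GROUPS = {
    "Western European (ISO)": {
        "English", "French", "Italian", "German", "Spanish", "Dutch",
        "Portuguese", "Albanian", "Catalan", "Danish", "Finnish",
        "Icelandic", "Norwegian", "Swedish",
    },
    "Central European (ISO)": {"Croatian", "Czech", "Hungarian", "Polish", "Slovak"},
    "Baltic (ISO)": {"Estonian", "Latvian", "Lithuanian"},
}

# Footnote *1: display languages that are recognized as Western European (ISO).
DISPLAY_AS_WESTERN = {
    "English", "French", "Italian", "German", "Spanish", "Thai", "Vietnamese",
}


def language_group(language: str) -> Optional[str]:
    """Return the language group that includes the language, or None.

    Russian, Greek, and Turkish are recognizable languages but are not
    listed under any group, so they return None here.
    """
    for group, members in LANGUAGE_GROUPS.items():
        if language in members:
            return group
    return None


def group_for_display_language(display_language: str) -> Optional[str]:
    """Return the group used for OCR when this display language is selected.

    Only the case described in footnote *1 is covered. The source does not
    state how other display languages are handled, so None is returned.
    """
    if display_language in DISPLAY_AS_WESTERN:
        return "Western European (ISO)"
    return None
```

For example, `language_group("Polish")` returns `"Central European (ISO)"`, and `group_for_display_language("Thai")` returns `"Western European (ISO)"`.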
      Recognizable Character Types and Fonts (Asian Languages)
      Item
      Details
      Recognizable Character Types
      Japanese:
      Alphanumeric characters, Kana characters, Kanji characters (JIS first level, and some of the JIS second level), Symbols
      Chinese (Simplified):
      Alphanumeric characters, Chinese characters, Symbols (GB2312-80)
      Chinese (Traditional):
      Alphanumeric characters, Chinese characters, Symbols (Big5)
      Korean:
      Alphanumeric characters, Chinese characters, Hangul characters, Symbols (KSC5601)
      Recognizable Fonts
      Multiple fonts are supported. (Ming-cho type is recommended.)
      Italicized characters cannot be recognized.
      Fonts Used for Converted Characters (Only when Word is selected as the file format)
      Japanese:
      Asian characters: MS Mincho
      European characters: Century
      Chinese (Simplified):
      Asian characters: SimSun
      European characters: Calibri
      Chinese (Traditional):
      Asian characters: PMingLiU
      European characters: Calibri
      Recognizable Character Types and Fonts (European Languages)
      Item
      Details
      Recognizable Character Types
      Alphanumeric characters, Special characters of the recognized language*1, Symbols
      Recognizable Fonts
      Multiple fonts are supported. (Times, Century, and Arial are recommended.)
      Italicized characters can be recognized.
      Fonts Used for Converted Characters (Only when Word is selected as the file format)
      Calibri
      Italic style is not reproduced.
      *1 Special characters for each language can be recognized, although some may not be recognized depending on the language. The following Greek characters can be recognized:
      Α, Β, Γ, Δ, Ε, Ζ, Η, Θ, Ι, Κ, Λ, Μ, Ν, Ξ, Ο, Π, Ρ, Σ, Τ, Υ, Φ, Χ, Ψ, Ω, α, β, γ, δ, ε, ζ, η, θ, ι, κ, λ, μ, ν, ξ, ο, π, ρ, σ, τ, υ, φ, χ, ψ, ω
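The Greek list in footnote *1 contains the 24 uppercase and 24 lowercase base letters, without the final sigma or accented forms. As a rough pre-check, the sketch below flags Greek characters in a text that are not on that list. It assumes the list is exhaustive, which the source does not state, and the names `RECOGNIZED_GREEK` and `unlisted_greek` are hypothetical.

```python
import unicodedata

# Greek letters listed in footnote *1, built from their Unicode ranges:
# U+0391..U+03A9 (uppercase; U+03A2 is unassigned) and
# U+03B1..U+03C9 (lowercase; U+03C2, final sigma, is not in the list).
RECOGNIZED_GREEK = (
    {chr(c) for c in range(0x0391, 0x03AA) if c != 0x03A2}
    | {chr(c) for c in range(0x03B1, 0x03CA) if c != 0x03C2}
)


def unlisted_greek(text: str) -> list:
    """Return Greek characters in text that are not in the footnote list.

    A character counts as Greek when its Unicode name starts with "GREEK".
    Characters are returned in order of appearance, including repeats.
    """
    return [
        ch for ch in text
        if unicodedata.name(ch, "").startswith("GREEK")
        and ch not in RECOGNIZED_GREEK
    ]
```

For a word containing an accented omicron and a final sigma, such as `"\u03bb\u03cc\u03b3\u03bf\u03c2"`, the function returns those two characters, which may be worth checking by eye after OCR.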
      Checking the Format of the Original Documents
      Use documents suitable for OCR processing to improve the processing accuracy when creating searchable PDF/XPS/OOXML files.
      Item
      Details
      Document Format
      Printed documents and word processor documents (documents consisting of text, graphics, photographs, or tables, with no slanted characters)
      Text Format
      Horizontal and vertical writing (documents containing both horizontal and vertical writing can also be recognized)
      Only horizontal writing can be recognized for European languages and Korean text.
      Documents with one to three columns and no complex column settings
      Character Size
      8 to 40 point
      Table Format (For Word Format Only)
      Tables that meet the following conditions:
      Tables consisting of squares divided by solid lines
      Tables with up to 32 columns
      Tables with up to 32 rows
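The numeric limits in the table above can be collected into a simple pre-scan checklist. The following is a minimal sketch under stated assumptions: `PageDescription` and `check_page` are hypothetical names, the values describing a page are supplied by the user rather than measured by the machine, and passing the check does not guarantee good OCR results.

```python
from dataclasses import dataclass
from typing import List

# Limits taken from the table above.
MIN_POINT_SIZE = 8
MAX_POINT_SIZE = 40
MAX_TEXT_COLUMNS = 3
MAX_TABLE_COLUMNS = 32
MAX_TABLE_ROWS = 32
# Vertical writing is recognized only for these languages; European
# languages and Korean are limited to horizontal writing.
VERTICAL_WRITING_OK = {"Japanese", "Chinese (Simplified)", "Chinese (Traditional)"}


@dataclass
class PageDescription:
    """User-supplied description of a page (hypothetical structure)."""
    language: str
    point_size: float
    text_columns: int = 1
    vertical_writing: bool = False
    table_columns: int = 0  # 0 means the page has no table
    table_rows: int = 0


def check_page(page: PageDescription) -> List[str]:
    """Return a list of conditions from the table that the page does not meet."""
    issues = []
    if not MIN_POINT_SIZE <= page.point_size <= MAX_POINT_SIZE:
        issues.append("Character size is outside 8 to 40 point.")
    if not 1 <= page.text_columns <= MAX_TEXT_COLUMNS:
        issues.append("Document has more than three columns.")
    if page.vertical_writing and page.language not in VERTICAL_WRITING_OK:
        issues.append("Only horizontal writing is recognized for this language.")
    if page.table_columns > MAX_TABLE_COLUMNS:
        issues.append("Table has more than 32 columns (Word format only).")
    if page.table_rows > MAX_TABLE_ROWS:
        issues.append("Table has more than 32 rows (Word format only).")
    return issues
```

A two-column English page in 12 point text returns an empty list, while a Korean page with vertical writing, 6 point text, four columns, and a 40-column table returns four issues.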
      IMPORTANT
      Some documents suitable for OCR processing may not be processed properly
      High accuracy may not be achieved with documents including a large amount of text on each page.
      Characters may be replaced with unintended characters or may be missing because of the background color of the document, the form and size of the characters, or slanted characters.*
      Paragraphs, line breaks, or tables may not be reproduced.*
      Some parts of illustrations, photographs, or seal impressions may be recognized as characters and be replaced with characters.*
      * When Word is selected as the file format.