Jump to content

OCR-B

From Wikipedia, the free encyclopedia
OCR-B
CategorySans-serif
ClassificationNeo-grotesque
Designer(s)Adrian Frutiger
Date created1968
Sample

OCR-B is a monospace font developed in 1968 by Adrian Frutiger for Monotype by following the European Computer Manufacturer's Association standard. Its function was to facilitate the optical character recognition operations by specific electronic devices, originally for financial and bank-oriented uses. It was accepted as the world standard in 1973.[1] It follows the ISO 1073-2:1976 (E) standard, refined in 1979 ("letterpress" design, size I). It includes all ASCII symbols, and other symbols needed in the bank environment. It is widely used for the human readable digits in UPC/EAN barcodes.[2][citation needed] It is also used for machine-readable passports.[3] It shares that purpose with OCR-A, but it is easier for the human eye and brain to read and it has a less technical look than OCR-A.

History

[edit]

In June 1961, the European Computer Manufacturers Association (ECMA) started standardization activities related to Optical Character Recognition (OCR). After evaluating existing OCR designs, it was decided to develop two new fonts: A stylized design with just digits, called “Class A”; and a more conventional type design with broader character coverage, called “Class B”. In February 1965, ECMA proposed a design for the “Class B” font to ISO, who adopted it as international standard ISO 1073-2 in October 1965.[4] The first revision contained three font sizes: I, II and III. The specification included a Letterpress design, intended for high-quality printing equipment; and a rounded-edge Constant Strokewidth design for impact printers[5]: 3  with reduced typographic quality.

In September 1969, ECMA started work to revise its published standard. To make OCR-B more widely accepted, the shapes of some characters were slightly modified. The new revision removed font size II, which had been rarely used in practice; it deleted five character shapes; and it added a new font size IV. ECMA published the second edition of OCR-B in October 1971.[4]

In March 1976, ECMA published a third revision of its ECMA-11 specification. It added the symbols § and ¥ to OCR-B; two types of erasure marks (█) for blackening out mis-printed characters were added; and the length of the Vertical bar was changed to match ISO 1073-2.[4]

In 1993, Turkey proposed extending ISO 1073-2 to include the Turkish letters Ğğ, İı, and Şş.[6] The request was generalized to extend OCR-B with a number of Latin and Greek letters used in European languages.[7]: 27  A revision of the ISO 1073-2:1976 standard was therefore started, producing three successive draft documents. The final draft would have extended OCR-B with 40 Latin and 10 Greek letters; for six Latin letters, the draft gave new alternate shapes.[7]: 26  A request to extend OCR-B with Vietnamese accents was rejected.[7]: 27  Other than previous versions of the standard, which specified glyph shapes via reference drawings, the new revision would have included the shapes in machine-readable form.[7]: 26  However, industry support for testing the new font could not be secured at the time, so the revision effort was halted in 1997.[7]: IV  The working group described their findings in a technical report.[7]: 1 

Two proposed variants for the OCR-B Euro sign[5]

In June 1998, the European Committee for Standardization published a report for adding the Euro sign to OCR-B.[5] The report proposed both a single-stroked and a double-stroked variant of the Euro sign, leaving the decision to further testing of OCR performance.[5]: 4  Testing was difficult: the theoretical design methods used when the OCR-B glyphs were originally developed could no longer be reproduced, and the technological constraints of the 1960s were also not entirely relevant anymore in the OCR environments of the 1990s.[8] A new test method was devised, using present-time OCR technology. The tests found no difference in OCR performance between the two Euro variants, and recommended the adoption of the double-stroked variant as it matches the conventional glyph shape.[8] The project did not have funds to thoroughly test the glyph extensions of the 1993 proposal; initial results were inconclusive.[8]

Availability

[edit]

Microsoft Office ships a version of Letterpress OCR-B produced by Monotype. It covers Windows-1252.[9] Many vendors, including Adobe, still sell their versions of OCR-A and OCR-B.

The TeX typesetting system has a public domain Constant Strokewidth OCR-B font in METAFONT definition form. It was created by Norbert Swartz in 1995 and updated in 2010. It has a setting for square stroke ends.[10] The definition has also been translated to METATYPE1, so the rounded version is available in TrueType and OpenType too.[11]

A version of Constant Strokewidth OCR-B by Matthew Anderson has extended character coverage. It is available under CC-BY 4.0.[12]

MS-DOS OCR-B encoding

[edit]

The MS-DOS OCR-B encoding is code page 877. Note that the grave, acute, circumflex (at 0x9B), tilde, diaeresis, and cedilla can be added over (in the case of the cedilla, under) letters to form accented letters. Note that the alternative Latin small latter m (not in Unicode), euro sign (€), and vertical bar (|) were later added to the code page, but it is unknown where they are located.

MS-DOS OCR-B[13]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x
1x [a]
2x  SP  ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { [b] } ~ [c]
8x ü ä å Ä Å
9x æ Æ ö Ö Ü ^ £ ¥
Ax Ñ ø Ø ˍ
02CD
Bx IJ ij
Cx ¤
Dx
Ex ß ´
Fx § ¸ ¨

Characters not in Unicode:

  • ^a Group erase (0x18)
  • ^b Alternative vertical bar (0x7C)
  • ^c Character erase (0x7F)

References

[edit]
  1. ^ Frutiger, Adrian. Type. Sign. Symbol. ABC Verlag, Zurich, 1980. p. 50
  2. ^ "GS1 Human Readable Interpretation (HRI) Implementation Guideline" (PDF). GS1 AISBL. 2018. p. 13. Retrieved 2018-09-27.
  3. ^ Doc 9303: Machine Readable Travel Documents, Part 3: Specifications Common to all MRTDs (PDF) (Eighth ed.). International Civil Aviation Organization. 2015. p. 25. ISBN 978-92-9249-792-7. Retrieved 2016-03-03.
  4. ^ a b c "Standard ECMA-11 for the Alphanumeric Character Set OCR-B for Optical Recognition" (PDF). European Computer Manufacturers Association. March 1976. Section “Brief History”.
  5. ^ a b c d "Draft Report on the Euro Glyph in OCR-B" (PDF). June 28, 1998.
  6. ^ Karl Ivar Larsson (August 8, 2000). "Notes on transfer of responsibility for OCR-B standards".
  7. ^ a b c d e f "Proposal for Type 3 Technical Report, TR 15907, Information technology — Revision of OCR-B standard (ISO 1073/II-1976)" (PDF). September 28, 1998.
  8. ^ a b c Karsson, Kent Ivar (June 28, 1998), Report to TC304 on OCR-B situation, Unicode Technical Committee, Unicode Consortium, UTC Document L2/01-259
  9. ^ "OCRB font family - Typography". 30 March 2022.
  10. ^ "CTAN: /Tex-archive/Fonts/Ocr-b".
  11. ^ "OCR a and OCR B".
  12. ^ "OCR-B". wehtt.am. Archived from the original on 28 March 2019. Retrieved 11 January 2022.
  13. ^ "Code Page 877" (PDF). Archived from the original (PDF) on 2013-01-21.
[edit]