ISO basic Latin alphabet
The ISO basic Latin alphabet is a Latin-script alphabet and consists of two sets of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the English alphabet.
The two sets contain the following 26 letters each:
Uppercase Latin alphabet | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z |
Lowercase Latin alphabet | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z |
History
By the 1960s it became apparent to the computer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization encapsulated the Latin script in their 7-bit character-encoding standard. To achieve widespread acceptance, this encapsulation was based on popular usage. The standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 8859 and ISO/IEC 10646, have continued to define the 26 × 2 letters of the English alphabet as the basic Latin script with extensions to handle other letters in other languages.Terminology
The Unicode block that contains the alphabet is called "C0 Controls and Basic Latin". Two subheadings exist:- "Uppercase Latin alphabet": the letters start at U+0041 and contain the string LATIN CAPITAL LETTER in their descriptions
- "Lowercase Latin alphabet": the letters start at U+0061 and contain the string LATIN SMALL LETTER in their descriptions
- Uppercase: the letters start at U+FF21 and contain the string FULLWIDTH LATIN CAPITAL LETTER in their descriptions
- Lowercase: the letters start at U+FF41 and contain the string FULLWIDTH LATIN SMALL LETTER in their descriptions
Timeline for encoding standards
- 1865 International Morse Code was standardized at the International Telegraphy Congress in Paris, and was later made the standard by the International Telecommunication Union
- 1950s Radiotelephony Spelling Alphabet by ICAO
Timeline for widely used computer codes supporting the alphabet
- 1963: ASCII
- 1963/1964: EBCDIC
- 1965-04-30: Ratified by ECMA as ECMA-6 based on work the ECMA's Technical Committee TC1 had carried out since December 1960.
- 1972: ISO 646
- 1983: ITU-T Rec. T.51 | ISO/IEC 6937
- 1987: ISO/IEC 8859-1:1987
- * Subsequently, other versions and parts of ISO/IEC 8859 have been published.
- Mid-to-late 1980s: Windows-1250, Windows-1252, and other encodings used in Microsoft Windows
- 1990: Unicode 1.0, contained in the block "C0 Controls and Basic Latin" using the same alphabetic code values as ASCII and ISO/IEC 646
- * Subsequently, other versions of Unicode have been published and it later became a joint ISO/IEC standard as well, as identified below.
- 1993: ISO/IEC 10646-1:1993, ISO/IEC standard for characters in Unicode 1.1
- * Subsequently, other versions of ISO/IEC 10646-1 and one of ISO/IEC 10646-2 have been published. Since 2003, the standards have been published under the name "ISO/IEC 10646" without the separation into two parts.
- 1997: Windows Glyph List 4
Representation
Not case sensitive, all letters have code words in the ICAO spelling alphabet and can be represented with Morse code.
Usage
All of the lowercase letters are used in the International Phonetic Alphabet. In X-SAMPA and SAMPA these letters have the same sound value as in IPA.Alphabets containing the same set of letters
The list below only includes alphabets that lack:- letters whose diacritical marks make them distinct letters.
- multigraphs that constitute distinct letters.
alphabet | diacritic | multigraphs | ligatures |
Afrikaans alphabet | á, é, è, ê, ë, í, î, ï, ó, ô, ú, û, ý | ||
Catalan alphabet | à, é, è, í, ï, ó, ò, ú, ü, ç, lꞏl | ⟨gu⟩, ⟨ig⟩, ⟨ix⟩, ⟨ll⟩, ⟨ny⟩, ⟨qu⟩, ⟨rr⟩, ⟨ss⟩ | |
Dutch alphabet | ä, é, è, ë, ï, ö, ü | The digraph ⟨ij⟩ is sometimes considered to be a separate letter. When that is the case, it usually replaces or is intermixed with ⟨y⟩. Other digraphs: ⟨aa⟩, ⟨ae⟩, ⟨ai⟩, ⟨au⟩, ⟨ch⟩, ⟨ee⟩, ⟨ei⟩, ⟨eu⟩, ⟨ie⟩, ⟨oe⟩, ⟨oi⟩, ⟨oo⟩, ⟨ou⟩, ⟨ui⟩, ⟨uu⟩ | |
English alphabet | ⟨sh⟩, ⟨ch⟩, ⟨ea⟩, ⟨ou⟩, ⟨th⟩, ⟨ph⟩, ⟨ng⟩ | æ, œ | |
French alphabet | à, â, ç, é, è, ê, ë, î, ï, ô, ù, û, ü, ÿ | ⟨ai⟩, ⟨au⟩, ⟨ei⟩, ⟨eu⟩, ⟨oi⟩, ⟨ou⟩, ⟨eau⟩, ⟨ch⟩, ⟨ph⟩, ⟨gn⟩, ⟨an⟩, ⟨am⟩, ⟨en⟩, ⟨em⟩, ⟨in⟩, ⟨im⟩, ⟨on⟩, ⟨om⟩, ⟨un⟩, ⟨um⟩, ⟨yn⟩, ⟨ym⟩, ⟨ain⟩, ⟨aim⟩, ⟨ein⟩, ⟨oin⟩, ⟨aî⟩, ⟨eî⟩ | æ, œ |
German alphabet | ä, ö, ü | ⟨sch⟩, ⟨qu⟩, ⟨ch⟩, ⟨ph⟩, ⟨ng⟩, ⟨ie⟩, ⟨ck⟩, ⟨ei⟩, ⟨eu⟩, ⟨äu⟩ | ß |
Italian alphabet | à, è, é, ì, ò, ù | ⟨ch⟩, ⟨ci⟩, ⟨gh⟩, ⟨gi⟩, ⟨gl⟩, ⟨gli⟩, ⟨gn⟩, ⟨sc⟩, ⟨sci⟩ | |
Ido alphabet | ⟨qu⟩, ⟨ch⟩, ⟨sh⟩ | ||
Indonesian alphabet | ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩, diphthongs: ai, au, oi | ||
Interlingua alphabet | ⟨qu⟩ | ||
Javanese Latin alphabet | é, è | ⟨dh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩, ⟨th⟩ | |
Luxembourgish alphabet | ä, é, ë | ||
Malay alphabet | ⟨gh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩ | ||
Portuguese alphabet | ã, õ, á, é, í, ó, ú, â, ê, ô, à, ç | ⟨ch⟩, ⟨lh⟩, ⟨nh⟩, ⟨rr⟩, ⟨ss⟩, ⟨am⟩, ⟨em⟩, ⟨im⟩, ⟨om⟩, ⟨um⟩, ⟨ãe⟩, ⟨ão⟩, ⟨õe⟩ | |
Sundanese Latin alphabet | é | ⟨eu⟩, ⟨ng⟩, ⟨ny⟩ |
English is one of few modern European language requiring no diacritics for native words.
Malay and Indonesian are the only languages outside Europe that use all the Latin alphabet and require no diacritics and ligatures. Many of the 700+ languages of Indonesia, such as Javanese, also use the Indonesian alphabet to write their languages.
Column numbering
The Roman alphabet is commonly used for column numbering in a table or chart. This avoids confusion with row numbers using Arabic numerals. For example, a 3-by-3 table would contain Columns A, B, and C, set against Rows 1, 2, and 3. If more columns are needed beyond Z, the column immediately after Z is AA, followed by AB, and so on. This can be seen by scrolling far to the right in a spreadsheet program such as Microsoft Excel or LibreOffice Calc.These are double-digit "letters" for table columns, in the same way that 10 through 99 are double-digit numbers. The Greek alphabet has a similar extended form that uses such double-digit letters if necessary, but it is used for chapters of a fraternity as opposed to columns of a table.
Such double-digit letters for bullet points are AA, BB, CC, etc., as opposed to the number-like place value system explained above for table columns.