Unicode subscripts and superscripts


has subscripted and superscripted versions of a number of characters including a full set of Arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.
The World Wide Web Consortium and the Unicode Consortium have made recommendations on the choice between using markup and using superscript and subscript characters:
"When used in mathematical context it is recommended to consistently use style markup for superscripts and subscripts.... However, when super and sub-scripts are to reflect semantic distinctions, it is easier to work with these meanings encoded in text rather than markup, for example, in phonetic or phonemic transcription."

Uses

The intended use when these characters were added to Unicode was to allow chemical and algebra formulas and phonetics to be written without markup, but produce true superscripts and subscripts. Thus "H₂O" is supposed to be identical to "H2O".
In reality most fonts that include these characters ignore the Unicode definition, and design the digits for mathematical numerator and denominator glyphs, which are smaller than normal characters but are aligned with the cap line and the baseline, respectively. When used with the solidus, these glyphs are useful for making arbitrary diagonal fractions. Trying to make fractions using existing software super/subscripts look messier, so font designers provided this alternative. This also makes the superscript letters useful for ordinal indicators, more closely matching the ª and º characters. However it makes them incorrect for normal super and subscripts, and generally formulas look better using markup than these characters.
Unicode intended to produce diagonal fractions through a different mechanism but it is very poorly supported. The fraction slash U+2044 is visually similar to the solidus, but when used with the ordinary digits is intended to tell a layout system that a fraction such as ¾ should be rendered using automatic glyph substitution for the digits. Some browsers support this but not in all fonts, a selection of fonts is shown in the below table.
CharactersFontResult
rowspan=4 ½
,, ¹/₂-
,, ¹⁄₂-
,, 1⁄2-
,, Arial1⁄2
,, Cambria1⁄2
,, Consolas1⁄2
,, Times New Roman1⁄2

Superscripts and subscripts block

The most common superscript digits were in ISO-8859-1 and were therefore carried over into those positions in the Latin-1 range of Unicode. The rest were placed in a dedicated section of Unicode at to U+209F. The two tables below show these characters. Each superscript or subscript character is preceded by a normal x to show the subscripting/superscripting. The table on the left contains the actual Unicode characters; the one on the right contains the equivalents using HTML markup for the subscript or superscript.

Other superscript and subscript characters

Unicode version 13.0 also includes subscript and superscript characters that are intended for semantic usage, in the following blocks:
Consolidated, the Unicode standard contains superscript and subscript versions of a subset of Latin and Greek letters. Here they are arranged in order for comparison. Since these characters come from different ranges, they may not be of the same size and position due to font substitution. Asterisks mark small capitals that are not distinct from minuscules and so would not be expected to be supported by Unicode.
ABCDEFGHIJKLMNOPQRSTUVWXYZ
Superscript capitalᴿ
Superscript small cap*******
Superscript minusculeʰʲˡʳˢʷˣʸ
Overscript small cap*◌ᷛ◌ᷞ◌ᷟ◌ᷡ*◌ᷢ*****
Overscript minuscule◌ͣ◌ᷨ◌ͨ◌ͩ◌ͤ◌ᷫ◌ᷚ◌ͪ◌ͥ◌ᷜ◌ᷝ◌ͫ◌ᷠ◌ͦ◌ᷮ◌ͬ◌ᷤ◌ͭ◌ͧ◌ͮ◌ᷱ◌ͯ◌ᷦ
Subscript minuscule
Underscript minuscule◌᷊◌ᪿ

ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
Superscript minuscule⁽ᵋ⁾ᶿ⁽ᶥ⁾⁽ᶹ⁾
Overscript minuscule◌ᷩ
Subscript minuscule

ɐɑɒæçɔɕðəɜɛɟɡɦɥɨɩʝɭɱɯɰŋɲɳɵœɹɻʁʂʃƫʉʊʋʌɣʍʐʑʒɸʔʕ
Superscript⁽ᶜ̧⁾ʱʴʵʶˠˀˁ,ˤ
Overscript◌ᷧ◌ᷔ◌ᷗ◌ᷙ◌ᷪ◌ᷯ◌̉
Underscript◌ᫀ

See also small caps in Unicode, superscript IPA letters.

Composite characters

Primarily for compatibility with earlier character sets, Unicode contains a number of characters that compose super- and subscripts with other symbols. In most fonts these render much better than attempts to construct these symbols from the above characters or by using markup.