Unicode subscripts and superscripts
Adapted from Wikipedia · Adventurer experience
Unicode has special characters that can appear below or above normal text, called subscripts and superscripts. This includes a full set of Arabic numerals that can be written in these styles. These characters are very useful because they let us show polynomials, chemical formulas, and other equations in plain text without needing special coding like HTML or TeX.
The World Wide Web Consortium and the Unicode Consortium have advice about when to use these special characters versus other methods. In math contexts using MathML, it's best to use style markup for superscripts and subscripts. But for showing meaning differences in text, like in phonetic or phonemic transcription, using these special subscript and superscript characters can make things easier to understand.
Uses
Unicode added special superscript and subscript characters to help write chemical and algebraic formulas without extra formatting. For example, H₂O uses a subscript 2.
Many fonts do not follow Unicode’s rules. They design these characters for math, making them work better with fractions like ³/₄. Regular markup does not create correct fractions. These superscript letters can also show ordinal indicators.
Unicode planned for fractions to use a different symbol: the fraction slash. When used with normal digits, it tells the system to create a fraction like ¾. Support for this was weak at first, but now many fonts, browsers, and programs work as Unicode intended. This browser and your default font show the sequence as 3⁄4. (See Slash (punctuation)#Fractions for rendering in various other fonts.)
Superscripts and subscripts block
Main article: Superscripts and Subscripts (Unicode block)
Some special characters, like the numbers 1, 2, and 3, look higher or lower than normal text. These are called superscripts and subscripts. They were part of an older system called ISO-8859-1 and were kept in Unicode. Other superscript and subscript characters, along with some basic math symbols, are in a special group of characters from U+2070 to U+209F. The table below shows these characters. Each one is shown with a normal "x" to help see how high or low it sits.
Some spots in this group of characters are not yet used and could be added to later. As of November 2024, three new subscript letters (w, y, and z) were added to this group.
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| U+00Bx | x² | x³ | x¹ | |||||||||||||
| U+207x | x⁰ | xⁱ | x⁴ | x⁵ | x⁶ | x⁷ | x⁸ | x⁹ | x⁺ | x⁻ | x⁼ | x⁽ | x⁾ | xⁿ | ||
| U+208x | x₀ | x₁ | x₂ | x₃ | x₄ | x₅ | x₆ | x₇ | x₈ | x₉ | x₊ | x₋ | x₌ | x₍ | x₎ | |
| U+209x | xₐ | xₑ | xₒ | xₓ | xₔ | xₕ | xₖ | xₗ | xₘ | xₙ | xₚ | xₛ | xₜ | |||
Other superscript and subscript characters
Unicode has special characters that can appear above or below other characters. These are useful for writing formulas or special sounds.
Unicode has many superscript characters (characters that appear above the normal line of text) in different blocks:
- The Latin-1 Supplement block has two special ordinal indicators: ª and º.
- The Latin Extended-C block has one superscript: ⱽ.
- The Latin Extended-D block has seven superscripts: ꝰ ꟲ ꟳ ꟴ ꟸ ꟹ.
- The Latin Extended-E block has five superscripts: ꭜ ꭝ ꭞ ꭟ ꭩ.
- The Latin Extended-F block has superscript letters used for phonetic transcription.
- The Spacing Modifier Letters block has superscripted letters and symbols.
- The Phonetic Extensions block has many superscripted letters and symbols for secondary articulation.
- The Phonetic Extensions Supplement block has even more superscripted letters and symbols.
- The Cyrillic Extended-B block has two Cyrillic superscripts: ꚜ ꚝ.
- The Cyrillic Extended-D block has many Cyrillic superscripts.
- The Georgian block has one superscripted letter: ჼ.
- The Kanbun block has superscripted characters used in Japanese texts.
- The Tifinagh block has one superscript letter: ⵯ.
- The Unified Canadian Aboriginal Syllabics and its Extended blocks have special letters.
Unicode also has combining superscript characters, which are special marks placed above other letters. These include:
- Medieval superscript letter diacritics in the Combining Diacritical Marks block.
- Three combining insular letters in the Combining Diacritical Marks Extended block.
- Additional medieval superscript letter diacritics in the Combining Diacritical Marks Supplement block.
- Medieval superscript letter diacritics in the Cyrillic Extended-A and -B blocks.
- One additional combining character in the Cyrillic Extended-D block.
Unicode also includes subscript characters (characters that appear below the normal line of text):
- The Latin Extended-C block has one subscript: ⱼ.
- The Phonetic Extensions block has several subscripted letters and symbols.
- The Cyrillic Extended-D block has many Cyrillic subscripts.
Unicode also has combining subscript characters:
- A combining subscript in the Combining Diacritical Marks Supplement block.
- Two combining letters in the Combining Diacritical Marks Extended block for Scots language transcriptions.
Latin, Greek, Cyrillic, and IPA tables
The Unicode standard has special smaller letters that can appear above or below normal text. These are useful for writing chemical formulas, math, and phonetic symbols. Because these characters come from different parts of Unicode, they might look different depending on the font used.
Some new superscript and subscript letters will be added in version 18 of the Unicode Standard. These include superscript versions of the letters A, D, E, P, ƀ, and subscript versions of w, y, and z.
§ Some Cyrillic letters might appear differently in certain fonts.
Some superscript letters in Unicode were originally designed as small capital letters. For example, superscript versions of Ä, Ö, and Ü are created by adding a special mark to the basic letter.
Except for the subscript iota, which is used in Greek text, most of these modifier letters are meant for writing phonetic symbols in texts that use the Latin alphabet. Some of these letters look very similar to normal Latin letters and might not be useful for writing in Latin-based languages.
Superscript versions of the Greek letters psi and omega are planned for version 18 of the Unicode Standard.
Cyrillic modifier characters are intended for use in Cyrillic text.
Superscript and subscript versions of certain Cyrillic letters like ё, ї, й, and ў are created using special marks above or below the basic letters. Many of these Cyrillic characters were added to the Cyrillic Extended-D block, which became available in the Gentium and Andika fonts in February 2023.
See also Unicode Small caps, Fullwidths, and Mathematical alphanumerics.
Superscript IPA
The Latin Extended-F block was created for additional phonetic symbols that appear above normal text. These symbols are supported by the free Gentium and Andika fonts. More superscript characters for historical and special phonetic uses are planned for version 18 of the Unicode Standard in 2026.
Consonant letters
Unicode includes special symbols for phonetic consonant letters that appear above normal text. The entire Latin Extended-F block is used for these superscript IPA symbols. Some symbols represent sounds with extra features and are shown in parentheses below the main letter. Asterisks mark symbols that will be added in Unicode version 18 in September 2026.
Special symbols can show sounds made with a strong release, like popping sounds. These can be written together with the main consonant letter.
Trying to write certain marks above already superscript letters does not work well in simple text.
Some older phonetic symbols that are no longer commonly used are still supported. New symbols for special phonetic uses are also planned for Unicode version 18.
Vowel letters
Unicode includes special symbols for phonetic vowel letters that appear above normal text, including some used in English dictionaries and by certain phoneticians. Some older alternative vowel symbols are also supported. Asterisks mark symbols that will be added in Unicode version 18 in September 2026.
Certain vowel symbols that show a rhotic (r-like) quality are not directly supported. Instead, a special mark can be added to normal vowels to show this quality.
Some older vowel symbols that are no longer used are supported, but others are not.
Some special phonetic symbols for certain language studies are also planned for Unicode version 18.
Length marks
Symbols to show the length of sounds are supported:
These can be used to add length to other superscript symbols.
Wildcards
Many superscript symbols for special uses are supported. These include symbols for sounds made before a consonant, sounds made with a stop, sounds made with friction, and more. Some basic Latin superscript symbols for tones and weak sounds are also supported.
Combining marks and subscripts
In addition to superscripts, a few phonetic letters beyond the basic Latin alphabet have special forms that combine with other letters or appear below them.
| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Superscript capital | ᴬ | ᴮ | ꟲ | ᴰ | ᴱ | ꟳ | ᴳ | ᴴ | ᴵ | ᴶ | ᴷ | ᴸ | ᴹ | ᴺ | ᴼ | ᴾ | ꟴ | ᴿ | | ᵀ | ᵁ | ⱽ | ᵂ | – | – | – |
| Superscript small capital | * | 𐞄 | * | * | – | 𐞒 | 𐞖 | ᶦ | – | 𞀹§ | ᶫ | 𞀻§ | ᶰ | * | – | 𐞪 | 𞁀§ | ᶸ | 𐞲 | |||||||
| Superscript minuscule | ᵃ | ᵇ | ᶜ | ᵈ | ᵉ | ᶠ | ᵍ | ʰ | ⁱ | ʲ | ᵏ | ˡ | ᵐ | ⁿ | ᵒ | ᵖ | 𐞥 | ʳ | ˢ | ᵗ | ᵘ | ᵛ | ʷ | ˣ | ʸ | ᶻ |
| Overscript small capital | – | ◌ⷡ § | – | – | – | ◌ᷛ | ◌ⷩ § | – | – | ◌ⷦ § | ◌ᷞ | ◌ᷟ | ◌ᷡ | – | – | ◌ᷢ | ◌ⷮ § | – | – | |||||||
| Overscript minuscule | ◌ͣ | ◌ᷨ | ◌ͨ | ◌ͩ | ◌ͤ | ◌ᷫ | ◌ᷚ | ◌ͪ | ◌ͥ | – | ◌ᷜ | ◌ᷝ | ◌ͫ | ◌ᷠ | ◌ͦ | ◌ᷮ | – | ◌ͬ | ◌ᷤ | ◌ͭ | ◌ͧ | ◌ͮ | ◌ᷱ | ◌ͯ | ◌ꙷ § | ◌ᷦ |
| Subscript minuscule | ₐ | – | 𞁞§ | – | ₑ | – | – | ₕ | ᵢ | ⱼ | ₖ | ₗ | ₘ | ₙ | ₒ | ₚ | – | ᵣ | ₛ | ₜ | ᵤ | ᵥ | * | ₓ | * | * |
| Underscript minuscule | ◌᷊ | ◌ᪿ |
| Æ | Ä | Ƀ | Ǝ | Ħ | ꬸ | Ŋ | Ƞ | Ö | Ü | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Superscript capital | ᴭ | (ᴬ̈) | ᴯ | ᴲ | ꟸ | ᴻ | (ᴼ̈) | (ᵁ̈) | |||
| Superscript minuscule | 𐞃 | (ᵃ̈) | * | ᵊ | 𐞕 | ᵑ | * | (ᵒ̈) | (ᵘ̈) | ||
| Overscript minuscule | ◌ᷔ | ◌ᷲ | ◌ᷪ | ◌ᷬ | ◌ᷳ | ◌ᷴ | |||||
| Subscript minuscule | ₔ | ||||||||||
| Α | Β | Γ | Δ | Ε | Η | Θ | Ι | Ρ | Υ | Φ | Χ | Ψ | Ω | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Superscript minuscule | ᵅ | ᵝ | ᵞ | ᵟ | ᵋ | * | ᶿ | ᶥ | ᶹ | ᵠ | ᵡ | * | * | |
| Overscript minuscule | ◌ᷧ | ◌ᷩ | ◌᫇ | |||||||||||
| Subscript minuscule | ᵦ | ᵧ | ͺ | ᵨ | ᵩ | ᵪ | ||||||||
| Underscript minuscule | ◌ͅ | ◌̫ |
| Bilabial | Labiodental | Dental | Alveolar | Postalveolar | Retroflex | Palatal | Velar | Uvular | Pharyngeal | Glottal | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Nasal | m ᵐ 1D50 | ɱ ᶬ 1DAC | n ⁿ 207F (ᶇ *) * | ɳ ᶯ 1DAF | ɲ ᶮ 1DAE (ȵ *) * | ŋ ᵑ 1D51 | ɴ ᶰ 1DB0 | |||||||||||||||
| Plosive | p ᵖ 1D56 | b ᵇ 1D47 | ȹ * * | ȸ * * | t ᵗ 1D57 (ƫ ᶵ) 1DB5 | d ᵈ 1D48 (ᶁ *) * | ʈ 𐞯 107AF | ɖ 𐞋 1078B | c ᶜ 1D9C (ȶ *) * | ɟ ᶡ 1DA1 (ȡ *) * | k ᵏ 1D4F | ɡ ᶢ/g ᵍ 1DA2/1D4D | q 𐞥 107A5 | ɢ 𐞒 10792 | ʡ 𐞳 107B3 | ʔ ˀ 02C0 | ||||||
| Affricate | ʦ 𐞬 107AC | ʣ 𐞇 10787 | ʧ 𐞮 107AE (*) * | ʤ 𐞊 1078A (*) * | ꭧ 𐞭 107AD (𝼜 *) * | ꭦ 𐞈 10788 (𝼙 *) * | (ʨ 𐞫) 107AB | (ʥ 𐞉) 10789 | ||||||||||||||
| Fricative | ɸ ᶲ 1DB2 | β ᵝ 1D5D | f ᶠ 1DA0 | v ᵛ 1D5B | θ ᶿ 1DBF | ð ᶞ 1D9E | s ˢ 02E2 (ᶊ *) * | z ᶻ 1DBB (ᶎ *) * | ʃ ᶴ 1DB4 (ʆ *) * | ʒ ᶾ 1DBE (ʓ *) * | ʂ ᶳ 1DB3 (ᶘ *) * | ʐ ᶼ 1DBC (ᶚ *) * | ç ᶜ̧ 1D9C+0327 (ɕ ᶝ) 1D9D | ʝ ᶨ 1DA8 (ʑ ᶽ) 1DBD | x ˣ 02E3 (ɧ 𐞗) 10797 | ɣ ˠ 02E0 | χ ᵡ 1D61 | ʁ ʶ 02B6 | ħ 𐞕 10795 (ʩ 𐞐) 10790 | ʕ ˤ 02E4 | h ʰ 02B0 (ꞕ *) * | ɦ ʱ 02B1 |
| Approximant | ʋ ᶹ 1DB9 | ɹ ʴ 02B4 | ɻ ʵ 02B5 | j ʲ 02B2 (ɥ ᶣ) 1DA3 | (ʍ ꭩ) AB69 | ɰ ᶭ 1DAD (w ʷ) 02B7 | ||||||||||||||||
| Tap/flap | ⱱ 𐞰 107B0 | ɾ 𐞩 107A9 | ɽ 𐞨 107A8 | |||||||||||||||||||
| Trill | ᴘ * * | ʙ 𐞄 10784 | r ʳ 02B3 | ʀ 𐞪 107AA | ʜ 𐞖 10796 | ʢ 𐞴 107B4 | ||||||||||||||||
| Lateral fricative | ɬ 𐞛 1079B (ʪ 𐞙) 10799 | ɮ 𐞞 1079E (ʫ 𐞚) 1079A | ꞎ 𐞝 1079D | 𝼅 𐞟 1079F | 𝼆 𐞡 107A1 | 𝼄 𐞜 1079C | ||||||||||||||||
| Lateral approximant | l ˡ 02E1 (ᶅ ᶪ) 1DAA | ɭ ᶩ 1DA9 | ʎ 𐞠 107A0 (ȴ *) * | ʟ ᶫ 1DAB (ɫ ꭞ) AB5E | ||||||||||||||||||
| Lateral tap/flap | ɺ 𐞦 107A6 | 𝼈 𐞧 107A7 | ||||||||||||||||||||
| Implosive | ƥ * * | ɓ 𐞅 10785 | ƭ * * | ɗ 𐞌 1078C | 𝼉 * * | ᶑ 𐞍 1078D | ƈ * * | ʄ 𐞘 10798 | ƙ * * | ɠ 𐞓 10793 | ʠ * * | ʛ 𐞔 10794 | ||||||||||
| Click release | ʘ 𐞵 107B5 | ɋ | ǀ 𐞶 107B6 | ʇ * * | ǃ ꜝ A71D | ʗ * * | 𝼊 𐞹 107B9 | ψ * * | ǂ 𐞸 107B8 | 𝼋 * * | (ʞ *) * | |||||||||||
| Lateral click release | ǁ 𐞷 107B7 | ʖ * * | ||||||||||||||||||||
| Percussive | ʬ ᷱᷱ (hack) | ʭ ͆͆ (hack) | ¡ ꜞ A71E | |||||||||||||||||||
| Front | Central | Back | ||||
|---|---|---|---|---|---|---|
| Semivowel | j ʲ 02B2 | ɥ ᶣ 1DA3 | ɉ * ɥ̶ * w̶ * * * * | ɰ ᶭ 1DAD | w ʷ 02B7 | |
| Extra-close | ʅ * * | ʯ * * | ɿ * * | ʮ * * | ||
| Close | i ⁱ 2071 | y ʸ 02B8 | ɨ ᶤ 1DA4 | ʉ ᶶ 1DB6 | ɯ ᵚ 1D5A | u ᵘ 1D58 |
| Near-close | ɪ ᶦ 1DA6 (ɩ ᶥ) 1DA5 | ʏ 𐞲 107B2 | ᵻ ᶧ 1DA7 (ᵼ) | ᵿ * * (ɷ̶) | (ω *) * | ʊ ᶷ 1DB7 (ɷ 𐞤) 107A4 |
| Close-mid | e ᵉ 1D49 | ø 𐞢 107A2 | ɘ 𐞎 1078E | ɵ ᶱ 1DB1 | ɤ 𐞑 10791 | o ᵒ 1D52 |
| Mid | ᴇ * * | ə ᵊ 1D4A | ꭥ | |||
| Open-mid | ɛ ᵋ 1D4B | œ ꟹ A7F9 | ɜ ᶟ 1D9F (ᴈ ᵌ) 1D4C | ɞ 𐞏 1078F (ʚ) | ʌ ᶺ 1DBA | ɔ ᵓ 1D53 |
| Near-open | æ 𐞃 10783 | ɐ ᵄ 1D44 | ||||
| Open | a ᵃ 1D43 | ɶ 𐞣 107A3 | ᴀ * * | ɑ ᵅ 1D45 | ɒ ᶛ 1D9B | |
| Long | Half-long |
|---|---|
| ː 𐞁 10781 | ˑ 𐞂 10782 |
| ɑ | æ | β | ç | ð | ə | ɣ | ʃ | ʍ | χ | ʔ | ʼ | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Overscript | ◌ᷧ | ◌ᷔ | ◌ᷩ | ◌ᷗ | ◌ᷙ | ◌ᷪ | ◌ᷯ | ◌̉ | ◌̓ | |||
| Subscript | ᵦ | ₔ | * | ᵪ | ||||||||
| Underscript | ◌ᫀ | ◌̦ |
Composite characters
Unicode has special characters that look like superscripts and subscripts with other symbols. These help text look better.
The Latin-1 Supplement block has fractions like ½, ¼, and ¾. It also has the copyright © and registered trademark signs ®, which look like superscripts in some fonts. The General Punctuation block has the permille sign ‰ and the per-ten-thousand sign ‱, while Basic Latin has the percent sign %.
The Number Forms block has fractions like ⅐, ⅑, and ⅒. The Letterlike Symbols block has symbols made from subscript and superscript characters like ℀ and ℅. The Enclosed Alphanumeric Supplement block has three superscript abbreviations: 🅪, 🅫, and 🅬, used for trademarks in different languages. The Miscellaneous Technical block has a subscript 10 (⏨) for scientific notation. The Unified Canadian Aboriginal Syllabics and its Extended blocks have letters with superscripted letters to show special sounds.
Related articles
This article is a child-friendly adaptation of the Wikipedia article on Unicode subscripts and superscripts, available under CC BY-SA 4.0.
Images from Wikimedia Commons. Tap any image to view credits and license.
Safekipedia