c108.unicode

Unicode text transformation utilities.

Provides functions for converting ASCII text to Unicode variants: superscripts, subscripts, and other typographic transformations.

Useful for:

- Mathematical notation (exponents, indices)
- Chemical formulas (H₂O, CO₂)
- Footnote markers (¹, ², ³)
- Terminal/CLI formatting where markup isn't available

`to_sub(text)`

Convert text to Unicode subscript characters.

Useful for chemical formulas, mathematical notation, array indices, etc.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `text` | `str \| int \| float` | String, integer, or float to convert. Non-string types are converted via `str()`. Unsupported characters pass through unchanged. | required |

Returns:

| Type | Description |
| --- | --- |
| `str` | String with supported characters converted to subscript Unicode. |

Supported characters

- Digits: 0-9 → ₀₁₂₃₄₅₆₇₈₉
- Operators: + - = ( ) → ₊₋₌₍₎
- Letters: a, e, o, h, i, j, k, l, m, n, p, r, s, t, u, v, x (limited lowercase only)

Notes

- Unicode defines subscript forms for only a small set of lowercase letters.
- Unsupported characters pass through unchanged.
- Missing punctuation: . , : ; ! ? ' " / \ @ # $ % ^ & * _ ~ ` [ ] { } < > | and space
- Missing letters: uppercase A–Z; lowercase b, c, d, f, g, q, w, y, z
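Because unsupported characters pass through silently, a mixed result such as 'H₂O' gives no signal about what was skipped. The following is a minimal sketch of a pre-check, assuming the supported set is exactly the one listed above. `unsupported_chars` and `SUPPORTED` are hypothetical names and are not part of `c108.unicode`.

```python
# Hypothetical helper, not part of c108.unicode.
# SUPPORTED is copied by hand from the documented list of convertible characters.
SUPPORTED = frozenset("0123456789+-=()aehijklmnoprstuvx")


def unsupported_chars(text: object) -> list[str]:
    """Return distinct characters with no subscript form, in first-seen order."""
    missing: list[str] = []
    for char in str(text):  # mirror to_sub(): non-strings go through str()
        if char not in SUPPORTED and char not in missing:
            missing.append(char)
    return missing


print(unsupported_chars("H2O"))     # ['H', 'O']
print(unsupported_chars("x(n+1)"))  # []
print(unsupported_chars(3.5))       # ['.']
```

An empty list means every character of the input has a subscript form; anything else names the characters that would be left as-is.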

Examples:

>>> to_sub(2)
'₂'

>>> to_sub("H2O")
'H₂O'

>>> to_sub("x(n+1)")
'ₓ₍ₙ₊₁₎'

>>> to_sub("CO2")
'CO₂'
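The chemical examples above work because uppercase letters have no subscript form. A formula containing a supported lowercase letter behaves differently: going by the documented mapping, `to_sub("Na2SO4")` would also convert the `a`, giving 'Nₐ₂SO₄'. For formulas, a digits-only conversion may be closer to what is wanted. The sketch below shows one way to do that; `sub_digits` and `DIGIT_SUBSCRIPTS` are hypothetical names, not part of `c108.unicode`.

```python
# Hypothetical helper, not part of c108.unicode: subscript digits only.
DIGIT_SUBSCRIPTS = str.maketrans("0123456789", "₀₁₂₃₄₅₆₇₈₉")


def sub_digits(formula: str) -> str:
    """Subscript ASCII digits and leave every other character untouched."""
    return formula.translate(DIGIT_SUBSCRIPTS)


print(sub_digits("Na2SO4"))  # Na₂SO₄
print(sub_digits("CaCl2"))   # CaCl₂
```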

See Also

to_sup() - Companion function for superscript conversion

Source code in c108/unicode.py

def to_sub(text: str | int | float) -> str:
    """
    Convert text to Unicode subscript characters.

    Useful for chemical formulas, mathematical notation, array indices, etc.

    Args:
        text: String, integer, or float to convert. Non-string types are
              converted via str(). Unsupported characters pass through unchanged.

    Returns:
        String with supported characters converted to subscript Unicode.

    Supported characters:
        - Digits: 0-9 → ₀₁₂₃₄₅₆₇₈₉
        - Operators: + - = ( ) → ₊₋₌₍₎
        - Letters: a, e, o, h, i, j, k, l, m, n, p, r, s, t, u, v, x (limited lowercase only)

    Notes:
        - Unicode has very limited subscript letter support
        - Unsupported characters pass through unchanged
        - Missing punctuation: . , : ; ! ? ' " '/' '\\' @ # $ % ^ & * _ ~ ` [ ] { } < > | and space
        - Missing letters: uppercase A–Z; lowercase b, c, d, f, g, q, w, y, z

    Examples:
        >>> to_sub(2)
        '₂'

        >>> to_sub("H2O")
        'H₂O'

        >>> to_sub("x(n+1)")
        'ₓ₍ₙ₊₁₎'

        >>> to_sub("CO2")
        'CO₂'

    See Also:
        to_sup() - Companion function for superscript conversion
    """
    text = str(text)

    subscript_map = {
        # Digits
        "0": "₀",
        "1": "₁",
        "2": "₂",
        "3": "₃",
        "4": "₄",
        "5": "₅",
        "6": "₆",
        "7": "₇",
        "8": "₈",
        "9": "₉",
        # Operators and punctuation
        "+": "₊",
        "-": "₋",
        "=": "₌",
        "(": "₍",
        ")": "₎",
        # Limited letter support in Unicode
        "a": "ₐ",
        "e": "ₑ",
        "o": "ₒ",
        "h": "ₕ",
        "i": "ᵢ",
        "j": "ⱼ",
        "k": "ₖ",
        "l": "ₗ",
        "m": "ₘ",
        "n": "ₙ",
        "p": "ₚ",
        "r": "ᵣ",
        "s": "ₛ",
        "t": "ₜ",
        "u": "ᵤ",
        "v": "ᵥ",
        "x": "ₓ",
    }

    return "".join(subscript_map.get(c, c) for c in text)
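The function above rebuilds its dictionary on every call and looks up each character individually. As a sketch of an alternative with the same mapping, the table can be built once at import time and applied with `str.translate`. `to_sub_translate` and `_SUB_TABLE` are hypothetical names used for illustration, not the module's actual implementation.

```python
# Sketch of an equivalent lookup, not the module's implementation.
# The two strings are aligned position by position; str.maketrans raises
# ValueError if their lengths differ.
_SUB_TABLE = str.maketrans(
    "0123456789+-=()aehijklmnoprstuvx",
    "₀₁₂₃₄₅₆₇₈₉₊₋₌₍₎ₐₑₕᵢⱼₖₗₘₙₒₚᵣₛₜᵤᵥₓ",
)


def to_sub_translate(text: object) -> str:
    """Convert supported characters to subscripts; others pass through."""
    return str(text).translate(_SUB_TABLE)


print(to_sub_translate(2))         # ₂
print(to_sub_translate("H2O"))     # H₂O
print(to_sub_translate("x(n+1)"))  # ₓ₍ₙ₊₁₎
```

Characters absent from the table are left unchanged by `str.translate`, which matches the pass-through behavior of `subscript_map.get(c, c)`.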

`to_sup(text)`

Convert text to Unicode superscript characters.

Supports digits, common operators, parentheses, and letters. Useful for mathematical notation, footnotes, exponents, etc.
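To illustrate the idea, here is a minimal sketch limited to digits, operators, and parentheses. It is an assumption-laden stand-in, not the source of `to_sup()`: the real function is described as also handling letters, which this sketch leaves unchanged. `to_sup_sketch` and `_SUP_TABLE` are hypothetical names.

```python
# Illustrative sketch only, not the source of to_sup().
# Covers digits, + - = and parentheses; letters pass through unchanged here.
_SUP_TABLE = str.maketrans("0123456789+-=()", "⁰¹²³⁴⁵⁶⁷⁸⁹⁺⁻⁼⁽⁾")


def to_sup_sketch(text: object) -> str:
    """Convert digits and basic operators to Unicode superscripts."""
    return str(text).translate(_SUP_TABLE)


print(to_sup_sketch(10))       # ¹⁰
print(to_sup_sketch("(2+3)"))  # ⁽²⁺³⁾
```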