Glossary Terms - Knowledge Base

All the glossary terms, knowledge base, explanations of Unicode Symbols, fancy symbols, emojis, characters, scripts etc.
  • Abstract character: A character that is not associated with a specific visual representation. Instead, it represents a semantic concept or a logical unit of text.
  • ASCII: A 7-bit character set that contains 128 characters, including 95 printable characters and 33 control characters. ASCII is a subset of Unicode.
  • BIDI: Short for bidirectional text, BIDI refers to text that is written in a mixed direction, such as Arabic or Hebrew.
  • BMP: Short for Basic Multilingual Plane, the BMP is the first 65,536 code points in Unicode, representing most commonly used characters in modern languages.
  • Byte Order Mark (BOM): A special marker that indicates the byte order of a Unicode text stream.
  • CJK Unified Ideographs: A range of Unicode code points that represents the ideographic characters used in Chinese, Japanese, and Korean languages.
  • Code point: A unique integer value assigned to each character in Unicode.
  • Combining character: A character that is used in combination with another character to create a new glyph, such as diacritical marks used in Latin scripts.
  • Dingbat: A symbol used for ornamentation or decorative purposes, such as arrows, stars, or flowers.
  • Emoji: A type of pictorial symbol that is commonly used in electronic communication to convey emotions, reactions, or concepts.
  • Emoticon: A sequence of keyboard characters used to represent a facial expression, such as :-) for a smiling face.
  • Font family: A group of related fonts that share the same design characteristics, such as weight, width, and style.
  • Fullwidth character: A character that takes up the full width of a character cell, such as Chinese and Japanese characters used in monospaced fonts.
  • Glyph: A graphical representation of a character or combination of characters.
  • Grapheme: A single unit of a writing system that carries meaning, such as a letter, numeral, or punctuation mark.
  • Hanzi radical: A component of a Chinese character that provides semantic information about the character's meaning.
  • Hanzi stroke: A single line or curve that forms part of a Chinese character.
  • Ideograph: A symbol or character that represents an idea or concept directly, without using a sound or a spoken language.
  • Ligature: A glyph that represents the combination of two or more characters into a single unit, such as "æ" or "fi".
  • Pictograph: A picture or symbol that represents a word or idea, such as a smiley face or a traffic sign.
  • Radical: A component of a Chinese character that provides phonetic information about the character's pronunciation.
  • Script: A set of written characters used to represent a language or group of languages, such as the Latin script used for English and other European languages.
  • Supplementary Multilingual Plane: A range of Unicode code points that contains additional characters not found in the BMP, including historic scripts, mathematical symbols, and musical notation.
  • Unicode: A universal character encoding standard that assigns a unique code point to each character in most of the world's writing systems.
  • Variant: A glyph that has a different appearance than another glyph representing the same character, such as a serif font versus a sans-serif font.
Reference: Unicode