Wikidot converts certain ASCII character sequences into their Unicode
typographic equivalents before parsing. This module handles the following
conversions:
... (three dots) and . . . (spaced dots) become an ellipsis (U+2026)
Em dash conversion (-- to U+2014) is intentionally NOT handled here.
It is performed in the parser instead, because the -- sequence also appears
in HTML comment markers ([!-- and --]), and converting it during
preprocessing would break comment detection.
Typographic preprocessing for Wikidot markup.
Wikidot converts certain ASCII character sequences into their Unicode typographic equivalents before parsing. This module handles the following conversions:
`` ... ''becomes left/right double curly quotes (U+201C / U+201D),, ... ''becomes low-9 double quote + right double quote (U+201E / U+201D)` ... 'becomes left/right single curly quotes (U+2018 / U+2019)...(three dots) and. . .(spaced dots) become an ellipsis (U+2026)Em dash conversion (
--to U+2014) is intentionally NOT handled here. It is performed in the parser instead, because the--sequence also appears in HTML comment markers ([!--and--]), and converting it during preprocessing would break comment detection.