wdpr
    • Run the full preprocessing pipeline on raw wikitext.

      Applies the following transformations in order:

      1. Whitespace normalization (DOS/Mac newlines, tabs, leading spaces, etc.)
      2. Typographic substitutions (curly quotes, ellipsis)

      Parameters

      • text: string

        Raw wikitext input

      Returns string

      Preprocessed text ready for tokenization
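
The two-stage order described above can be sketched roughly as follows. This is a minimal illustration, not wdpr's actual implementation: the names `normalizeWhitespace`, `substituteTypography`, and `preprocessSketch` are hypothetical, and so are the specific rules. It assumes tabs expand to four spaces, and that the typographic step turns ASCII sequences into their typographic forms (the documentation does not state the direction). Only a subset of the listed transformations is covered; leading-space handling is omitted.

```typescript
// Hypothetical sketch of a two-stage wikitext preprocessor.
// All function names and rules here are assumptions for illustration.

// Stage 1: whitespace normalization (subset).
function normalizeWhitespace(text: string): string {
  return text
    .replace(/\r\n?/g, "\n") // DOS (\r\n) and classic Mac (\r) newlines become \n
    .replace(/\t/g, "    "); // assumed rule: each tab becomes four spaces
}

// Stage 2: typographic substitutions (subset, assumed direction).
function substituteTypography(text: string): string {
  return text
    .replace(/``(.*?)''/g, "\u201C$1\u201D") // ``text'' becomes curly double quotes
    .replace(/\.\.\./g, "\u2026"); // three dots become a single ellipsis character
}

// Full pipeline: whitespace first, then typography, matching the documented order.
function preprocessSketch(text: string): string {
  return substituteTypography(normalizeWhitespace(text));
}

console.log(JSON.stringify(preprocessSketch("a\r\nb\rc\td ``x'' ...")));
```

Running whitespace normalization first means the typographic rules only ever see `\n` line endings, so their patterns do not need to account for platform-specific newlines.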