Skip to content

20250907_01-Release - CodExorcism Edition - Not just for Codex

Choose a tag to compare

@unixwzrd unixwzrd released this 07 Sep 23:56
· 9 commits to main since this release

20250907_01-Release - CodExorcism Edition - Not just for Codex

  • Expanded quote normalization: map additional Unicode quote/prime/angle/fullwidth marks to ASCII ' and " for shell-safe output
  • Added new options:
    • -Q / --keep-smart-quotes: preserve Unicode curly/smart quotes
    • -D / --keep-dashes: preserve EN/EM dashes
  • Normalize ellipses: (U+2026) and (U+22EF) → ...; (U+2025) → ..
  • Normalize Unicode spaces: replace NBSP (U+00A0), NARROW NBSP (U+202F), EN/EM/THIN spaces (U+2000–U+200A), IDEOGRAPHIC SPACE (U+3000), etc., with ASCII space
  • Remove bidi/zero-width controls: strip LRM/RLM, embeddings/overrides/isolates, ZWSP/ZWNJ/ZWJ, BOM
  • Refined VS Code filter handling: only apply newline compensation in filter mode; never in file-write modes; respect CI/CD env
  • Note: These artifacts were observed in content produced by Codex/VS Code extensions
  • No breaking changes; behavior unchanged for already-clean inputs