Preprocessing and Encoding for the Corpus Workbench (CWB).
Contains scripts for converting UZH CL corpora from XML format to a standardised PaCoCo format.