Explore projects
Contains scripts for converting UZH CL corpora from XML format to a standardised PaCoCo format.
Course materials for History of the Contemporary World / Zeitgeschichte
Codebase to detect and extract clusters of similar images, so-called motifs, from image data using SIFT and HDBSCAN.
Quasi-fork of Graën's Cutter (cutter-ng, https://github.com/j0hannes/cutter-ng).
Implements a new ruleset for Cutter to correctly tokenize the CMC corpus COVIDComments.