×
This Challenge was posted 6 years ago
Challenge view
Back to ProjectSex and Crime
und Kneippenschlägereien in Early Modern Zurich
Goal
Make the data ("Stillstandsprotokolle des 17. Jahrhunderts") better searchable and georeference it for visualization.
Team
- Ernst Rosser, ernst.rosser@gmail.com
- Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
- Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
- Ismail Prada, ismail.prada@gmail.com
- Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
- Tobias Hodel, tobias.hodel@ji.zh.ch
Data sources:
Primary Data
Secondary data
Steps taken
- Create lookup for normalized strings (https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/woerterStillstand_Result.tsv)
- Annotate named entities (normalization) -> places (also add BfS-data) -> persons (normalization to be used for auto-complete in search)
- Cluster words -> based on "Frequenztabelle Stillstandsprotokolle", see https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/README.md#frequency-list-of-word-cluster -> to be used to refer to topic/concept
- Cluster documents -> to be used as keyword(s) in TEI header = Scripts for clustering, see folder "code"
- Create script to add information as tags (in body) to write in XML (in work)
Lemmatization/Normalisation
Done: Wordlist and Frequencies
ToDo: POS tagging
Named Entities
Names of persons: done A-D
Names of places: done A-K
Visualization
Word-Cluster
Visualization
(using fasttext)
Frequency list of Word-Cluster
https://docs.google.com/spreadsheets/d/1rFo7p9YsQRwJufMuWGw2677acOsWevcmm-lN5RVBJv4/edit?usp=sharing
GIS Visualization
https://beta.observablehq.com/@mmznrstat/sex-and-crime-und-kneipenschlagereien-in-der-fruhen-neuzei
Done: Borders from swisstopo via Linked Data, Matching of the settlements of the canton of Zurich
ToDo: Get List of old names of this settlements, match them and show all relating documents of a settlement (or municipality)