This Challenge was posted 6 years ago
 

Challenge view

Back to Project

Sex and Crime

und Kneippenschlägereien in Early Modern Zurich


#glamhack2018

Goal

Make the data ("Stillstandsprotokolle des 17. Jahrhunderts") better searchable and georeference it for visualization.

Team

  • Ernst Rosser, ernst.rosser@gmail.com
  • Barbara Leimgruber, Barbara.Leimgruber@ji.zh.ch
  • Rebekka Plüss, Rebekka.Pluess@ji.zh.ch
  • Ismail Prada, ismail.prada@gmail.com
  • Matthias Mazenauer, matthias.mazenauer@statistik.ji.zh.ch
  • Tobias Hodel, tobias.hodel@ji.zh.ch

Data sources:

Steps taken

  • Create lookup for normalized strings (https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/woerterStillstand_Result.tsv)
  • Annotate named entities (normalization) -> places (also add BfS-data) -> persons (normalization to be used for auto-complete in search)
  • Cluster words -> based on "Frequenztabelle Stillstandsprotokolle", see https://github.com/mmznr/Staatsarchiv-GLAMhack/blob/master/README.md#frequency-list-of-word-cluster -> to be used to refer to topic/concept
  • Cluster documents -> to be used as keyword(s) in TEI header = Scripts for clustering, see folder "code"
  • Create script to add information as tags (in body) to write in XML (in work)

Lemmatization/Normalisation

  • Done: Wordlist and Frequencies

  • ToDo: POS tagging

Named Entities

  • Names of persons: done A-D

  • Names of places: done A-K

Visualization

Word-Cluster

Visualization

(using fasttext)

Frequency list of Word-Cluster

https://docs.google.com/spreadsheets/d/1rFo7p9YsQRwJufMuWGw2677acOsWevcmm-lN5RVBJv4/edit?usp=sharing

GIS Visualization

https://beta.observablehq.com/@mmznrstat/sex-and-crime-und-kneipenschlagereien-in-der-fruhen-neuzei

  • Done: Borders from swisstopo via Linked Data, Matching of the settlements of the canton of Zurich

  • ToDo: Get List of old names of this settlements, match them and show all relating documents of a settlement (or municipality)

Contributed 6 years ago by oleg for #GLAMhack 2018

Connect to our community on Team Chat | Twitter | Facebook

All attendees, sponsors, partners, volunteers and staff at our hackathon are required to agree with the Hack Code of Conduct. Organisers will enforce this code throughout the event. We expect cooperation from all participants to ensure a safe environment for everybody. For more details on how the event is run, see the Guidelines on our wiki.

Creative Commons LicenceThe contents of this website, unless otherwise stated, are licensed under a Creative Commons Attribution 4.0 International License.