Recipe Schema Fun

Exploring and repackaging V-ZUG data

We got an interesting database of recipes to play with from V-Zug Home at the Open Food Data Hackathon, used in a mobile application to program smart kitchen devices. We took a closer look and investigated ways of combining it with other data sources and tools responding to makeopendata challenge #12. See also the Beat project for an exploration of this data.

In the project repository there is a Jupyter notebook written in Python which explores the data, along with a script to convert the ~150 XML files we received according to a schema defined in recipe.py. We made a very quick visualization to demonstrate the use of data analysis libraries.

We created an example Data Package containing a summary of the dataset in CSV format, as well as a JSON formatted recipe schema proposal in recipe.json. These are proposed as a potential starting point for future discussions about developing an open standard, the advantages of which may include participation of the wider development community, better interaction with other manufacturers, and consumer trust. Here is a preview of our summary DataFrame:

name ingredients instructions skill duration
0 Mozzarella, green bean and onion mash 8 2 easy fast
1 Apple puff pastry horseshoes 6 5 easy medium
2 Tarte Tatin 4 2 moderate medium
3 Mashed potato with lime 4 3 easy fast
4 Duck breast with a honey and soy glaze and plu... 16 3 moderate medium

And JSON schema:

{
  "uuid": "vzug.internet.05.erdbeersirup",
  "supported-languages": [
    "de",
    "en",
    "fr",
    "it"
  ],
  "name": {
    "de": "Erdbeersirup",
    "en": "Strawberry syrup",
    "fr": "Sirop de fraise",
    "it": "Sciroppo di fragole"
}, 
...

Preparation

No special libraries are required to use the parsing script. The conversion script convert.py references the Python Data Analysis library for CSV file generation. The Jupyter notebook includes some data analysis using the Pandas, Numpy and Matplotlib libraries. You can find some setup instructions here. The schema of this Data Package was inferred using Frictionless Data CLI tools.

Research

In this project we conducted some background research of schemas used in other recipe application, particularly of interest are these cloud providers:

And these open source initiatives:

License

The licensing terms of this dataset have not yet been established. If you intend to use these data in a public or commercial product, check with each of the data sources for any specific restrictions.

This Data Package is made available by its maintainers under the Public Domain Dedication and License v1.0, a copy of the full text of which is in LICENSE.md.

Edited

11.09.2018 19:51 ~ oleg

Event finished

Joined the team

09.09.2018 00:55 ~ Dominik

Edited

09.09.2018 00:52 ~ oleg

Joined the team

09.09.2018 00:44 ~ oleg

First post View challenge

09.09.2018 00:44 ~ oleg

Challenge

Event started

 

Connect to our community on Team Chat | Twitter | Facebook

All attendees, sponsors, partners, volunteers and staff at our hackathon are required to agree with the Hack Code of Conduct. Organisers will enforce this code throughout the event. We expect cooperation from all participants to ensure a safe environment for everybody. For more details on how the event is run, see the Guidelines on our wiki.

Creative Commons LicenceThe contents of this website, unless otherwise stated, are licensed under a Creative Commons Attribution 4.0 International License.