{ "cells": [ { "cell_type": "markdown", "id": "0", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "# Mapping to the toolkit\n", "\n", "The MetObs-toolkit uses standard names and formats for your data. To use the toolkit,\n", "your observational data must be converted to the toolkit standards this is referred to as **mapping**.\n", "\n", "To specify how the mapping must be done a **template** is used. This template contains\n", "all the information on how to convert your tabular data to the toolkit standards.\n", "A template is saved as a file (JSON file) and can be reused or shared. In practice you only need to \n", "make one template file, for your network.\n", "\n", "On this page, you can find information on how to construct a template." ] }, { "cell_type": "markdown", "id": "3", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [ "remove-input" ] }, "source": [ "## Raw data Structures\n", "\n", "To make a template you must be aware of which format your data is in. The toolkit can handle the following data structures:" ] }, { "cell_type": "markdown", "id": "4", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "### Long-format\n", "Observations are stacked in rows per station. One column represents the station names." ] }, { "cell_type": "markdown", "id": "5", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "| Timestamp | 2m Temperature | 2m Humidity | ID |\n", "| -------- | ------- | ------- | ------- |\n", "| 2022-06-07 13:20:00 | 16.4 | 77.3 | Station_A |\n", "| 2022-06-07 13:30:00 | 16.7 | 75.6 | Station_A |\n", "| 2022-06-07 13:20:00 | 18.3 | 68.9 | Station_B |\n", "| 2022-06-07 13:30:00 | 18.6 | 71.9 | Station_B |\n" ] }, { "cell_type": "markdown", "id": "6", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "### Single-station-format\n", "The same as a long format but without a column indicating the station names. Be aware that the toolkit interprets it as observations coming from one station." ] }, { "cell_type": "markdown", "id": "7", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "| Timestamp | 2m Temperature | 2m Humidity |\n", "| -------- | ------- | ------- |\n", "| 2022-06-07 13:20:00 | 16.4 | 77.3 |\n", "| 2022-06-07 13:30:00 | 16.7 | 75.6 |" ] }, { "cell_type": "markdown", "id": "8", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "### Wide-format\n", "Columns represent different stations. The data represents one observation type." ] }, { "cell_type": "markdown", "id": "9", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "| Timestamp | Station_A | Station_B |\n", "| -------- | ------- | ------- |\n", "| 2022-06-07 13:20:00 | 16.4 | 18.3 |\n", "| 2022-06-07 13:30:00 | 16.7 | 18.6 |" ] }, { "cell_type": "markdown", "id": "10", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "## Template creation\n", "\n", "Once you have converted your tabular data files to either long-, wide-, or single-station-format, and saved them as a .csv file, a template can be made." ] }, { "cell_type": "markdown", "id": "11", "metadata": { "editable": true, "slideshow": { "slide_type": "" }, "tags": [] }, "source": [ "\n", "