Dataset#

The Dataset holds a collection of Stations. All methods applied on a dataset are applied on all present stations (and a target observationtype).

Constructor#

Dataset()

Dataset class for managing and processing meteorological observation data.

Data attributes#

A summary of all the attributes that hold or return data.

Dataset.stations

Get the list of Stations present in the Dataset.

Dataset.obstypes

Get the dictionary of known Obstypes by the Dataset.

Dataset.template

Get the Template instance used when the data was imported.

Dataset.df

Dataset DataFrame constructor.

Dataset.metadf

Construct a DataFrame representation of metadata.

Dataset.outliersdf

Construct a DataFrame representation of all the outliers.

Dataset.gapsdf

Construct a DataFrame representation of all the gaps.

Dataset.modeldatadf

Construct a DataFrame representation of all the present model data.

Dataset.present_observations

Get a list of all the present observation types.

Dataset.start_datetime

Get the earliest start datetime from the observation data.

Dataset.end_datetime

Get the latest end datetime from the observation data.

General methods and attributes#

Dataset.get_info([printout])

Retrieve and optionally print detailed information about the station.

Dataset.subset_by_stations(stationnames[, ...])

Create a subset of the dataset by selecting specific stations.

Dataset.get_station(stationname)

Retrieve a Station by name.

Dataset.rename_stations(renamedict)

Rename stations in the dataset.

Dataset.sync_records([obstype, ...])

Synchronize records of sensor data across stations.

Dataset.resample(target_freq[, obstype, ...])

Resample observation data to a specified frequency.

Dataset.add_new_observationtype(obstype)

Add a new observation type to the dataset known-obstypes.

Dataset.create_distancematrix()

Compute pairwise great-circle distances between all stations.

Exporting and converting methods#

Dataset.import_data_from_file(template_file)

Import observational data and metadata from files.

Dataset.save_dataset_to_pkl([filepath, ...])

Save the dataset to a pickle (.pkl) file.

Dataset.to_csv([filepath, overwrite])

Save the dataset observations to a CSV file.

Dataset.to_parquet([filepath, overwrite])

Save the dataset observations to a parquet file.

Dataset.to_xr()

Concatenate multiple station Datasets into one along a new 'name' dimension.

Dataset.to_netcdf([filepath, overwrite])

Save the Dataset as a netCDF file.

Visualisations#

Dataset.make_plot_of_modeldata([obstype, ...])

Generate a timeseries plot of model data for a specific observation type.

Dataset.make_plot([obstype, colorby, ...])

Generate a time series plot for observational data.

Dataset.make_gee_plot([gee_manager, ...])

Create an interactive spatial plot of the GEE dataset and stations.