Datasets
PME-toolkit operates on datasets stored in external files (typically .mat) and referenced through configuration files.
Dataset organization
Dataset metadata is documented in:
databases/glider/databases/airfoil/
These folders describe:
- dataset structure
- variable definitions
- required files
Benchmark datasets
The main benchmark datasets are distributed via Zenodo:
Glider
Serani, A., & Palma, G. (2026).
Design-Space Dimensionality Reduction Benchmark Dataset – Bio-Inspired Underwater Glider (1.1) [Data set].
https://doi.org/10.5281/zenodo.18936594
Airfoil
Serani, A., & Quagliarella, D. (2026).
Design-Space Dimensionality Reduction Benchmark Dataset – RAE2822 Airfoil (1.0) [Data set].
https://doi.org/10.5281/zenodo.18958555
Dataset file
The dataset is specified in the configuration file:
"io": {
"dbfile": "path/to/database.mat"
}
The file typically contains:
- geometric data
- design variables
- optional physical quantities
Data structure
Datasets are organized sample-wise, where:
- each column represents one design configuration
- all data sources are aligned across samples
The dataset may include:
Geometry
- discretized coordinates
- structured according to
geomdefinition
Variables
- parametric design variables
- dimension defined by
vars.Mbase
Physics (optional)
- distributed fields (e.g. pressure coefficients)
- scalar quantities (e.g. drag, lift)
Consistency requirements
All dataset components must:
- have the same number of samples
- be aligned across geometry, variables, and physics
- be free of missing values (after filtering)
Usage
To use a dataset:
- download the dataset from Zenodo
- extract it locally
- update the
dbfilepath in the JSON configuration - run the benchmark
Notes
- dataset structure must match the configuration file
- geometry, variables, and physics must match
geom,vars, andphys - incorrect alignment will lead to invalid results
Summary
Datasets provide the input to PME workflows and must be:
- versioned (via Zenodo)
- structured consistently
- correctly referenced in configuration files