split_yaml ¶
CLI module for splitting YAML configuration files.
This module provides functionality to split a single YAML configuration file into multiple YAML files, each containing a specific combination of data transformations and splits. The resulting YAML files can be used as input configurations for the stimulus package.
Functions:
-
get_args
–Get the arguments when using from the command line.
-
main
–Reads a YAML config file and generates all possible data configurations.
get_args ¶
get_args() -> Namespace
Get the arguments when using from the command line.
Source code in src/stimulus/cli/split_yaml.py
22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
|
main ¶
Reads a YAML config file and generates all possible data configurations.
This script reads a YAML with a defined structure and creates all the YAML files ready to be passed to the stimulus package.
The structure of the YAML is described here -> TODO paste here link to documentation. This YAML and it's structure summarize how to generate all the transform - split and respective parameter combinations. Each resulting YAML will hold only one combination of the above three things.
This script will always generate at least one YAML file that represent the combination that does not touch the data (no transform) and uses the default split behavior.
Source code in src/stimulus/cli/split_yaml.py
48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 |
|