diff --git a/README.md b/README.md index da4c718..9d62fad 100644 --- a/README.md +++ b/README.md @@ -153,7 +153,7 @@ inputs: ``` -Apart from identifiers to keep track of the configuration file format version and the datasets version, the configuration file is divided into two main sections: +Apart from identifiers to keep track of the configuration file format version and the dataset version (for you to keep track of changes that you make to the dataset), the configuration file is divided into two main sections: - `output`: defines the variables and dimensions of the output dataset produced by `mllam-data-prep`. These are the variables and dimensions that the inputs datasets will be mapped to. These should match the input variables and dimensions expected by the model architecture you are training. - `inputs`: a list of source datasets to extract data from. These are the datasets that will be mapped to the architecture defined in the `architecture` section.