diff --git a/docs/datasets/building.rst b/docs/datasets/building.rst
index 86138f1..1bd8ecc 100644
--- a/docs/datasets/building.rst
+++ b/docs/datasets/building.rst
@@ -4,15 +4,6 @@
  Building datasets
 ###################
 
-..
-   ************
-
-..
-   Principles
-
-..
-   ************
-
 ..
    .. figure:: build.png
 
@@ -22,9 +13,6 @@
 ..
    :scale: 50%
 
-..
-   Building datasets
-
 **********
  Concepts
 **********
@@ -90,46 +78,3 @@ concat
 
 .. literalinclude:: building.yaml
    :language: yaml
-
-****************
- Top-level keys
-****************
-
-dadkas;k;level
-
--  description
--  dataset_status
--  purpose
--  name
--  config_format_version
-
-*******
- Dates
-*******
-
-The ``dates`` block specifies the start and end dates of the dataset, as
-well as the frequency of the data. The frequency is specified in hours.
-
-*******
- Input
-*******
-
-The ``input`` block specifies the input data that will be used to build
-the dataset. The ``join`` block specifies the datasets that will be
-joined together to form the input data. The ``mars`` block specifies the
-MARS datasets that will be used. The ``constants`` block specifies the
-constants that will be used.
-
-********
- Output
-********
-
-The ``output`` block specifies the output data that will be built. The
-``chunking`` block specifies the chunking of the output data. The
-``dtype`` block specifies the data type of the output data. The
-``flatten_grid`` block specifies whether the output data will be
-flattened. The ``order_by`` block specifies the order of the output
-data. The ``statistics`` block specifies the statistics that will be
-calculated. The ``statistics_end`` block specifies the end date of the
-statistics. The ``remapping`` block specifies the remapping of the
-output data.
diff --git a/docs/datasets/building.yaml b/docs/datasets/building.yaml
index 39b677e..c991100 100644
--- a/docs/datasets/building.yaml
+++ b/docs/datasets/building.yaml
@@ -1,12 +1,7 @@
-description: Boundary condition for MetNO's LAM model rotated
-# dataset_status: experimental
-# purpose: aifs
-name: aifs-ea-an-oper-0001-mars-n320-2020-2023-6h-v2-metno-bc-rotated
-# config_format_version: 2
-
+description: Example dataset
 
 dates:
-  start: 2020-02-05 00:00:00
+  start: 2020-01-01 00:00:00
   end: 2023-12-31 18:00:00
   frequency: 6h
 
@@ -40,14 +35,10 @@ input:
         - insolation
 
 output:
-  # chunking: {dates: 1, ensembles: 1}
-  # dtype: float32
-  # flatten_grid: True
   order_by:
     - valid_datetime
     - param_level
     - number
   statistics: param_level
-  # statistics_end: 2022
   remapping:
     param_level: "{param}_{levelist}"
diff --git a/docs/datasets/filters.rst b/docs/datasets/filters.rst
index 20ffea0..aefac54 100644
--- a/docs/datasets/filters.rst
+++ b/docs/datasets/filters.rst
@@ -3,7 +3,3 @@
 #########
  Filters
 #########
-
-kjdlkjklld
-
-dsklajdlksa
diff --git a/docs/datasets/sources.rst b/docs/datasets/sources.rst
index 5dd049b..0cb125c 100644
--- a/docs/datasets/sources.rst
+++ b/docs/datasets/sources.rst
@@ -8,8 +8,6 @@
  mars
 ******
 
-ddd
-
 ******
  grib
 ******
@@ -21,5 +19,3 @@ ddd
 *********
  opendap
 *********
-
-ssss
diff --git a/docs/datasets/using.rst b/docs/datasets/using.rst
deleted file mode 100644
index a031fc7..0000000
--- a/docs/datasets/using.rst
+++ /dev/null
@@ -1,479 +0,0 @@
-#########################
- Using training datasets
-#########################
-
-A ``dataset`` wraps a ``zarr`` file that follows the format used by
-ECMWF to train its machine learning models.
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2")
-
-The dataset can be passed as a path or URL to a ``zarr`` file, or as a
-name. In the later case, the package will use the entry ``zarr_root`` of
-``~/.ecml-tool`` file to create the full path or URL:
-
-.. code:: yaml
-
-   zarr_root: /path_or_url/to/the/zarrs
-
-*************************
- Attributes of a dataset
-*************************
-
-As the underlying ``zarr``, the ``dataset`` is an iterable:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2")
-
-Print the number of rows (i.e. dates):
-
-.. code:: python
-
-   print(len(ds))
-
-... iterate throw the rows,
-
-.. code:: python
-
-   for row in ds:
-       print(row)
-
-or access a item directly.
-
-.. code:: python
-
-   print(row[10])
-
-You can retrieve the shape of the dataset,
-
-.. code:: python
-
-   print(ds.shape)
-
-the list of variables,
-
-.. code:: python
-
-   print(ds.variables)
-
-the mapping between variable names and columns index
-
-.. code:: python
-
-   two_t_index = ds.name_to_index["2t"]
-   row = ds[10]
-   print("2t", row[two_t_index])
-
-Get the list of dates (as NumPy datetime64)
-
-.. code:: python
-
-   print(ds.dates)
-
-The number of hours between consecutive dates
-
-.. code:: python
-
-   print(ds.frequency)
-
-The resolution of the underlying grid
-
-.. code:: python
-
-   print(ds.resolution)
-
-The list of latitudes of the data values (NumPy array)
-
-.. code:: python
-
-   print(ds.latitudes)
-
-The same for longitudes
-
-.. code:: python
-
-   print(ds.longitudes)
-
-And the statitics
-
-.. code:: python
-
-   print(ds.statistics)
-
-The statistics is a dictionary of NumPy vectors following the order of
-the variables:
-
-.. code:: python
-
-   {
-       "mean": ...,
-       "stdev": ...,
-       "minimum": ...,
-       "maximum": ...,
-   }
-
-To get the statistics for ``2t``:
-
-.. code:: python
-
-   two_t_index = ds.name_to_index["2t"]
-   stats = ds.statistics
-   print("Average 2t", stats["mean"][two_t_index])
-
-*********************
- Subsetting datasets
-*********************
-
-You can create a view on the ``zarr`` file that selects a subset of
-dates.
-
-Changing the frequency
-======================
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       freqency="12h")
-
-The ``frequency`` parameter can be a integer (in hours) or a string
-following with the suffix ``h`` (hours) or ``d`` (days).
-
-Selecting years
-===============
-
-You can select ranges of years using the ``start`` and ``end`` keywords:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   training = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       start=1979,
-       end=2020)
-
-   test = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2"
-       start=2021,
-       end=2022)
-
-The selection includes all the dates of the ``end`` years.
-
-Selecting more precise ranges
-=============================
-
-You can select a few months, or even a few days:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   training = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       start=202306,
-       end=202308)
-
-   test = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2"
-       start=20200301,
-       end=20200410)
-
-The following are equivalent way of describing ``start`` or ``end``:
-
--  ``2020`` and ``"2020"``
--  ``202306``, ``"202306"`` and ``"2023-06"``
--  ``20200301``, ``"20200301"`` and ``"2020-03-01"``
-
-You can omit either ``start`` or ``end``. In that case the first and
-last date of the dataset will be used respectively.
-
-Combining both
-==============
-
-You can combine both subsetting methods:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   training = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       start=1979,
-       end=2020,
-       frequency="6h")
-
-********************
- Combining datasets
-********************
-
-You can create a virtual dataset by combining two or more ``zarr``
-files.
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset(
-       "dataset-1",
-       "dataset-2",
-       "dataset-3",
-       ...
-   )
-
-When given a list of ``zarr`` files, the package will automatically work
-out if the files can be *concatenated* or *joined* by looking at the
-range of dates covered by each files.
-
-If the dates are different, the files are concatenated. If the dates are
-the same, the files are joined. See below for more information.
-
-************************
- Concatenating datasets
-************************
-
-You can concatenate two or more datasets along the dates dimension. The
-package will check that all datasets are compatible (same resolution,
-same variables, etc.). Currently, the datasets must be given in
-chronological order with no gaps between them.
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1940-1978-1h-v2",
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2"
-   )
-
-.. figure:: concat.png
-   :alt: Concatenation
-
-   Concatenation
-
-Please note that you can pass more than two ``zarr`` files to the
-function.
-
-   **NOTE:** When concatenating file, the statistics are not recomputed;
-   it is the statistics of first file that are returned to the user.
-
-******************
- Joining datasets
-******************
-
-You can join two datasets that have the same dates, combining their
-variables.
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       "some-extra-parameters-from-another-source-o96-1979-2022-1h-v2",
-   )
-
-.. figure:: join.png
-   :alt: Join
-
-   Join
-
-If a variable is present in more that one file, that last occurrence of
-that variable will be used, and will be at the position of the first
-occurrence of that name.
-
-.. figure:: overlay.png
-   :alt: Overlay
-
-   Overlay
-
-Please note that you can join more than two ``zarr`` files.
-
-***********************************************
- Selection, ordering and renaming of variables
-***********************************************
-
-You can select a subset of variables when opening a ``zarr`` file. If
-you pass a ``list``, the variables are ordered according the that list.
-If you pass a ``set``, the order of the file is preserved.
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   # Select '2t' and 'tp' in that order
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       select = ["2t", "tp"],
-   )
-
-   # Select '2t' and 'tp', but preserve the order in which they are in the file
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       select = {"2t", "tp"},
-   )
-
-You can also drop some variables:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       drop = ["10u", "10v"],
-   )
-
-and reorder them:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   # ... using a list
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       reorder = ["2t", "msl", "sp", "10u", "10v"],
-   )
-
-   # ... or using a dictionnary
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       reorder = {"2t": 0, "msl": 1, "sp": 2, "10u": 3, "10v": 4},
-   )
-
-You can also rename variables:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       rename = {"2t": "t2m"},
-   )
-
-This will be useful when your join datasets and do not want variables
-from one dataset to override the ones from the other.
-
-*******************
- Using all options
-*******************
-
-You can combine all of the above:
-
-.. code:: python
-
-   from ecml_tools.data import open_dataset
-
-   ds = open_dataset(
-       "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-       "some-extra-parameters-from-another-source-o96-1979-2022-1h-v2",
-       start=2000,
-       end=2001,
-       frequency="12h",
-       select={"2t", "2d"},
-       ...
-   )
-
-*****************************************
- Building a dataset from a configuration
-*****************************************
-
-In practice, you will be building datasets from a configuration file,
-such as a YAML file:
-
-.. code:: python
-
-   import yaml
-   from ecml_tools.data import open_dataset
-
-   with open("config.yaml") as f:
-       config = yaml.safe_load(f)
-
-   training = open_dataset(config["training"])
-   test = open_dataset(config["test"])
-
-This is possible because ``open_dataset`` can be build from simple lists
-and dictionaries:
-
-From a string
-
-.. code:: python
-
-   ds = open_dataset("aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2")
-
-From a list of strings
-
-.. code:: python
-
-   ds = open_dataset(
-       [
-           "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-           "aifs-ea-an-oper-0001-mars-o96-2023-2023-1h-v2",
-       ]
-   )
-
-From a dictionary
-
-.. code:: python
-
-   ds = open_dataset(
-       {
-           "dataset": "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-           "frequency": "6h",
-       }
-   )
-
-From a list of dictionnary
-
-.. code:: python
-
-   ds = open_dataset(
-       [
-           {
-               "dataset": "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-               "frequency": "6h",
-           },
-           {
-               "dataset": "some-extra-parameters-from-another-source-o96-1979-2022-1h-v2",
-               "frequency": "6h",
-               "select": ["sst", "cape"],
-           },
-       ]
-   )
-
-And even deeper constructs
-
-.. code:: python
-
-   ds = open_dataset(
-       [
-           {
-               "dataset": "aifs-ea-an-oper-0001-mars-o96-1979-2022-1h-v2",
-               "frequency": "6h",
-           },
-           {
-               "dataset": [
-                   {
-                       "dataset": "aifs-od-an-oper-8888-mars-o96-1979-2022-6h-v2",
-                       "drop": ["ws"],
-                   },
-                   {
-                       "dataset": "aifs-od-an-oper-9999-mars-o96-1979-2022-6h-v2",
-                       "select": ["ws"],
-                   },
-               ],
-               "frequency": "6h",
-               "select": ["sst", "cape"],
-           },
-       ]
-   )
diff --git a/docs/index.rst b/docs/index.rst
index 9b69a7b..c1de965 100644
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -63,7 +63,6 @@ and Zarr_.
 
 -  :doc:`datasets/about`
 -  :doc:`datasets/building`
--  :doc:`datasets/using`
 -  :doc:`datasets/sources`
 -  :doc:`datasets/filters`
 -  :doc:`/datasets/options`
@@ -75,7 +74,6 @@ and Zarr_.
 
    datasets/about
    datasets/building
-   datasets/using
    datasets/sources
    datasets/filters
    datasets/options