Preprocessing Reference
darts_preprocessing
Data preprocessing and feature engineering for the DARTS dataset.
__version__ = importlib.metadata.version('darts-nextgen')
module-attribute
preprocess_legacy(ds_optical, ds_arcticdem, ds_tcvis)
Preprocess optical data with legacy (DARTS v1) preprocessing steps.
The processing steps are:
- Calculate NDVI
- Merge everything into a single dataset.
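The NDVI step can be illustrated with a minimal sketch. It uses the standard definition NDVI = (NIR - Red) / (NIR + Red) on plain NumPy arrays; the function name `calculate_ndvi` and the sample values are hypothetical, and the library's own implementation may use different band names, scaling, or output encoding.

```python
import numpy as np


def calculate_ndvi(red: np.ndarray, nir: np.ndarray) -> np.ndarray:
    """Return (NIR - Red) / (NIR + Red), with NaN where NIR + Red is zero.

    Hypothetical helper for illustration, not the library's function.
    """
    red = red.astype("float64")
    nir = nir.astype("float64")
    total = nir + red
    # Pre-fill with NaN so pixels with a zero denominator stay undefined.
    ndvi = np.full(total.shape, np.nan)
    np.divide(nir - red, total, out=ndvi, where=total != 0)
    return ndvi


# Made-up reflectance values for a 2x2 scene.
red = np.array([[100, 200], [0, 50]])
nir = np.array([[300, 200], [0, 150]])
ndvi = calculate_ndvi(red, nir)
```

In a real pipeline the resulting array would then be attached to the optical dataset before the merge step.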
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ds_optical | Dataset | The Planet or Sentinel-2 scene optical data. | required |
ds_arcticdem | Dataset | The ArcticDEM data. | required |
ds_tcvis | Dataset | The TCVIS data. | required |
Returns:
Type | Description |
---|---|
Dataset | The preprocessed dataset (an xr.Dataset). |
Source code in darts-preprocessing/src/darts_preprocessing/preprocess.py
preprocess_legacy_fast(ds_merged, ds_arcticdem, ds_tcvis, tpi_outer_radius=100, tpi_inner_radius=0, device=DEFAULT_DEVICE)
Preprocess optical data with legacy (DARTS v1) preprocessing steps, but with new data concepts.
The processing steps are:
- Calculate NDVI
- Calculate slope and relative elevation from ArcticDEM
- Merge everything into a single dataset.
The main difference from preprocess_legacy is how the ArcticDEM data is handled. Instead of loading already preprocessed ArcticDEM data from a VRT, this function expects the raw ArcticDEM data and calculates slope and relative elevation on the fly.
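To make the slope and relative elevation step more concrete, here is a rough NumPy sketch. It assumes relative elevation is a topographic position index (TPI): the cell elevation minus the mean elevation inside an annulus kernel defined by an inner and outer radius in meters. All function names, the edge padding, and the inclusive radius comparison are assumptions for illustration; the library's actual computation (and its GPU path) may differ.

```python
import numpy as np


def annulus_kernel(outer_radius_m: float, inner_radius_m: float, resolution_m: float) -> np.ndarray:
    """Boolean mask of cells whose centre lies between the two radii (inclusive)."""
    outer_px = int(np.ceil(outer_radius_m / resolution_m))
    offsets = np.arange(-outer_px, outer_px + 1) * resolution_m
    yy, xx = np.meshgrid(offsets, offsets, indexing="ij")
    dist = np.hypot(yy, xx)
    return (dist >= inner_radius_m) & (dist <= outer_radius_m)


def relative_elevation(dem: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """TPI sketch: elevation minus the mean elevation under the kernel."""
    half = kernel.shape[0] // 2
    # Edge padding is an assumption; border values are therefore approximate.
    padded = np.pad(dem.astype("float64"), half, mode="edge")
    rows, cols = dem.shape
    neighbour_sum = np.zeros((rows, cols))
    # Add up one shifted copy of the DEM per active kernel cell.
    for dy, dx in zip(*np.nonzero(kernel)):
        neighbour_sum += padded[dy:dy + rows, dx:dx + cols]
    return dem - neighbour_sum / kernel.sum()


def slope_degrees(dem: np.ndarray, resolution_m: float) -> np.ndarray:
    """Slope in degrees from finite-difference gradients."""
    dz_dy, dz_dx = np.gradient(dem.astype("float64"), resolution_m)
    return np.degrees(np.arctan(np.hypot(dz_dx, dz_dy)))


# Synthetic 8x8 DEM at 2 m resolution: a plane rising 1 m per metre eastwards.
resolution = 2.0
dem = np.tile(np.arange(8) * resolution, (8, 1))
kernel = annulus_kernel(outer_radius_m=4.0, inner_radius_m=2.0, resolution_m=resolution)
tpi = relative_elevation(dem, kernel)
slope = slope_degrees(dem, resolution)
```

On a uniform plane the annulus mean equals the centre elevation away from the borders, so the interior TPI is zero while the slope is constant.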
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ds_merged | Dataset | The Planet or Sentinel-2 scene optical dataset, including data_masks. | required |
ds_arcticdem | Dataset | The ArcticDEM dataset. | required |
ds_tcvis | Dataset | The TCVIS dataset. | required |
tpi_outer_radius | int | The outer radius of the annulus kernel for the TPI calculation, in meters. | 100 |
tpi_inner_radius | int | The inner radius of the annulus kernel for the TPI calculation, in meters. | 0 |
device | Literal['cuda', 'cpu'] \| int | The device to run the TPI and slope calculations on. If "cuda", the first device (0) is used; if an int, the device with that index is used. Defaults to "cuda" if CUDA is available, else "cpu". | DEFAULT_DEVICE |
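The device rule described for the device parameter can be sketched in plain Python. The helpers `resolve_device` and `default_device` are hypothetical and only mirror the documented behavior; how the library actually derives DEFAULT_DEVICE or selects a GPU is not shown in this reference.

```python
from typing import Literal, Optional, Tuple, Union


def resolve_device(device: Union[Literal["cuda", "cpu"], int]) -> Tuple[str, Optional[int]]:
    """Map the documented device argument to a (backend, index) pair.

    Hypothetical helper: an int selects that CUDA device, "cuda" selects
    device 0, and "cpu" has no device index.
    """
    if isinstance(device, int):
        return ("cuda", device)
    if device == "cuda":
        return ("cuda", 0)
    if device == "cpu":
        return ("cpu", None)
    raise ValueError(f"Unsupported device: {device!r}")


def default_device(cuda_available: bool) -> str:
    """Documented default: "cuda" if CUDA is available, else "cpu"."""
    return "cuda" if cuda_available else "cpu"


first_gpu = resolve_device("cuda")
second_gpu = resolve_device(1)
cpu_only = resolve_device("cpu")
```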
Returns:
Type | Description |
---|---|
Dataset | The preprocessed dataset (an xr.Dataset). |
Source code in darts-preprocessing/src/darts_preprocessing/preprocess.py