SAMBAH
The Subsetter And Multi-dimensional Batched Aggregation in Harmony (SAMBAH) chain executes several services:
- CMR Query: Retrieves information from the Common Metadata Repository (CMR) (source)
- PODAAC L2-Subsetter (optional): Performs subsetting on level-2 data granules (source)
- Batchee (optional): Groups together filenames so that further operations (such as concatenation) can be performed separately on each group of files (source)
- Stitchee (optional): Concatenates netCDF data along an existing dimension (source)
- PODAAC CONCISE (optional): Concatenates netCDF data along a newly created dimension (source)
- CASPER (optional): Converts SAMBAH NetCDF output files into CSV files by grouping the variables based on common dimensions and outputting each group into a separate CSV file. (source)
Known Limitations
- Panoply is unable to plot results when coordinate arrays contain null values on the edges.
- Polygons and other Shapefile formats are not supported for subsetting. Support for this is in development.
- A request for a single granule proceeds through the entire chain. Thus, the following modifications are made even though the data are not concatenated: (i) the filename is changed to the granule's collection ID + "_merged" and (ii) a history attribute is added.
- Converting large NetCDF files can result in CSV files that are too large to be opened in Excel. For this reason, CONCISE cannot be used in the SAMBAH processing chain if CSV output is requested. If both concatenation and CSV output are included in the Harmony request, the concatenation request will be ignored.
Missions supported
The SAMBAH service chain is currently configured to work only with data collections from the Tropospheric Emissions: Monitoring of Pollution (TEMPO) mission.
References
- Service (UMM-S) record ID in CMR: S2940253910-LARC_CLOUD
- Service versions in Harmony Production: https://harmony.earthdata.nasa.gov/versions
- Harmony API - Service Capabilities: https://harmony.earthdata.nasa.gov/docs#service-capabilities