Chapter-5: Create netCDF for profile Data

The CF Conventions introduced the following representations for profiles:

Single profile (H.3.3): The file contains only one station and a single profile.
Orthogonal multidimensional array representation of profiles (H.3.1): Multiple profiles have the same number of vertical levels, and the coordinate values are identical. As with time series, this representation can also hold profiles with different vertical coordinate values, at the cost of less efficient use of storage space.
Incomplete multidimensional array representation of profiles (H.3.2): Multiple profiles have the same number of vertical levels, but the coordinate values differ. For such data, this representation uses storage space more efficiently than H.3.1.
Contiguous ragged array representation of profiles (H.3.4): Multiple profiles have different lengths along the vertical axis, and the dataset is complete (no new observations will be added). For this type of data, it is the most storage-efficient representation.
Indexed ragged array representation of profiles (H.3.5): Multiple profiles have different lengths along the vertical axis, and the dataset is not complete (it will be updated as new measurements are made).
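The difference between the two ragged layouts is easiest to see on a toy example. The sketch below uses made-up depth values (not the CLAPPP data) and plain NumPy. The names `row_size` and `profile_index` are illustrative, chosen to echo the count and index variables described in the CF appendix.

```python
import numpy as np

# Synthetic example: three profiles with 3, 2 and 4 depth levels.
depths = [
    np.array([1.0, 2.0, 3.0]),
    np.array([1.0, 5.0]),
    np.array([0.5, 1.5, 2.5, 3.5]),
]

# Contiguous ragged array (H.3.4): the observations of each profile are
# stored back to back, and a per-profile count records each length.
z_obs = np.concatenate(depths)
row_size = np.array([len(d) for d in depths])

# Indexed ragged array (H.3.5): every observation instead carries the index
# of the profile it belongs to, so observations can be appended in any order.
profile_index = np.repeat(np.arange(len(depths)), row_size)

# Recover the second profile (index 1) from each layout.
offsets = np.concatenate([[0], np.cumsum(row_size)])
from_contiguous = z_obs[offsets[1]:offsets[2]]
from_indexed = z_obs[profile_index == 1]

print(row_size, profile_index)
print(from_contiguous, from_indexed)
```

In a CF file, the count variable would carry a `sample_dimension` attribute and the index variable an `instance_dimension` attribute; those attributes are omitted here to keep the sketch short.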

As we can see, profiles and time series share common representations; the difference is that profile data are organized along the vertical axis rather than the time axis. The previous tutorial showed how to build netCDF files for time series data in each of these forms. This chapter provides hands-on exercises for creating netCDF files for profile data. Try the exercises on your own first; if you run into problems, take a look at the sample solutions.

For the exercises, you will need two datasets from CLAPPP: New Caledonian lagoons: CTD Profiles [5]. The sample solutions use the datasets “prony-1.nc” and “teremba-1.nc”.

When you have the data, try the following exercises:

Each downloaded dataset contains a single profile that is already CF-compliant. If you are using the same datasets as the sample solutions, subset them so that they share the same depth axis, then merge the subsets into one netCDF file in the form given in Appendix H.3.1 of the CF Conventions. Each profile contains many data variables, so consider writing loops to simplify the processing.
Subset both datasets so that they have the same length along the depth axis but different depth coordinates, then merge the data in the form given in Appendix H.3.2 of the CF Conventions.
The two downloaded profiles actually have different lengths along the vertical (depth) axis. Merge them in the form given in Appendix H.3.4 of the CF Conventions.
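Before working with the real files, it may help to see the first two target layouts on synthetic data. The following is a minimal sketch, not the sample solution. The helper `make_profile`, the variable name `TEMP`, and all depth and temperature values are invented for illustration; the real datasets use their own variable names.

```python
import numpy as np
import xarray as xr


def make_profile(depth, offset):
    """Build a synthetic single-profile dataset (stand-in for a real file)."""
    depth = np.asarray(depth, dtype=float)
    return xr.Dataset(
        {"TEMP": ("depth", 25.0 - 0.1 * depth + offset)},
        coords={"depth": depth},
    )


p1 = make_profile([1, 2, 3, 4, 5], offset=0.0)
p2 = make_profile([2, 3, 4, 5, 6, 7], offset=0.5)

# H.3.1 style: keep only the depth levels present in both profiles, then
# stack along a new "profile" dimension so that depth stays one-dimensional.
common = np.intersect1d(p1["depth"].values, p2["depth"].values)
orthogonal = xr.concat(
    [p1.sel(depth=common), p2.sel(depth=common)], dim="profile"
)

# H.3.2 style: same number of levels but different depth values, so the
# depth coordinate becomes two-dimensional with dims (profile, z).
n = 5
subsets = [p1.isel(depth=slice(0, n)), p2.isel(depth=slice(0, n))]
incomplete = xr.Dataset(
    {"TEMP": (("profile", "z"), np.stack([s["TEMP"].values for s in subsets]))},
    coords={
        "depth": (
            ("profile", "z"),
            np.stack([s["depth"].values for s in subsets]),
            {"axis": "Z", "positive": "down", "units": "m"},  # assumed metadata
        )
    },
    attrs={"featureType": "profile"},
)

print(orthogonal["TEMP"].dims, orthogonal["TEMP"].shape)
print(incomplete["depth"].values)
```

The orthogonal result has a single shared `depth` axis, while the incomplete result stores one row of depth values per profile. With the real data, the same pattern would be applied to every data variable, which is where a loop becomes useful.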

import os
from glob import glob
import numpy as np
import pandas as pd
import xarray as xr
import matplotlib.pyplot as plt

# List the available profile datasets. Adjust the path below to where you saved the files.
os.chdir('../src/data')
pf_files = glob(os.path.join(os.getcwd(), "dsg_profile", "*.nc"))
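`glob` returns matches in no guaranteed order, so selecting files by position can be fragile. One option, sketched below, is to key each path by its file name. The paths and the helper `index_by_stem` are hypothetical; in practice the list found by `glob` would be passed in instead of `example_files`.

```python
import os

# Hypothetical paths standing in for the result of the glob call.
example_files = [
    "/data/dsg_profile/teremba-1.nc",
    "/data/dsg_profile/prony-1.nc",
]


def index_by_stem(paths):
    """Map each file's base name (without extension) to its full path."""
    if not paths:
        raise FileNotFoundError("no .nc files matched; check the data directory")
    return {os.path.splitext(os.path.basename(p))[0]: p for p in sorted(paths)}


files_by_name = index_by_stem(example_files)
print(files_by_name["prony-1"])
```

Raising an error on an empty list makes a wrong data directory visible immediately rather than in a later cell.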
