ARVOR Floats and SVP Drifters Ocean Converters #1009
base: main
Conversation
This update introduces a new set of general-purpose CSV utilities to `parse_args_mod` for use across DART observation converters and other modules that ingest ASCII/tabular data. New utilities added:

- `csv_file_type`: cached CSV handle storing filename, nrows, ncols, delimiter, and header fields.
- `csv_open`/`csv_close`: initialize/reset the CSV handle and preload the header and dimensions.
- `csv_get_field_char`, `csv_get_field_int`, `csv_get_field_real`: unified interface through `csv_get_field` for retrieving column strings, integers, or reals.
- Normalization of delimiters (`,` or `;`) with support for empty fields.
- `csv_get_obs_num`: count data rows (excluding the header).
- `csv_find_field`: header lookup.
- Other internal helpers such as `split_fields`, `detect_delim`, and `normalize_delims`.

These routines provide a reusable framework modeled after our existing NetCDF utilities.
A new ocean converter for ARVOR profiling floats. The converter harvests temperature and salinity observations at different depths and times. Depths are converted from pressure in dbar to height in meters (see the sketch below). The converter uses the CSV parsing utilities to read data from the raw input files.
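To illustrate the dbar-to-height step, here is a minimal sketch. Everything in it is an assumption for illustration: the routine name `dbar_to_height` is hypothetical, and the simple 1 dbar ≈ 1 m rule stands in for whatever formula the converter actually uses (a latitude-dependent formula such as Saunders, 1981 is common for profiling floats):

```fortran
! Hypothetical sketch of the dbar -> height conversion; the real
! converter may use a more accurate, latitude-dependent formula.
elemental function dbar_to_height(pressure_dbar) result(height_m)
   use types_mod, only : r8               ! DART real kind

   real(r8), intent(in) :: pressure_dbar  ! pressure (dbar), positive downward
   real(r8)             :: height_m       ! height (m), negative below surface

   ! 1 dbar of seawater corresponds to roughly 1 m of depth
   height_m = -1.0_r8 * pressure_dbar
end function dbar_to_height
```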
A new ocean converter for SVP surface drifters. It collects SST and surface current data, and uses the CSV parsing utilities to read the incoming ASCII files.
Additional helper routines:

- `csv_get_field_index`: get the column index of a field.
- `csv_field_exists`: check whether a field exists in the file.
- `csv_print_header`: print the field names (my favorite).

Additional debugging statements were added to the converters. A short usage sketch of the CSV interface is shown below.
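A minimal usage sketch, assuming `csv_file_type` and the routines above are use-associated from `parse_args_mod`. The `csv_open` argument list matches a call quoted later in this thread; the argument lists of `csv_get_field`, `csv_get_obs_num`, and `csv_field_exists`, and the `'TEMP'` field name, are assumptions for illustration:

```fortran
! Sketch only: argument lists other than csv_open's are assumed.
type(csv_file_type) :: cf
real(r8)            :: temperature
integer             :: iobs, nobs

call csv_open('arvor_profile.csv', cf, 'example_caller')

nobs = csv_get_obs_num(cf)             ! data rows, header row excluded

if (csv_field_exists(cf, 'TEMP')) then
   do iobs = 1, nobs
      ! real-valued overload resolved through the csv_get_field interface
      call csv_get_field(cf, 'TEMP', iobs, temperature)
      write(*, '(A,I0,A,F8.3)') 'row ', iobs, ': TEMP = ', temperature
   end do
end if

call csv_close(cf)
```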
moha - i'm going to file a review on the code in just a bit, but up front i wanted to say that it's great to pull out the CSV parsing into a module so it can be reused, tested and updated independently of the calling code. if you were willing to do a bit more work on this, i think that the CSV routines are self-contained enough to merit their own separate module. they can call code from the parse module, but i think they're different enough to stand alone. let me know what you think about this. i'll put other more specific comments into my review. also - do you have any tests you used on this code that could be added to the repo?
nancycollins left a comment:
the converters themselves are easy to read and understand, which is good. i had a few comments - the biggest one is probably moving the csv routines to their own module.
```fortran
integer            :: ncols = 0
character          :: delim = ','
character(len=512) :: fields(MAX_NUM_FIELDS)
logical            :: is_open = .false.
```
see my comment in csv_open() about leaving the file open until csv_close is called. if you aren't going to change that behavior then this variable needs a better name. but if you open the file and leave it open, this name then is perfect as is.
I've addressed this, see below
```fortran
cf%delim = detect_delim(line)

call split_fields(line, cf%delim, cf%ncols, cf%fields)
call close_file(iunit)
```
i would leave the file open here in csv_open(), leave it open in all subsequent calls, and close it in csv_close(). you can add the iunit to the same structure and reuse it until close is called. you can call "rewind()" if you need to start reading at the beginning of the file in subsequent calls.
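A sketch of the pattern being suggested, assuming a new `cf%iunit` member, a hypothetical `csv_rewind` helper, and DART's `open_file`/`close_file` utilities:

```fortran
! Open once, stash the unit in the handle, rewind between reads,
! and close only in csv_close(). cf%iunit is an assumed new member.
subroutine csv_open(filename, cf, context)
   character(len=*),    intent(in)    :: filename
   type(csv_file_type), intent(inout) :: cf
   character(len=*),    intent(in)    :: context

   cf%iunit   = open_file(filename, action='read')
   cf%is_open = .true.
   ! ... read header line, detect delimiter, count rows ...
end subroutine csv_open

subroutine csv_rewind(cf)
   type(csv_file_type), intent(inout) :: cf
   rewind(cf%iunit)        ! restart reading from the top of the file
end subroutine csv_rewind

subroutine csv_close(cf)
   type(csv_file_type), intent(inout) :: cf
   if (cf%is_open) call close_file(cf%iunit)
   cf%is_open = .false.
end subroutine csv_close
```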
Done
```fortran
if (io /= 0) then
   write(string1, '(A, I0)') 'Got bad read code from input file, io = ', io
   call error_handler(E_ERR, routine, string1, context)
   call close_file(iunit)
```
nitpick - i'd move this close_file() call up 2 lines, before the write statement. in most cases the call to the error_handler() isn't going to return, so the close_file() call won't execute. it doesn't matter if the program is exiting since all open file handles will be closed by the system, but i think it's clearer to have less code after the error_handler() call to not imply that execution continues.
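Applying the suggested reordering to the excerpt above (only the position of `close_file` changes):

```fortran
if (io /= 0) then
   call close_file(iunit)   ! close first; error_handler(E_ERR, ...) won't return
   write(string1, '(A, I0)') 'Got bad read code from input file, io = ', io
   call error_handler(E_ERR, routine, string1, context)
```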
Done
```fortran
cf%ncols   = 0
cf%delim   = ','
cf%fields  = ''
cf%is_open = .false.
```
if you add iunit to the cf structure, close cf%iunit here.
Done
```fortran
! Detect the delimiter of a CSV file. One should expect a ','
! but certain data files can come with a ';' so
! let's take care of both cases
function detect_delim(line) result(delim)
```
you could add an optional argument to this function that sets the delimiter if the calling code knows what it should be. if the argument isn't present, then you can try to detect it like you do in the code already.
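A sketch of the optional-argument variant being suggested; the `known_delim` argument name and the fallback detection heuristic shown here are assumptions:

```fortran
! Let the caller set the delimiter when it is known; otherwise fall
! back to detection. The heuristic below is illustrative only.
function detect_delim(line, known_delim) result(delim)
   character(len=*),    intent(in) :: line
   character, optional, intent(in) :: known_delim
   character                       :: delim

   if (present(known_delim)) then
      delim = known_delim
      return
   endif

   ! prefer ';' only when it appears and ',' does not
   if (index(line, ';') > 0 .and. index(line, ',') == 0) then
      delim = ';'
   else
      delim = ','
   endif
end function detect_delim
```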
Done
```fortran
file_out         = 'obs_seq.arvor',
obs_error_temp   = 0.02,    ! temperature error standard deviation (C)
obs_error_sal    = 0.02,    ! salinity error standard deviation (PSU)
avg_obs_per_file = 500000,  ! pre-allocation hint
```
i'd say this is more than a 'hint' because i don't see anywhere that the converter can recover if there are more obs than were originally allocated for. maybe use 'limit' instead of 'hint'?
Done
```rst
* - ``avg_obs_per_file``
  - integer
  - ``500000``
  - Estimated number of valid observations per input file. Used only for pre-allocation.
```
maybe second sentence should be something like 'Used for pre-allocation. Number of files times this number must be larger than the total number of output observations.'
Done
```rst
* - ``avg_obs_per_file``
  - integer
  - ``500000``
  - Estimate of valid obs per file.
```
Add a second sentence something like 'Used for pre-allocation. Number of files times this number must be larger than the total number of output observations.'
Done
```fortran
file_out         = 'obs_seq.svp',  ! output obs_seq file
obs_error_sst    = 0.20_r8,        ! SST error (C)
obs_error_vel    = 0.10_r8,        ! U/V error (m/s)
avg_obs_per_file = 500000,         ! pre-allocation hint
```
maybe 'limit' rather than 'hint'.
Done
```fortran
! Open csv file and get dims
call csv_open(filename, cf, routine)
nobs = cf%nrows
```
ditto the comment about an accessor function here. i think this is the only one missing.
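A sketch of the missing accessor; the `csv_get_nrows` name is hypothetical:

```fortran
! Accessor so callers don't reach into the handle's internals;
! 'nobs = cf%nrows' above would become 'nobs = csv_get_nrows(cf)'.
function csv_get_nrows(cf) result(nrows)
   type(csv_file_type), intent(in) :: cf
   integer                         :: nrows
   nrows = cf%nrows
end function csv_get_nrows
```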
Done
Nancy, thanks for the review. I should be able to address all of the comments. I'll also move the routines to their own module as suggested. The data I used for testing can be found here:
These are the same as those listed in obs_files.txt (in my PR description). Do you want me to add some of those ASCII files to the repo?
hi moha - thanks. no, i don't think they need to be added to the repo. i just wanted to see some of the input files so maybe i could make a couple of simple test programs that mimic what the read routines are expected to parse.
i made a small test program and pushed it to my fork of your code here: https://github.com/nancycollins/moha/tree/insitu_ocean_converters/developer_tests/utilities it's called csv_read_test.f90 (and a corresponding update to work/quickbuild.sh). i think it should be added to your pull request but i'm rusty with github so i left it there in my repo. it works fine in a couple test cases but the csv field read code doesn't cope correctly with embedded blanks in data fields (test 3 fails).
i went back to the parse_args_mod and made a new routine get_csv_words_from_string() which must be passed a string and a delimiter character, and it returns a word count and word array. it handles embedded blanks and quoted fields so they can contain the delimiter character inside the field. i pushed this to my fork and also added a parse_csv_test.f90 and parse_csv_test.in test for this.
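The routine's full interface isn't quoted in this thread; here is a sketch of the quoted-field splitting it describes, with assumed argument names and declarations (quote characters are left in place in this sketch):

```fortran
! Sketch of delimiter splitting with quoted-field support, as
! get_csv_words_from_string() is described above. Argument names and
! declarations are assumptions; see parse_args_mod for the real interface.
subroutine get_csv_words_from_string(line, delim, nwords, words)
   character(len=*), intent(in)  :: line
   character,        intent(in)  :: delim
   integer,          intent(out) :: nwords
   character(len=*), intent(out) :: words(:)

   integer :: i, start
   logical :: in_quotes

   nwords    = 0
   start     = 1
   in_quotes = .false.

   do i = 1, len_trim(line)
      if (line(i:i) == '"') then
         in_quotes = .not. in_quotes          ! toggle quoted state
      else if (line(i:i) == delim .and. .not. in_quotes) then
         nwords = nwords + 1                  ! delimiter outside quotes ends a field
         words(nwords) = adjustl(line(start:i-1))
         start = i + 1
      endif
   end do

   nwords = nwords + 1                        ! trailing field
   words(nwords) = adjustl(line(start:len_trim(line)))
end subroutine get_csv_words_from_string
```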
Stripped all CSV routines from `parse_args_mod` and added them to their own CSV module. Improved the opening and closing logic: the file is now opened once and rewound for reading different variables. The contents of the CSV file structure are now private, and the necessary accessor functions were added to retrieve data.
Cleaned up `parse_args_mod` and slightly modified the new converter code to use the new `read_csv_mod`. Also made small readme changes.
Nancy's pull request to Moha's pull request is here:
Description:
This PR adds two new in-situ ocean converters (ARVOR profiling floats and SVP surface drifters). It also introduces a reusable CSV parsing utility in `parse_args_mod`. Both converters make use of this CSV interface, which simplifies the code. In addition, documentation has been added for the converters. The CSV parsing utilities build on the existing parsing infrastructure (as a wrapper). The functionality mimics our NetCDF handling in the sense that a file is opened, and data is accessed with a single call before the file is closed. A few helper functions have also been added; these can be used to access the header, to inquire whether a field exists, to find the dimensions, etc.
Types of changes
Documentation changes needed?
Tests
Tested both converters using actual raw ASCII data files.
Checklist for merging
Checklist for release
Testing Datasets
ARVOR:
/glade/derecho/scratch/gharamti/inacawo/DART/observations/obs_converters/ARVOR/work/obs_files.txt

SVP:
/glade/derecho/scratch/gharamti/inacawo/DART/observations/obs_converters/SVP/work/obs_files.txt