Fix #11236 - CsvParser and ScheduleFile handling of edge cases to avoid crash #11249

jmarrec · 2025-10-02T16:38:36Z

Pull request overview

Fixes Schedule:File crash #11236

Description of the purpose of this PR

Improve CsvParser:
- Skip extra columns (but register a warning)
- Emplace a null when two consecutive delimiters are found (but register a warning)
- Neither of these issues is fatal yet: as long as you aren't requesting a column that is past the last parsed one, or trying to get a number from a null entry, you're fine for now
Improve ScheduleManager:
- print the new warnings
- Gracefully fatal out:
  - if trying to dereference a column with a null
  - if trying to access a column that is past the number of columns we parsed (instead of throwing a cryptic nlohmann json message)
- Schedule:File:Shading: fatal out in the edge case where the number of headers is greater than the number of columns actually parsed.
Add lots of tests for ScheduleManager, and a new test file CsvParser.unit.cc to test out lower level functionality
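
To illustrate the "gracefully fatal" idea above, here is a minimal sketch of a checked lookup that reports a readable message rather than letting a low-level exception escape. It is not the ScheduleManager code: `Column`, `checkedValue`, and the message wording are hypothetical, and `std::optional<double>` stands in for a JSON null.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <vector>

// Hypothetical column store: each cell is a number, or std::nullopt for a blank (null) cell.
using Column = std::vector<std::optional<double>>;

// Returns the value at (col, row). On failure, returns std::nullopt and fills `error`
// with a message the caller can turn into a fatal error.
std::optional<double> checkedValue(const std::vector<Column> &columns, std::size_t col, std::size_t row, std::string &error)
{
    if (col >= columns.size()) {
        error = "Requested column " + std::to_string(col + 1) + " but only " + std::to_string(columns.size()) + " columns were parsed.";
        return std::nullopt;
    }
    const Column &column = columns[col];
    if (row >= column.size()) {
        error = "Requested row " + std::to_string(row + 1) + " but column " + std::to_string(col + 1) + " only has " +
                std::to_string(column.size()) + " rows.";
        return std::nullopt;
    }
    if (!column[row].has_value()) {
        error = "Column " + std::to_string(col + 1) + ", row " + std::to_string(row + 1) + " is blank (null) and cannot be read as a number.";
        return std::nullopt;
    }
    error.clear();
    return column[row];
}
```

A caller would check the returned optional and, when it is empty, pass `error` to whatever fatal-error routine it uses.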


…ull values (two consecutive delimiters) These situations are not yet fatal, until you try to use a column with a null value or a column that is past the parsed one.

…edge cases

…ber of columns of values

…n't exist

jmarrec · 2025-10-02T16:39:02Z

src/EnergyPlus/InputProcessing/CsvParser.cc

+std::vector<std::pair<std::string, bool>> const &CsvParser::warnings()
+{
+    return warnings_;
+}
+
+bool CsvParser::hasWarnings()
+{
+    return !warnings_.empty();
+}


Putting back warnings in CsvParser

jmarrec · 2025-10-02T16:40:07Z

src/EnergyPlus/InputProcessing/CsvParser.cc

+            if (column_num < num_columns) {
+                columns.at(column_num).push_back(parse_value(csv, index));
+            } else {
+                // Just parse and ignore the value
+                parse_value(csv, index);
+                has_extra_columns = true;
+            }


Avoid crashing here when a row has more values than the number of columns we determined by parsing the first data row (after the header, if present).

We set has_extra_columns to true here so that we can register a warning.
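
As a rough, standalone illustration of the same idea (not the actual CsvParser code), the sketch below splits a row on a delimiter, stores fields only while a matching column exists, and flags the rest. `RowResult` and `appendRow` are hypothetical names, and the splitting is a plain `std::string::find` loop rather than the token-based parser.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical per-row bookkeeping, loosely mirroring parsed_values / has_extra_columns.
struct RowResult
{
    std::size_t parsed_values = 0;
    bool has_extra_columns = false;
};

// Split `line` on `delim` and append each field to its column.
// Fields beyond columns.size() are counted but not stored.
RowResult appendRow(std::vector<std::vector<std::string>> &columns, const std::string &line, char delim = ',')
{
    RowResult result;
    std::size_t start = 0;
    while (true) {
        std::size_t const end = line.find(delim, start);
        std::string field = line.substr(start, end == std::string::npos ? std::string::npos : end - start);
        if (result.parsed_values < columns.size()) {
            columns[result.parsed_values].push_back(std::move(field));
        } else {
            // No column to store this in: skip it and remember that we did
            result.has_extra_columns = true;
        }
        ++result.parsed_values;
        if (end == std::string::npos) {
            break;
        }
        start = end + 1;
    }
    return result;
}
```

With two columns, a row such as `1,2,3` leaves both columns one entry longer and sets `has_extra_columns`, which the caller can then turn into a warning.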

jmarrec · 2025-10-02T16:41:18Z

src/EnergyPlus/InputProcessing/CsvParser.cc

+            if (has_extra_columns) {
+                warnings_.emplace_back(
+                    fmt::format("CsvParser - Line {} - Expected {} columns, got {}. Ignored extra columns. Error in following line.",
+                                this_cur_line_num,
+                                num_columns,
+                                parsed_values),
+                    false);
+                warnings_.emplace_back(getCurrentLine(), true);
+            } else if (parsed_values != num_columns) {
                success = false;

-                size_t found_index = csv.find_first_of("\r\n", this_beginning_of_line_index);
-                std::string line;
-                if (found_index != std::string::npos) {
-                    line = csv.substr(this_beginning_of_line_index, found_index - this_beginning_of_line_index);
-                }
                errors_.emplace_back(
                    fmt::format(
                        "CsvParser - Line {} - Expected {} columns, got {}. Error in following line.", this_cur_line_num, num_columns, parsed_values),
                    false);
-                errors_.emplace_back(line, true);
+                errors_.emplace_back(getCurrentLine(), true);


When we reach the end of the line, we issue a warning if has_extra_columns is set; otherwise, we register an error if the number of parsed values does not match the expected number of columns.
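
The end-of-line decision can be summarized in a few lines. This is only a sketch of the branching order seen in the diff; `RowStatus` and `classifyRow` are hypothetical names that do not exist in CsvParser.

```cpp
#include <cstddef>

enum class RowStatus
{
    Ok,
    Warning, // extra columns were skipped; parsing continues
    Error    // column count mismatch that is not explained by extra columns
};

// The extra-columns check comes first, so a row with too many values is
// downgraded to a warning instead of failing the parse.
RowStatus classifyRow(std::size_t num_columns, std::size_t parsed_values, bool has_extra_columns)
{
    if (has_extra_columns) {
        return RowStatus::Warning;
    }
    if (parsed_values != num_columns) {
        return RowStatus::Error;
    }
    return RowStatus::Ok;
}
```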

jmarrec · 2025-10-02T16:42:28Z

src/EnergyPlus/InputProcessing/CsvParser.cc

        } else if (token == Token::DELIMITER) {
            next_token(csv, index);
+            token = look_ahead(csv, index);
+            if (token == Token::DELIMITER) {
+                // Two delimiters in a row means a blank value
+                // This is not yet an error, in case the user is not using this column... It will crash later if they do try to cast it to a number
+                size_t const next_col = column_num + 1;
+                if (next_col < num_columns) {
+                    // Push a nan for blank value
+                    columns.at(next_col).push_back(json::value_t::null);
+                    warnings_.emplace_back(fmt::format("CsvParser - Line {} Column {} - Blank value found, setting to null. Error in following line.",
+                                                       this_cur_line_num,
+                                                       next_col + 1),
+                                           false);
+                    warnings_.emplace_back(getCurrentLine(), true);
+                } else {
+                    has_extra_columns = true;
+                }
+                ++parsed_values;
+            }


In the Delimiter case, we scan ahead to check if another delimiter is coming up, in which case we emplace a null.

This is a warning, because unless you actually try to use that column, it's fine.
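
For readers unfamiliar with the pattern, here is a simplified sketch of turning blank fields into nulls. It is not the look-ahead logic from the diff: `splitWithNulls` is a hypothetical helper, `std::nullopt` stands in for a JSON null, and unlike the parser it also treats a blank first or last field as null.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Split `line` on `delim`; an empty field (e.g. between two consecutive delimiters)
// becomes std::nullopt rather than an empty string.
std::vector<std::optional<std::string>> splitWithNulls(const std::string &line, char delim = ',')
{
    std::vector<std::optional<std::string>> fields;
    std::size_t start = 0;
    while (true) {
        std::size_t const end = line.find(delim, start);
        std::string field = line.substr(start, end == std::string::npos ? std::string::npos : end - start);
        if (field.empty()) {
            fields.emplace_back(std::nullopt);
        } else {
            fields.emplace_back(std::move(field));
        }
        if (end == std::string::npos) {
            break;
        }
        start = end + 1;
    }
    return fields;
}
```

Keeping the null in place preserves row alignment across columns, so a blank cell only becomes a problem for a consumer that actually reads that column.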

jmarrec · 2025-10-02T16:42:59Z

src/EnergyPlus/ScheduleManager.cc

+                    for (const auto &[warning, isContinued] : csvParser.warnings()) {
+                        if (isContinued) {
+                            ShowContinueError(state, warning);
+                        } else {
+                            ShowWarningError(state, warning);
+                        }
+                    }


Schedule:File:Shading: print the new warnings, if any.
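
The (message, isContinued) pair convention can be tried outside EnergyPlus with a small stand-in. In this sketch `renderWarnings` is a hypothetical function and the two prefixes are placeholders; the real code calls ShowWarningError and ShowContinueError as shown in the diff.

```cpp
#include <string>
#include <utility>
#include <vector>

// Turn (message, isContinued) pairs into log lines: a fresh warning gets a
// "Warning: " prefix, a continuation line gets an indented marker.
std::vector<std::string> renderWarnings(const std::vector<std::pair<std::string, bool>> &warnings)
{
    std::vector<std::string> lines;
    lines.reserve(warnings.size());
    for (const auto &[message, isContinued] : warnings) {
        lines.push_back((isContinued ? "  ~ " : "Warning: ") + message);
    }
    return lines;
}
```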

jmarrec · 2025-10-02T16:48:09Z