CDRIVER-3775 mongoc_structured_log #1795

ghost · 2024-11-19T02:53:10Z

This is a revival of an old pull request by @alcaeus to add structured logging to the C driver. (#684)

Structured logging is a new subsystem, independent from unstructured logging. The original PR introduced structured logging with a global scope, similar in design to unstructured logging. This version redesigns it to be per-client or per-pool. Internally, there is a structured logging "instance" owned by the mongoc_topology_t. A separate public mongoc_structured_log_opts_t object provides a configuration API, and the opts can then be passed on to the client or client pool in order to apply new log settings.

The internal API has been redesigned. The previous PR required each new type of log message to have functions and structures associated with that specific message format. In this redesign, I've split the single callback into a variable length list of callbacks, and added a set of macros to make it straightforward to build these function tables. This has some nice properties:

Submitting a log entry still doesn't require any deep copies, json serialization, or any memory allocation, only a stack-allocated table of references.
Now the mongoc_structured_log() call site explicitly names the keys that are included in the resulting document.
Now it's easy to define ad-hoc log entries or add new values to existing log entries.
The list-based approach also makes it easy to define reusable building blocks. This is immediately useful for command redaction and for server descriptions.

Here's a sample invocation:

   mongoc_structured_log (
      client->topology->structured_log,
      MONGOC_STRUCTURED_LOG_LEVEL_DEBUG,
      MONGOC_STRUCTURED_LOG_COMPONENT_COMMAND,
      "Command started",
      int32 ("requestId", client->cluster.request_id),
      server_description (server_stream->sd, SERVER_HOST, SERVER_PORT, SERVER_CONNECTION_ID, SERVICE_ID),
      utf8_n ("databaseName", cursor->ns, cursor->dblen),
      utf8 ("commandName", "getMore"),
      int64 ("operationId", cursor->operation_id),
      bson_as_json ("command", &doc));

The macros are explained by doc comments in mongoc-structured-log-private.h.
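
For readers unfamiliar with the pattern, the table-of-callbacks idea described above can be sketched in standalone C. This is only an illustration under stated assumptions: every `sketch_*` and `SKETCH_*` name is hypothetical, the output is a simplified JSON-like string with no escaping, and the real macros in mongoc-structured-log-private.h are different and considerably more capable.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical names throughout; this is not the libmongoc implementation. */
typedef struct sketch_item sketch_item_t;

/* Each callback appends one key/value pair and returns snprintf's result. */
typedef int (*sketch_append_fn) (const sketch_item_t *item, char *out, size_t out_len);

struct sketch_item {
   sketch_append_fn fn; /* NULL terminates the table */
   const char *key;
   const char *str; /* referenced, never copied */
   int64_t num;
};

static int
sketch_append_utf8 (const sketch_item_t *item, char *out, size_t out_len)
{
   /* No JSON escaping here; a real serializer would need it. */
   return snprintf (out, out_len, ",\"%s\":\"%s\"", item->key, item->str);
}

static int
sketch_append_int64 (const sketch_item_t *item, char *out, size_t out_len)
{
   return snprintf (out, out_len, ",\"%s\":%" PRId64, item->key, item->num);
}

/* Walk the table in order; stop early if the output buffer is full. */
static void
sketch_log_with_table (char *out, size_t out_len, const char *message, const sketch_item_t *table)
{
   assert (out && out_len > 0 && message && table);
   int n = snprintf (out, out_len, "{\"message\":\"%s\"", message);
   if (n < 0 || (size_t) n >= out_len) {
      return;
   }
   size_t used = (size_t) n;
   for (const sketch_item_t *item = table; item->fn; item++) {
      n = item->fn (item, out + used, out_len - used);
      if (n < 0 || (size_t) n >= out_len - used) {
         return;
      }
      used += (size_t) n;
   }
   snprintf (out + used, out_len - used, "}");
}

/* Item macros expand to table rows, so the call site names every key. */
#define SKETCH_UTF8(k, v) {sketch_append_utf8, (k), (v), 0}
#define SKETCH_INT64(k, v) {sketch_append_int64, (k), NULL, (v)}

/* Builds the table on the stack; requires at least one item. */
#define SKETCH_LOG(out, out_len, message, ...)                                          \
   do {                                                                                 \
      const sketch_item_t sketch_table_[] = {__VA_ARGS__, {NULL, NULL, NULL, 0}};       \
      sketch_log_with_table ((out), (out_len), (message), sketch_table_);               \
   } while (0)
```

A call such as `SKETCH_LOG (buf, sizeof buf, "Command started", SKETCH_UTF8 ("commandName", "getMore"), SKETCH_INT64 ("operationId", 42));` fills `buf` without any heap allocation, and adding a new value to an entry is one more argument at the call site.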

This PR includes updated unified tests from the command logging and monitoring spec, which now pass. This required several other changes to the unified test runner.

Contents:

Implement and document a structured logging facility
Structured logging items for command and reply redaction
Unified test runner support for: createEntities, observeLogMessages, waitForEvent, $$matchAsDocument, $$matchAsRoot
Add serverDescriptionChangedEvent to the unified test runner and unify its two event serialization systems
Private serialization utilities, *_append_contents_to_bson
Sync unified tests from the command-logging-and-monitoring spec
Enough command logging to pass the CLAM tests
bson-dsl support for oid() values
Private utilities for dealing with zero'ed oids
Minor drive-by cleanup

CDRIVER-3775 is the epic for structured logging in general. This PR currently covers:

Follow-up changes will address the remaining parts of CDRIVER-3775. In particular, these major items are out of scope for this PR:

Re-evaluation of JSON serialization for large documents, payloads, ellipsis, and character boundaries (includes CDRIVER-4814)
Prose tests for command logging
Cleanup of legacy OP_KILL_CURSORS code (now CDRIVER-5823)
SDAM logging, and associated refactoring
Server selection logging, and associated refactoring
CDRIVER-4560
CDRIVER-4566
CDRIVER-4758
CDRIVER-4812
CDRIVER-5717

kevinAlbs

Great work. LGTM with minor comments addressed. I like the use of atomics to avoid repeated logs from invalid environment variable values.
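
As a rough illustration of the guard pattern mentioned here (not the code in this PR), a warn-once check can be built on a C11 `atomic_flag`. The function names and the idea of parsing a numeric setting are assumptions made for the sketch; the caller would pass in whatever `getenv()` returned.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper: prints the warning only for the first caller.
 * Returns true if this call printed it. */
static bool
sketch_warn_once (atomic_flag *guard, const char *warning)
{
   assert (guard && warning);
   /* test_and_set returns the previous state, which is false exactly once. */
   if (!atomic_flag_test_and_set (guard)) {
      fprintf (stderr, "warning: %s\n", warning);
      return true;
   }
   return false;
}

/* Hypothetical parser for a numeric setting read from the environment.
 * Invalid text falls back to the default and warns at most once per guard. */
static long
sketch_parse_setting (const char *value, long fallback, atomic_flag *guard)
{
   if (!value) {
      return fallback; /* variable not set */
   }
   char *end = NULL;
   long parsed = strtol (value, &end, 10);
   if (end == value || *end != '\0' || parsed < 0) {
      sketch_warn_once (guard, "ignoring invalid value in environment variable");
      return fallback;
   }
   return parsed;
}
```

With one static guard per variable, many threads can create clients concurrently and an invalid value is still reported a single time.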

src/libmongoc/examples/example-structured-log.c

src/libmongoc/src/mongoc/mongoc-client-pool.c

src/libmongoc/src/mongoc/mongoc-cmd.c

src/libmongoc/src/mongoc/mongoc-structured-log.c

Co-authored-by: Kevin Albertson <[email protected]>

The printf() is meant to stand in for a stream of some sort, but on its own doesn't motivate thread safety because stdio has its own locking. We could use a non-threadsafe file or queue, but Kevin suggested a counter and I like how simple this is. The code now includes a counter, and the comments mention that it could represent something more complex like a file or queue.
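
The shape of that idea, reduced to a standalone sketch: the handler below takes a plain string rather than the driver's log entry type, and all `sketch_*` names are hypothetical. It is not the code in example-structured-log.c, only an illustration of why the counter needs a lock while printf() does not.

```c
#include <assert.h>
#include <pthread.h>
#include <stdio.h>

/* Hypothetical handler state shared by every thread that logs. */
typedef struct {
   pthread_mutex_t mutex;
   unsigned long entry_count; /* stand-in for a non-thread-safe file or queue */
} sketch_handler_state_t;

static void
sketch_handler_state_init (sketch_handler_state_t *state)
{
   assert (state);
   pthread_mutex_init (&state->mutex, NULL);
   state->entry_count = 0;
}

static void
sketch_handler_state_destroy (sketch_handler_state_t *state)
{
   assert (state);
   pthread_mutex_destroy (&state->mutex);
}

/* May be called from any thread. The counter is only touched under the mutex;
 * printf() is called outside it because stdio does its own locking. */
static void
sketch_handler (const char *message, void *user_data)
{
   sketch_handler_state_t *state = user_data;
   assert (state && message);

   pthread_mutex_lock (&state->mutex);
   unsigned long n = ++state->entry_count;
   pthread_mutex_unlock (&state->mutex);

   printf ("log entry %lu: %s\n", n, message);
}
```

Replacing the counter with a file handle or queue would keep the same structure: acquire the mutex, touch the shared resource, release.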

I had an earlier change which tried to move cmd serialization to mongoc-cmd, but I had to move it back to structured-log due to dependencies on the structured logging options. This reverts an include I missed when reverting the earlier move.

src/libmongoc/src/mongoc/mongoc-structured-log.c

The immediate provocation here is to work around what appears to be a clang static analyzer bug, but it's a nice cleanup anyhow. The noreturn in timegm was unused, but I found it when adding the new BSON_NORETURN needed for the assert implementation.

ghost · 2024-12-18T19:07:15Z

Re the scan-build-macos-14-arm64-clang failure that remains on evergreen, it seems to me like some of the build hosts may have an inconsistent environment? The failing runs here are unable to locate a scan-build binary. On the master branch runs that succeed, the script locates a locally installed scan-build from homebrew.

kevinAlbs · 2024-12-18T20:03:36Z

build hosts may have an inconsistent environment?

Seems unexpected. Asked on DEVPROD-12817.

jmikola

Updating copyright dates on headers is probably the only necessary change (and maybe the bson_t formatting in docs if you care about that).

I'll leave a provisional LGTM to avoid holding this up further.

src/libmongoc/doc/mongoc_structured_log_entry_message_as_bson.rst

src/libmongoc/doc/mongoc_structured_log_opts_new.rst

src/libmongoc/src/mongoc/mongoc-structured-log.c

src/libmongoc/src/mongoc/mongoc-cluster.c

#1826) This PR is a follow-up for #1795, #1821, and #1816. The new internal string and json APIs are used to implement missing functionality and tests for the structured logging subsystem.

* Adds public APIs for the max_document_length within a mongoc_structured_log_opts_t.
* Addresses the remainder of CDRIVER-4485: JSON documents within structured log messages are truncated correctly according to the rules set out in the Logging specification.
* Addresses the remainder of CDRIVER-4486 by implementing prose tests.
* Addresses CDRIVER-4814. Command or payload data beyond the truncation limit no longer degrades logging performance. (New serializer for command-with-payload)
* Unified tests use the suggested max document length of 10000 from the spec
* Replace atomic counter with atomic flag for environment error guards
* Tests for environment defaults now skip if relevant variables are set externally
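
To make the truncation concern concrete, here is a minimal sketch of cutting a UTF-8 string at a byte limit without splitting a code point, then appending an ellipsis. It assumes valid UTF-8 input and a "round down to the previous boundary" policy; both the policy and the `sketch_*` names are assumptions for illustration, not the rules in the Logging specification or the implementation in #1826.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Returns how many leading bytes of s (length len) to keep so that the result
 * is at most max_len bytes and does not end in the middle of a code point.
 * Assumes s is valid UTF-8. */
static size_t
sketch_utf8_truncate_len (const char *s, size_t len, size_t max_len)
{
   assert (s);
   if (len <= max_len) {
      return len;
   }
   size_t cut = max_len;
   /* If the first dropped byte is a continuation byte (10xxxxxx), the cut
    * would split a code point, so back up to that code point's lead byte. */
   while (cut > 0 && ((unsigned char) s[cut] & 0xC0) == 0x80) {
      cut--;
   }
   return cut;
}

/* Writes the possibly truncated text to out, adding "..." only if bytes
 * were dropped. */
static void
sketch_truncate_json (const char *json, size_t max_len, char *out, size_t out_len)
{
   assert (json && out && out_len > 0);
   size_t len = strlen (json);
   size_t keep = sketch_utf8_truncate_len (json, len, max_len);
   snprintf (out, out_len, "%.*s%s", (int) keep, json, keep < len ? "..." : "");
}
```

For example, the six-byte string "h\xc3\xa9llo" with a limit of 2 keeps only "h", because byte 2 would otherwise land inside the two-byte "é".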

alcaeus added 30 commits November 18, 2024 15:38

WIP: first stab at structured logging

593089e

Make context generation lazy to save on resources

dfee17a

Remove syslog and log to stderr

826707f

Don't clear context_data

6896714

Update signatures

1518a7a

Explicitly track whether context was already built

a945695

Fix coding style

370b757

Extract command logging and add structs

79e52cb

Refactor signatures and log all command results

de7b41e

Keep default logger private

411502a

Rename context to message

c5b85e6

Add tests for structured logging

06f50fe

Run clang-format

de6370a

Update copyright years

405c4f6

Add rudimentary documentation for new public API

cb88c6d

Fix wrong log message for command failed log entry

e23f5ae

Add spec compliant default logger

b463f82

Run clang-format

2e49e0d

Log client creation

78f1970

Fix suggestions from code review

4b580f0

Remove unused variable

5aa50e0

Remove union in favour of void *

810d3fd

Use lowercase test names

a8be342

Free allocated memory in tests

2e73cb4

Fix wrong location for imports

c264c7e

And that's why you shouldn't trust your IDE to correctly add include statements

Attach OP_MSG document sequences when logging commands

eae3b51

Remove mixed declaration and code

42d50c0

Fix single-line comments

794d51b

Fix more single line comments

bfc69a1

Update logged fields to latest spec version

4e846ef

kevinAlbs approved these changes Dec 16, 2024

micah and others added 9 commits December 16, 2024 08:15

Update src/libmongoc/src/mongoc/mongoc-structured-log.c

7f0f43c

Co-authored-by: Kevin Albertson <[email protected]>

Update src/libmongoc/src/mongoc/mongoc-structured-log.c

42a247b

Co-authored-by: Kevin Albertson <[email protected]>

Merge branch 'master' into CDRIVER-3775

c371ee6

Return error on *_set_structured_log_opts misuse

7cbfbae

asserts for mongoc_structured_log_opts_set_max_level_for_all_components

e334612

asserts for mongoc_structured_log_opts_set_max_level_for_component

014bc8d

Include errorLabels in logged client-side command failures

20f9890

kevinAlbs reviewed Dec 17, 2024

src/libmongoc/src/mongoc/mongoc-structured-log.c

Micah Scott added 5 commits December 17, 2024 08:17

Merge branch 'master' into CDRIVER-3775

c80b4df

Fix out-of-date comment about test_is_suppressing_structured_logs

9338b95

make ignored error in mongoc_structured_log_opts_new very clear

a75f151

More case insensitive: docs, unlimited length

5f13444

Merge branch 'master' into CDRIVER-3775

4633353

jmikola approved these changes Jan 8, 2025

ghost removed the request for review from vector-of-bool January 8, 2025 19:44

Micah Scott added 5 commits January 8, 2025 11:55

missing bson_t hyperlink in doc

932b73b

Warning about multiple instances writing to a shared log file

4d910a7

typo in comment

2d853e7

Typo in test runner error message

f40496c

Merge branch 'master' into CDRIVER-3775

eb793c0

ghost merged commit 6e5c6be into mongodb:master Jan 9, 2025
45 checks passed

ghost deleted the CDRIVER-3775 branch January 9, 2025 04:09

ghost mentioned this pull request Jan 15, 2025

CDRIVER-4485 string append and truncation fixes for structured logging #1826

Merged

This pull request was closed.
