Add metric ibm_mq.channel.conns to ibm mq integration, add channel and connection metric tests #20519

mwdd146980 · 2025-06-15T03:44:05Z

What does this PR do?

This PR enhances the IBM MQ integration by adding two new metrics to provide better visibility into channel connections: ibm_mq.channel.conn_status (tracks individual connection status with connection name tags) and ibm_mq.channel.connections_active (counts total active connections per channel). These metrics enable customers to monitor individual channel connections, track connection counts per channel, and improve troubleshooting capabilities for IBM MQ connectivity issues.

Configuration Control: To address potential tag cardinality concerns, this PR introduces a new configuration option collect_connection_metrics (default: false) that allows users to control the collection of the ibm_mq.channel.conn_status metric. When enabled, this metric creates a new connection tag for each unique connection, which can lead to high cardinality in environments with many active connections. The option is disabled by default to prevent unintended tag cardinality issues.

Additionally, this PR significantly enhances the test coverage for channel and connection metrics by:

Adding comprehensive unit tests for channel connection handling in test_channel_metric_collector.py
Testing various connection scenarios including:
- Channels with active connections
- Channels without connections
- Channels with empty connection strings
  Proper tagging of connection metrics
Ensuring proper metric collection and tagging for the new connection metric
Verifying that connection metrics are properly aggregated and reported

Motivation

The motivation behind this PR is to enhance the monitoring capabilities of the IBM MQ integration by providing visibility into active connections per channel. This feature allows users to track connection changes over time and identify which connections are active, which is crucial for maintaining the health and performance of the messaging system.

This was requested in escalation AGENT-13489/FRAGENT-3166 by customer Broadridge (GTO) (org ID: 345886).

Manual QA Steps

Spin up an EC2 VM with AMI Ubu-ddev-docker (required for the correct architecture)
Install IBM MQ server and client libraries
Run pytest tests/test_ibm_mq_unit.py -v for unit tests
Run ddev --no-interactive test ibm_mq for unit tests with ddev containers
Spin up containers with ddev env start ibm_mq py3.12-9-cluster --dev
Run the manual check with with ddev env agent ibm_mq py3.12-9-cluster check and check for the new metrics in the output
Simulate connections with this Python script
- Connections can be simulated with a command like python simulate_mq_conn.py QM1 DEV.ADMIN.SVRCONN localhost 11414 APP.QUEUE.1 "Conn 1"
Check in Datadog for the metrics

Here's what it looked like in my own testing:

Review checklist (to be filled by reviewers)

Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
Add the qa/skip-qa label if the PR doesn't need to be tested during QA.
[not applicable] If you need to backport this PR to another branch, you can add the backport/<branch-name> label to the PR and it will automatically open a backport PR once this one is merged

codecov · 2025-06-17T02:19:58Z

Codecov Report

Attention: Patch coverage is 97.46835% with 2 lines in your changes missing coverage. Please review.

Project coverage is 91.82%. Comparing base (0df4650) to head (49445b4).

Additional details and impacted files

Flag	Coverage Δ
active_directory	`?`
activemq	`?`
activemq_xml	`?`
aerospike	`?`
airflow	`?`
amazon_msk	`?`
ambari	`?`
apache	`?`
appgate_sdp	`?`
arangodb	`?`
argo_rollouts	`?`
argo_workflows	`?`
argocd	`?`
aspdotnet	`?`
avi_vantage	`?`
aws_neuron	`?`
azure_iot_edge	`?`
boundary	`?`
btrfs	`?`
cacti	`?`
calico	`?`
cassandra	`?`
cassandra_nodetool	`?`
celery	`?`
ceph	`?`
cert_manager	`?`
cilium	`?`
cisco_aci	`?`
citrix_hypervisor	`?`
clickhouse	`?`
cloud_foundry_api	`?`
cloudera	`?`
cockroachdb	`?`
confluent_platform	`?`
consul	`?`
coredns	`?`
couch	`?`
couchbase	`?`
crio	`?`
datadog_checks_base	`?`
datadog_checks_dev	`?`
datadog_checks_downloader	`?`
datadog_cluster_agent	`?`
dcgm	`?`
ddev	`?`
directory	`?`
disk	`?`
dns_check	`?`
dotnetclr	`?`
druid	`?`
duckdb	`?`
ecs_fargate	`?`
eks_fargate	`?`
elastic	`?`
envoy	`?`
esxi	`?`
etcd	`?`
exchange_server	`?`
external_dns	`?`
falco	`?`
fluentd	`?`
fluxcd	`?`
fly_io	`?`
foundationdb	`?`
gearmand	`?`
gitlab	`?`
gitlab_runner	`?`
glusterfs	`?`
go_expvar	`?`
gunicorn	`?`
haproxy	`?`
harbor	`?`
hazelcast	`?`
hdfs_datanode	`?`
hdfs_namenode	`?`
hive	`?`
hivemq	`?`
http_check	`?`
hudi	`?`
ibm_ace	`?`
ibm_db2	`?`
ibm_i	`?`
ibm_mq	`91.76% <97.46%> (+0.42%)`	⬆️
ibm_was	`?`
ignite	`?`
iis	`?`
impala	`?`
infiniband	`?`
istio	`?`
jboss_wildfly	`?`
kafka	`?`
kafka_consumer	`?`
karpenter	`?`
keda	`?`
kong	`?`
kube_apiserver_metrics	`?`
kube_controller_manager	`?`
kube_dns	`?`
kube_metrics_server	`?`
kube_proxy	`?`
kube_scheduler	`?`
kubeflow	`?`
kubelet	`?`
kubernetes_cluster_autoscaler	`?`
kubernetes_state	`?`
kubevirt_api	`?`
kubevirt_controller	`?`
kubevirt_handler	`?`
kuma	`?`
kyototycoon	`?`
kyverno	`?`
lighttpd	`?`
linkerd	`?`
linux_proc_extras	`?`
litellm	`?`
mac_audit_logs	`?`
mapr	`?`
mapreduce	`?`
marathon	`?`
marklogic	`?`
mcache	`?`
mesos_master	`?`
milvus	`?`
mongo	`?`
mysql	`?`
nagios	`?`
network	`?`
nfsstat	`?`
nginx	`?`
nginx_ingress_controller	`?`
nvidia_nim	`?`
nvidia_triton	`?`
octopus_deploy	`?`
openldap	`?`
openmetrics	`?`
openstack	`?`
openstack_controller	`?`
pdh_check	`?`
pgbouncer	`?`
php_fpm	`?`
postfix	`?`
postgres	`?`
powerdns_recursor	`?`
presto	`?`
process	`?`
prometheus	`?`
proxysql	`?`
pulsar	`?`
quarkus	`?`
rabbitmq	`?`
ray	`?`
redisdb	`?`
rethinkdb	`?`
riak	`?`
riakcs	`?`
sap_hana	`?`
scylla	`?`
silk	`?`
silverstripe_cms	`?`
singlestore	`?`
slurm	`?`
snmp	`?`
snowflake	`?`
solr	`?`
sonarqube	`?`
sonatype_nexus	`?`
spark	`?`
sqlserver	`?`
squid	`?`
ssh_check	`?`
statsd	`?`
strimzi	`?`
supabase	`?`
supervisord	`?`
system_core	`?`
system_swap	`?`
tcp_check	`?`
teamcity	`?`
tekton	`?`
teleport	`?`
temporal	`?`
teradata	`?`
tibco_ems	`?`
tls	`?`
tomcat	`?`
torchserve	`?`
traefik_mesh	`?`
traffic_server	`?`
twemproxy	`?`
twistlock	`?`
varnish	`?`
vault	`?`
velero	`?`
vertica	`?`
vllm	`?`
voltdb	`?`
vsphere	`?`
weaviate	`?`
weblogic	`?`
win32_event_log	`?`
windows_performance_counters	`?`
windows_service	`?`
wmi_check	`?`
yarn	`?`
zk	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

- Update channel metric collector to use channel_status_metrics() instead of channel_metrics() for discovered channels to properly collect buffers_rcvd metric - Update test assertions to match actual tags being sent in gauge calls - Fix unit tests in test_channel_metric_collector.py to pass This change ensures that channel status metrics like buffers_rcvd are properly collected and reported by the integration.

- Updated get_pcf_channel_metrics to submit configuration metrics instead of status metrics. - Modified unit tests to verify that configuration metrics are collected for channels with empty or no connections. - Added a new test test_channel_status_metrics to ensure status metrics and connection metrics are correctly submitted.

ibm_mq/datadog_checks/ibm_mq/collectors/channel_metric_collector.py

ibm_mq/metadata.csv

- create new metric ibm_mq.channel.connections_active which represents total num of active conns per channel

Review from buraizu is dismissed. Related teams and files:

documentation
- ibm_mq/metadata.csv

…annel.conn_status is only collected if this flag is enabled

Review from buraizu is dismissed. Related teams and files:

documentation
- ibm_mq/assets/configuration/spec.yaml
- ibm_mq/datadog_checks/ibm_mq/data/conf.yaml.example

buraizu

Approving with a minor update requested

ibm_mq/assets/configuration/spec.yaml

ibm_mq/datadog_checks/ibm_mq/data/conf.yaml.example

Co-authored-by: Bryce Eadie <[email protected]>

Review from buraizu is dismissed. Related teams and files:

documentation
- ibm_mq/assets/configuration/spec.yaml

Co-authored-by: Bryce Eadie <[email protected]>

…3166-channel-conns-metric

…ent_scripts/ --fix

add metric ibm_mq.channel.conns, add channel and connection metric tests

a8384b7

mwdd146980 self-assigned this Jun 15, 2025

temporal-github-worker-1 bot added agent/review-requested ecosystems/review-requested product/review-requested labels Jun 15, 2025

datadog-agent-integrations-bot bot added the integration/ibm_mq label Jun 15, 2025

mwdd146980 added 2 commits June 15, 2025 22:25

Add changelog entry

606305c

run black formatting

282e594

mwdd146980 force-pushed the mwdd146980/fragent-3166-channel-conns-metric branch from 8fe0745 to 290b3e5 Compare June 16, 2025 03:35

apply ruff check --config ../pyproject.toml --fix

90c5031

mwdd146980 added 5 commits June 16, 2025 23:19

reformat test_channel_metric_collector.py using black

8422084

add ibm_mq.channel.conns to metadata.csv

9ee02b0

sort metadata.csv by metric name

c60be2f

mwdd146980 marked this pull request as ready for review June 17, 2025 22:05

mwdd146980 requested review from a team as code owners June 17, 2025 22:05

datadog-agent-integrations-bot bot added team/agent-integrations team/documentation labels Jun 17, 2025

buraizu previously approved these changes Jun 17, 2025

View reviewed changes

temporal-github-worker-1 bot added the docs/approved label Jun 17, 2025

steveny91 reviewed Jun 18, 2025

View reviewed changes

ibm_mq/datadog_checks/ibm_mq/collectors/channel_metric_collector.py Show resolved Hide resolved

steveny91 reviewed Jun 18, 2025

View reviewed changes

ibm_mq/metadata.csv Outdated Show resolved Hide resolved

- rename ibm_mq.channel.conns to ibm_mq.channel.conn_status

4ddf802

- create new metric ibm_mq.channel.connections_active which represents total num of active conns per channel

temporal-github-worker-1 bot added docs/review-requested and removed docs/approved labels Jun 26, 2025

Test commit signing

84a59b2

mwdd146980 force-pushed the mwdd146980/fragent-3166-channel-conns-metric branch from 5add087 to 3a5f791 Compare June 27, 2025 03:15

mwdd146980 added 3 commits June 27, 2025 03:28

sorted metadata.csv with ddev validate metadata ibm_mq --sync

d08d25e

fix and sort metadata.csv; many metrics had been removed accidentally

ba4c01d

shorten changelog entry

9bb8165

mwdd146980 requested review from buraizu and steveny91 June 30, 2025 17:29

buraizu previously approved these changes Jun 30, 2025

View reviewed changes

temporal-github-worker-1 bot added docs/approved and removed docs/review-requested labels Jun 30, 2025

add config option collect_connection_metrics so that metric ibm_mq.ch…

b962cb0

…annel.conn_status is only collected if this flag is enabled

temporal-github-worker-1 bot added docs/review-requested and removed docs/approved labels Jun 30, 2025

datadog-agent-integrations-bot bot added the documentation label Jun 30, 2025

mwdd146980 requested a review from buraizu June 30, 2025 22:13

buraizu previously approved these changes Jul 1, 2025

View reviewed changes

ibm_mq/assets/configuration/spec.yaml Outdated Show resolved Hide resolved

ibm_mq/datadog_checks/ibm_mq/data/conf.yaml.example Outdated Show resolved Hide resolved

temporal-github-worker-1 bot added docs/approved and removed docs/review-requested labels Jul 1, 2025

Update ibm_mq/assets/configuration/spec.yaml

e3a1e2e

Co-authored-by: Bryce Eadie <[email protected]>

temporal-github-worker-1 bot added docs/review-requested and removed docs/approved labels Jul 1, 2025

mwdd146980 and others added 8 commits July 1, 2025 17:41

Update ibm_mq/datadog_checks/ibm_mq/data/conf.yaml.example

36f05ad

Co-authored-by: Bryce Eadie <[email protected]>

fix formatting with ddev test --fmt

7e64301

Merge remote-tracking branch 'origin/master' into mwdd146980/fragent-…

932d46c

…3166-channel-conns-metric

format with ddev test --fmt

aa543e3

reformat with ddev test --fmt after upgrading ddev

4495404

move simulate_mq_conn.py, add description, add licensing comment

56e0875

fix license header with ddev validate license-headers ibm_mq/tests/ag…

c02025e

…ent_scripts/ --fix

Merge branch 'master' into mwdd146980/fragent-3166-channel-conns-metric

49445b4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add metric ibm_mq.channel.conns to ibm mq integration, add channel and connection metric tests #20519

Add metric ibm_mq.channel.conns to ibm mq integration, add channel and connection metric tests #20519

Uh oh!

mwdd146980 commented Jun 15, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jun 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

buraizu left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add metric ibm_mq.channel.conns to ibm mq integration, add channel and connection metric tests #20519

Are you sure you want to change the base?

Add metric ibm_mq.channel.conns to ibm mq integration, add channel and connection metric tests #20519

Uh oh!

Conversation

mwdd146980 commented Jun 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Motivation

Manual QA Steps

Review checklist (to be filled by reviewers)

Uh oh!

codecov bot commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

buraizu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mwdd146980 commented Jun 15, 2025 •

edited

Loading

codecov bot commented Jun 17, 2025 •

edited

Loading