3DT is a monitoring agent which exposes a HTTP API for querying from the /system/health/v1 DC/OS api. 3DT puller collects the data from 3dt agents and represents individual node health for things like system resources as well as DC/OS-specific services.
go get github.com/dcos/3dt
cd $GOPATH/src/github.com/dcos/3dt
make install
./3dt -version
Run 3DT once, on a DC/OS host to check systemd units:
3dt -diag
Get verbose log output:
3dt -diag -verbose
Run the 3DT aggregation service to query all cluster hosts for health state:
3dt -pull
Start the 3DT health API endpoint:
3dt
-agent-port int Use TCP port to connect to agents. (default 1050) -ca-cert string Use certificate authority. -command-exec-timeout int Set command executing timeout (default 120) -diag Get diagnostics output once on the CLI. Does not expose API. -diagnostics-bundle-dir string Set a path to store diagnostic bundles (default "/var/run/dcos/3dt/diagnostic_bundles") -diagnostics-job-timeout int Set a global diagnostics job timeout (default 720) -diagnostics-units-since string Collect systemd units logs since (default "24 hours ago") -diagnostics-url-timeout int Set a local timeout for every single GET request to a log endpoint (default 2) -endpoint-config string Use endpoints_config.json (default "/opt/mesosphere/endpoints_config.json") -exhibitor-ip string Use Exhibitor IP address to discover master nodes. (default "http://127.0.0.1:8181/exhibitor/v1/cluster/status") -force-tls Use HTTPS to do all requests. -health-update-interval int Set update health interval in seconds. (default 60) -master-port int Use TCP port to connect to masters. (default 1050) -port int Web server TCP port. (default 1050) -pull Try to pull checks from DC/OS hosts. -pull-interval int Set pull interval in seconds. (default 60) -pull-timeout int Set pull timeout. (default 3) -verbose Use verbose debug output. -version Print version.
- Test Changes