Last Updated: October 20, 2018

Monitoring real-time resource usage across CPU, Memory, Disk, and Network.

System Metrics

Maintaining a healthy TigerGraph cluster requires consistent monitoring of resource usage. The gadmin metric command provides a unified interface for this data.

1. Monitoring CPU Usage

You can report CPU usage for a specific service on a specific node, or for every service across the entire cluster.

```bash
# Report CPU for all services on all nodes
gadmin metric -t cpu

# Report CPU specifically for the GPE service on node m1
gadmin metric gpe -t cpu -m m1
```
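To compare the same service across several nodes, the per-node form can be wrapped in a small loop. This is only a sketch: `report_gpe_cpu` is a hypothetical helper name, and the node names you pass in (such as `m1`, `m2`) are placeholders for the nodes in your own cluster.

```bash
# Sketch: report GPE CPU usage node by node.
# report_gpe_cpu is a hypothetical helper; node names are supplied by the caller.
report_gpe_cpu() {
  for node in "$@"; do
    printf '== %s ==\n' "$node"
    # Same invocation as shown above, with the node name substituted.
    gadmin metric gpe -t cpu -m "$node"
  done
}

# Usage: report_gpe_cpu m1 m2 m3
```

Keeping the node list as arguments avoids hard-coding the cluster layout in the script.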

2. Memory Usage & Reporting

TigerGraph measures memory differently depending on the environment:

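| Environment       | Metric Used     | Behavior                                          |
|-------------------|-----------------|---------------------------------------------------|
| Host (Bare Metal) | Resident Memory | Excludes active page cache.                       |
| Container (K8s)   | Working Set     | Includes active page cache (often looks higher).  |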
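The gap between the two figures is easier to see with numbers. The sketch below follows the way a container working set is commonly derived (total cgroup memory usage minus inactive file cache, floored at zero). That derivation is an assumption here: this page does not state how TigerGraph computes its value, and `working_set_bytes` is a hypothetical helper.

```bash
# Sketch of a commonly used working-set calculation for containers:
# total memory charged to the cgroup minus inactive file-backed cache.
# Assumption: not taken from TigerGraph itself; shown only to explain why
# the container figure can exceed resident memory.
working_set_bytes() {
  usage_bytes=$1          # total memory charged to the container
  inactive_file_bytes=$2  # page cache the kernel considers inactive
  if [ "$inactive_file_bytes" -ge "$usage_bytes" ]; then
    echo 0
  else
    echo $((usage_bytes - inactive_file_bytes))
  fi
}

# Usage with made-up numbers: working_set_bytes 8000 1500
```

Because only the inactive cache is subtracted, any active page cache stays in the result, which is why the container figure often looks higher than resident memory on a host.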

Tracking Memory by Query

For deep performance tuning, you can track how much RAM a specific query consumes by searching the GPE logs:

```bash
grep -i "QueryMem" $(gadmin config get System.LogRoot)/gpe/log.INFO
```
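On a busy cluster the GPE log can be large, so it may help to look only at the most recent matches. The following is a minimal sketch: `recent_query_mem` is a hypothetical helper, the default of 20 lines is arbitrary, and nothing is assumed about the layout of the matching log lines.

```bash
# Sketch: show only the latest QueryMem entries from a GPE log file.
# recent_query_mem is a hypothetical helper; the default of 20 lines is arbitrary.
recent_query_mem() {
  log_file=$1
  count=${2:-20}
  grep -i "QueryMem" "$log_file" | tail -n "$count"
}

# Usage:
#   recent_query_mem "$(gadmin config get System.LogRoot)/gpe/log.INFO" 50
```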

3. Disk & Network I/O

Disk Usage

Check if your storage is approaching capacity:

```bash
gadmin metric -t disk
```
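For an operating-system-level cross-check, a threshold filter over standard `df -P` output can flag filesystems that are nearly full. This sketch does not parse `gadmin` output; `disks_over_threshold` is a hypothetical helper and the 80% default is an arbitrary example value.

```bash
# Sketch: print mount points whose usage is at or above a percentage threshold.
# Reads POSIX `df -P` output on stdin. The 80% default is an arbitrary example.
# Limitation: mount points containing spaces are not handled.
disks_over_threshold() {
  threshold=${1:-80}
  awk -v limit="$threshold" 'NR > 1 {
    pct = $5              # Capacity column, for example "85%"
    sub(/%/, "", pct)
    if (pct + 0 >= limit + 0) print $6, $5
  }'
}

# Usage: df -P | disks_over_threshold 90
```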

Network Traffic

Monitor TCP connections and data inflow/outflow:

```bash
gadmin metric -t net
```

The output includes:

  • TCP CONNS: Number of active connections.
  • IN/OUT: Total network traffic since the service started.

4. Critical Thresholds

By default, TigerGraph will abort all queries if system free memory drops below 10% (SysMinFreePct). This protects the system from crashing due to Out-Of-Memory (OOM) errors.
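To get an early warning before that limit is reached, free memory can be checked at the operating-system level. The sketch below reads `MemAvailable` and `MemTotal` from `/proc/meminfo` on Linux. That choice is an assumption: TigerGraph's own free-memory calculation may differ, and `free_mem_pct` and `check_free_mem` are hypothetical helper names.

```bash
# Sketch: percentage of memory the Linux kernel reports as available.
# Assumption: MemAvailable / MemTotal is used as a stand-in for "free memory";
# TigerGraph may calculate its own figure differently.
free_mem_pct() {
  meminfo=${1:-/proc/meminfo}
  awk '/^MemTotal:/     { total = $2 }
       /^MemAvailable:/ { avail = $2 }
       END { if (total > 0) printf "%d\n", avail * 100 / total }' "$meminfo"
}

# Warn when the percentage falls below a limit (10 matches the default above).
check_free_mem() {
  limit=${1:-10}
  pct=$(free_mem_pct "${2:-/proc/meminfo}")
  if [ -z "$pct" ]; then
    echo "could not read memory information" >&2
    return 2
  fi
  if [ "$pct" -lt "$limit" ]; then
    echo "WARNING: free memory ${pct}% is below ${limit}%"
    return 1
  fi
  echo "OK: free memory ${pct}%"
}

# Usage: check_free_mem 15   # warn slightly before the 10% abort limit
```

Checking against a limit somewhat above 10% leaves time to react before queries are aborted.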

[!TIP] Use gadmin status -v to see real-time memory pressure and process IDs for every service.