ResourceManager Metrics

In addition to these base metrics, many aggregate metrics are available. If an entity type has parents defined, you can formulate all possible aggregate metrics using the formula base_metric_across_parents.

In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for ResourceManager:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters

Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.

For more information about metrics, see Cloudera Manager Metrics and Metric Aggregation.

Metric Name Description Unit Parents CDH Version
alerts_rate The number of alerts. events per second cluster, rack, yarn CDH 4, CDH 5
allocate_avg_time Allocate RPC call average time. ms cluster, rack, yarn CDH 4, CDH 5
allocate_rate The total number of allocate RPC calls. operations per second cluster, rack, yarn CDH 4, CDH 5
cgroup_cpu_system_rate CPU usage of the role's cgroup seconds per second cluster, rack, yarn CDH 4, CDH 5
cgroup_cpu_user_rate User Space CPU usage of the role's cgroup seconds per second cluster, rack, yarn CDH 4, CDH 5
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes cluster, rack, yarn CDH 4, CDH 5
cgroup_mem_rss Resident memory of the role's cgroup bytes cluster, rack, yarn CDH 4, CDH 5
cgroup_mem_swap Swap usage of the role's cgroup bytes cluster, rack, yarn CDH 4, CDH 5
cgroup_read_bytes_rate Bytes read from all disks by the role's cgroup bytes per second cluster, rack, yarn CDH 4, CDH 5
cgroup_read_ios_rate Number of read I/O operations from all disks by the role's cgroup ios per second cluster, rack, yarn CDH 4, CDH 5
cgroup_write_bytes_rate Bytes written to all disks by the role's cgroup bytes per second cluster, rack, yarn CDH 4, CDH 5
cgroup_write_ios_rate Number of write I/O operations to all disks by the role's cgroup ios per second cluster, rack, yarn CDH 4, CDH 5
cpu_system_rate Total System CPU seconds per second cluster, rack, yarn CDH 4, CDH 5
cpu_user_rate Total CPU user time seconds per second cluster, rack, yarn CDH 4, CDH 5
events_critical_rate The number of critical events. events per second cluster, rack, yarn CDH 4, CDH 5
events_important_rate The number of important events. events per second cluster, rack, yarn CDH 4, CDH 5
events_informational_rate The number of informational events. events per second cluster, rack, yarn CDH 4, CDH 5
fd_max Maximum number of file descriptors file descriptors cluster, rack, yarn CDH 4, CDH 5
fd_open Open file descriptors. file descriptors cluster, rack, yarn CDH 4, CDH 5
finish_application_master_avg_time Finish ApplicationMaster average time. ms cluster, rack, yarn CDH 4, CDH 5
finish_application_master_rate Finish ApplicationMaster operations. operations per second cluster, rack, yarn CDH 4, CDH 5
force_kill_application_avg_time Force Kill Application Average Time ms cluster, rack, yarn CDH 4, CDH 5
force_kill_application_rate Force Kill Application Operations operations per second cluster, rack, yarn CDH 4, CDH 5
gc_count_concurrent_mark_sweep_rate The number of garbage collections by the Concurrent Mark Sweep Collector. garbage collections per second cluster, rack, yarn CDH 4, CDH 5
gc_count_par_new_rate The number of garbage collections by the Parallel Collector. garbage collections per second cluster, rack, yarn CDH 4, CDH 5
gc_time_ms_concurrent_mark_sweep_rate The total time spent in garbage collections by the Concurrent Mark Sweep Collector. ms per second cluster, rack, yarn CDH 4, CDH 5
gc_time_ms_par_new_rate The total time spent in garbage collections by the Parallel Collector. ms per second cluster, rack, yarn CDH 4, CDH 5
get_all_applications_avg_time Get All Applications Average Time ms cluster, rack, yarn CDH 4
get_all_applications_rate Get All Applications Operations operations per second cluster, rack, yarn CDH 4
get_application_report_avg_time Get application report average time. ms cluster, rack, yarn CDH 4, CDH 5
get_application_report_rate Get application report operations. operations per second cluster, rack, yarn CDH 4, CDH 5
get_cluster_metrics_avg_time Get Cluster Metrics Average Time ms cluster, rack, yarn CDH 4, CDH 5
get_cluster_metrics_rate Get Cluster Metrics Operations operations per second cluster, rack, yarn CDH 4, CDH 5
get_cluster_nodes_avg_time Get cluster nodes average time. ms cluster, rack, yarn CDH 4, CDH 5
get_cluster_nodes_rate Get cluster nodes operations operations per second cluster, rack, yarn CDH 4, CDH 5
get_delegation_token_avg_time Get Delegation Token Average Time ms cluster, rack, yarn CDH 4, CDH 5
get_delegation_token_rate Get Delegation Token Operations operations per second cluster, rack, yarn CDH 4, CDH 5
get_groups_for_user_avg_time Get Groups For User Average Time ms cluster, rack, yarn CDH 4, CDH 5
get_groups_for_user_rate Get Groups For User Operations operations per second cluster, rack, yarn CDH 4, CDH 5
get_hadoop_groups_avg_time Average Time to get Hadoop group for the user ms cluster, rack, yarn CDH 4, CDH 5
get_hadoop_groups_rate Get Hadoop User Operations operations per second cluster, rack, yarn CDH 4, CDH 5
get_new_application_avg_time Get new application average time. ms cluster, rack, yarn CDH 4, CDH 5
get_new_application_rate Get new application operations. operations per second cluster, rack, yarn CDH 4, CDH 5
get_queue_user_acls_avg_time Get Queue User ACLs Average Time ms cluster, rack, yarn CDH 4, CDH 5
get_queue_user_acls_rate Get Queue User ACLs Operations operations per second cluster, rack, yarn CDH 4, CDH 5
health_bad_rate Percentage of Time with Bad Health seconds per second cluster, rack, yarn CDH 4, CDH 5
health_concerning_rate Percentage of Time with Concerning Health seconds per second cluster, rack, yarn CDH 4, CDH 5
health_disabled_rate Percentage of Time with Disabled Health seconds per second cluster, rack, yarn CDH 4, CDH 5
health_good_rate Percentage of Time with Good Health seconds per second cluster, rack, yarn CDH 4, CDH 5
health_unknown_rate Percentage of Time with Unknown Health seconds per second cluster, rack, yarn CDH 4, CDH 5
jvm_blocked_threads Blocked threads threads cluster, rack, yarn CDH 4, CDH 5
jvm_gc_rate Number of garbage collections garbage collections per second cluster, rack, yarn CDH 4, CDH 5
jvm_gc_time_ms_rate Total time spent garbage collecting. ms per second cluster, rack, yarn CDH 4, CDH 5
jvm_heap_committed_mb Total amount of committed heap memory. MB cluster, rack, yarn CDH 4, CDH 5
jvm_heap_used_mb Total amount of used heap memory. MB cluster, rack, yarn CDH 4, CDH 5
jvm_max_memory_mb Maximum allowed memory. MB cluster, rack, yarn CDH 4, CDH 5
jvm_new_threads New threads threads cluster, rack, yarn CDH 4, CDH 5
jvm_non_heap_committed_mb Total amount of committed non-heap memory. MB cluster, rack, yarn CDH 4, CDH 5
jvm_non_heap_used_mb Total amount of used non-heap memory. MB cluster, rack, yarn CDH 4, CDH 5
jvm_runnable_threads Runnable threads threads cluster, rack, yarn CDH 4, CDH 5
jvm_terminated_threads Terminated threads threads cluster, rack, yarn CDH 4, CDH 5
jvm_timed_waiting_threads Timed waiting threads threads cluster, rack, yarn CDH 4, CDH 5
jvm_waiting_threads Waiting threads threads cluster, rack, yarn CDH 4, CDH 5
log_error_rate Logged Errors messages per second cluster, rack, yarn CDH 4, CDH 5
log_fatal_rate Logged Fatals messages per second cluster, rack, yarn CDH 4, CDH 5
log_info_rate Logged Infos messages per second cluster, rack, yarn CDH 4, CDH 5
log_warn_rate Logged Warnings messages per second cluster, rack, yarn CDH 4, CDH 5
login_failure_avg_time Average Failed Login Time ms cluster, rack, yarn CDH 4, CDH 5
login_failure_rate Login Failures operations per second cluster, rack, yarn CDH 4, CDH 5
login_success_avg_time Average Successful Login Time ms cluster, rack, yarn CDH 4, CDH 5
login_success_rate Login Successes operations per second cluster, rack, yarn CDH 4, CDH 5
mem_rss Resident memory used bytes cluster, rack, yarn CDH 4, CDH 5
mem_swap Amount of swap memory used by this role's process. bytes cluster, rack, yarn CDH 4, CDH 5
mem_virtual Virtual memory used bytes cluster, rack, yarn CDH 4, CDH 5
metrics_dropped_pub_all Dropped Metrics Updates By All Sinks updates cluster, rack, yarn CDH 4, CDH 5
metrics_num_active_sinks Active Metrics Sinks Count sinks cluster, rack, yarn CDH 4, CDH 5
metrics_num_active_sources Active Metrics Sources Count sources cluster, rack, yarn CDH 4, CDH 5
metrics_num_all_sinks All Metrics Sinks Count sinks cluster, rack, yarn CDH 4, CDH 5
metrics_num_all_sources All Metrics Sources Count sources cluster, rack, yarn CDH 4, CDH 5
metrics_publish_avg_time Metrics Publish Average Time ms cluster, rack, yarn CDH 4, CDH 5
metrics_publish_rate Metrics Publish Operations operations per second cluster, rack, yarn CDH 4, CDH 5
metrics_snapshot_avg_time Metrics Snapshot Average Time ms cluster, rack, yarn CDH 4, CDH 5
metrics_snapshot_rate Metrics Snapshot Average Operations operations per second cluster, rack, yarn CDH 4, CDH 5
node_heartbeat_avg_time NodeManager Heartbeat Average Time ms cluster, rack, yarn CDH 4, CDH 5
node_heartbeat_rate NodeManager Heartbeat Operations operations per second cluster, rack, yarn CDH 4, CDH 5
oom_exits_rate The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. exits per second cluster, rack, yarn CDH 4, CDH 5
read_bytes_rate The number of bytes read from the device bytes per second cluster, rack, yarn CDH 4, CDH 5
refresh_admin_acls_avg_time Refresh Admin ACLs Average Time ms cluster, rack, yarn CDH 4, CDH 5
refresh_admin_acls_rate Refresh Admin ACLs Operations operations per second cluster, rack, yarn CDH 4, CDH 5
refresh_nodes_avg_time Refresh Nodes Average Time ms cluster, rack, yarn CDH 4, CDH 5
refresh_nodes_rate Refresh Nodes Operations operations per second cluster, rack, yarn CDH 4, CDH 5
refresh_service_acls_avg_time Refresh Service ACLs Average Time ms cluster, rack, yarn CDH 4, CDH 5
refresh_service_acls_rate Refresh Service ACLs Operations operations per second cluster, rack, yarn CDH 4, CDH 5
refresh_super_user_groups_configuration_avg_time Super User Groups Configuration Average Time ms cluster, rack, yarn CDH 4, CDH 5
refresh_super_user_groups_configuration_rate Super User Groups Configuration Operations operations per second cluster, rack, yarn CDH 4, CDH 5
refresh_user_to_groups_mappings_avg_time Refresh User to Groups Mapping Average Time ms cluster, rack, yarn CDH 4, CDH 5
refresh_user_to_groups_mappings_rate Refresh User to Groups Mapping Operations operations per second cluster, rack, yarn CDH 4, CDH 5
register_application_master_avg_time Register ApplicationMaster average time. ms cluster, rack, yarn CDH 4, CDH 5
register_application_master_rate Register ApplicationMaster operations. operations per second cluster, rack, yarn CDH 4, CDH 5
register_node_manager_avg_time NodeManager Registration Average Time ms cluster, rack, yarn CDH 4, CDH 5
register_node_manager_rate NodeManager Registration Operations operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_authentication_failures_rate RPC Authentication Failures operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_authentication_successes_rate RPC Authentication Successes operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_authorization_failures_rate RPC Authorization Failures operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_authorization_successes_rate RPC Authorization Successes operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_call_queue_length RPC Call Queue Length items cluster, rack, yarn CDH 4, CDH 5
rpc_num_open_connections Open RPC Connections connections cluster, rack, yarn CDH 4, CDH 5
rpc_processing_time_avg_time Average RPC Processing Time ms cluster, rack, yarn CDH 4, CDH 5
rpc_processing_time_rate RPCs Processed operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_queue_time_avg_time Average RPC Queue Time ms cluster, rack, yarn CDH 4, CDH 5
rpc_queue_time_rate RPCs Queued operations per second cluster, rack, yarn CDH 4, CDH 5
rpc_received_bytes_rate RPC Received Bytes bytes per second cluster, rack, yarn CDH 4, CDH 5
rpc_sent_bytes_rate RPC Sent Bytes bytes per second cluster, rack, yarn CDH 4, CDH 5
submit_application_avg_time Submit application average time. ms cluster, rack, yarn CDH 4, CDH 5
submit_application_rate Submit application operations. operations per second cluster, rack, yarn CDH 4, CDH 5
unexpected_exits_rate The number of times the role's backing process exited unexpectedly. exits per second cluster, rack, yarn CDH 4, CDH 5
uptime For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. seconds cluster, rack, yarn CDH 4, CDH 5
write_bytes_rate The number of bytes written to the device bytes per second cluster, rack, yarn CDH 4, CDH 5
cancel_delegation_token_avg_time Cancel Delegation Token Average Time ms cluster, rack, yarn CDH 5
cancel_delegation_token_rate Cancel Delegation Token Operations operations per second cluster, rack, yarn CDH 5
get_applications_avg_time Get Applications Average Time ms cluster, rack, yarn CDH 5
get_applications_rate Get Applications Operations operations per second cluster, rack, yarn CDH 5
renew_delegation_token_avg_time Renew Delegation Token Average Time ms cluster, rack, yarn CDH 5
renew_delegation_token_rate Renew Delegation Token Operations operations per second cluster, rack, yarn CDH 5