blog

ClusterControl runtime configuration options

Sebastian Insausti

Published: July 20, 2021
Last Updated: May 4, 2022

When you install ClusterControl, it has a default configuration that maybe doesn’t fit your requirements so probably you will need to customize this installation. For this, you can modify the configuration files, but you can also check or modify the runtime ClusterControl settings. In this blog, we will show you where you can see this configuration and what are the available options to use here.

Where can you see the ClusterControl Runtime Configuration?

There are two different ways to check this. First, you can go to ClusterControl -> Global Settings -> Runtime Configurations, then choose your Cluster.

Another way is ClusterControl -> Select Cluster -> Settings -> Runtime Configurations.

In both cases, you will go to the same place, the Runtime Configuration section.

Runtime Configuration Parameters

Now, let’s see these parameters one by one. Keep in mind that these parameters depend on the database technology that you are using, so most probably you won’t see all of them at the same time in the same cluster.

Backup

Name	Default Value	Description
disable_backup_email	false	This setting controls if emails are sent or not if a backup is finished or failed.
backup_user	backupuser	The username of the database account used for managing backups.
backup_create_hash	true	Configures ClusterControl if it has to calculate md5hash on the created backup files and verify them.
pitr_retention_hours	0	Retention hours (to erase old WAL archive logs) for PITR.
netcat_port	9999,9990-9998	List of Netcat ports and port ranges used to stream backups. Defaults to ‘9999,9990-9998’ and port 9999 will be preferred if available.
backupdir	/home/user/backups	The default backup directory, to be pre-filled in Frontend.
backup_subdir	BACKUP-%I	Set the name of the backup subdirectory. This string may hold standard “%X” field separators, the “%06I” for example will be replaced by the numerical ID of the backup in 6 field-wide format that uses ‘0’ as leading fill characters. Here is the list of fields the backend currently supports: – B The date and time when the backup creation was beginning. – H The name of the backup host, the host that created the backup. – i The numerical ID of the cluster. – I The numerical ID of the backup. – J The numerical ID of the job that created the backup. – M The backup method (e.g. “mysqldump”). – O The name of the user who initiated the backup job. – S The name of the storage host, the host that stores the backup files. – % The percent sign itself. Use two percent signs, “%%” the same way the standard printf() function interprets it as one percent sign.
backup_retention	31	The setting of how many days to keep the backups. Backups matching retention period are removed.
backup_cloud_retention	180	The setting of how many days to keep the backups uploaded to a cloud. Backups matching retention period are removed.
backup_n_safety_copies	1	The setting of how many completed full backups will be kept regardless of their retention status.

Cluster

Name	Default Value	Description
cluster_name		The name of the cluster for easy identification.
enable_node_autorecovery	true	Node auto-recovery setting.
enable_cluster_autorecovery	true	If true, ClusterControl will perform cluster auto-recovery, if false no cluster recovery will be done automatically.
configdir	/etc/	The database server config directory.
created_by_job		The ID of the job created this cluster.
ssh_keypath	/home/user/.ssh/id_rsa	The SSH key file used for connection to nodes.
server_selection_try_once	true	MongoDB connection URI option. Defines if server selection should be repeated on failure until a server selection timeout expires, or just return with failure at once.
server_selection_timeout_ms	30000	MongoDB connection URI option. Defines the timeout value till mongodriver should try to do a successful server selection operation.
owner		The ClusterControl user ID of the owner of the cluster object.
group_owner		The ClusterControl group ID of the group that owns the cluster object.
cdt_path		The location of the cluster object in the ClusterControl Directory Tree.
tags	/	A set of strings the user can specify.
acl		The Access Control List as a string controlling the access to the cluster object.
mongodb_user	admindb	The MongoDB username.
mongodb_basedir	/usr/	The basedir for MongoDB installation.
mysql_basedir	/usr/	The basedir for MySQL installation.
scriptdir	/usr/bin/	The scripts dir of MySQL installation.
staging_dir	/home/user/s9s_tmp	A staging path for temporary files.
bindir	/usr/bin	The /bin directory of the MySQL installation.
monitored_mysql_port	3306	The monitored MySQL server’s port number.
ndb_connectstring	127.0.0.1:1186	The NDB connect string setting for MySQL Cluster.
ndbd_datadir		The datadir of the NDBD nodes.
mgmd_datadir		The datadir of the NDB MGMD nodes.
os_user		The SSH username used for accessing nodes.
repl_user	cmon_replication	The replication username.
vendor		The database vendor name used for deployments.
galera_version		The used Galera version number.
server_version		The used database server version for deployments.
postgresql_user	admindb	The PostgreSQL user name.
galera_port	4567	The galera port to be used when adding nodes/garbd, and constructing wsrep_cluster_address. Do not change at runtime.
auto_manage_readonly	true	Allow ClusterControl to manage the read-only flag of the managed MySQL servers.
node_recovery_lock_file		Specify a lock file and if present on a node, the node will not recover. It is the responsibility of the administrator to create/remove the file.

Cmondb

Name	Default Value	Description
cmon_db	cmon	The local ClusterControl database name.
cmondb_hostname	127.0.0.1	The local ClusterControl database MySQL server hostname.
mysql_port	3306	The local ClusterControl database MySQL server port.
cmon_user	cmon	The account name for accessing the local ClusterControl database.

Controller

Name	Default Value	Description
controller_id	5a3a993d-xxxx	An arbitrary identifier string of this controller instance.
cmon_hostname	192.168.xx.xx	The controller hostname.
error_report_dir	/home/user/s9s_tmp	Storage location of error reports.

Long_query

Name	Default Value	Description
long_query_time	0.5	Threshold value for slow query checking.
query_monitor_alert_long_running_query	true	Raises an alarm if a query is executed for longer than query_monitor_long_running_query_ms.
query_monitor_kill_long_running_query	false	Kill the query if the query executed for longer than query_monitor_long_running_query_ms.
query_monitor_long_running_query_time_ms	30000	Raises an alarm if a query is executed for longer than query_monitor_long_running_query_ms. The minimum value is 1000.
query_monitor_long_running_query_matching_info		Match only queries with an ‘Info’ only matching this POSIX regex. No default value, match any Info.
query_monitor_long_running_query_matching_info_negate	false	Negate the result of query_monitor_long_running_query_matching_info.
query_monitor_long_running_query_matching_host		Match only queries with a ‘Host’ only matching this POSIX regex. No default value, matches any Host.
query_monitor_long_running_query_matching_db		Match only queries with a ‘Db’ only matching this POSIX regex. No default value, matches any Db.
query_monitor_long_running_query_matching_user		Match only queries with a ‘User’ only matching this POSIX regex. No default value, matches any User.
query_monitor_long_running_query_matching_user_negate	false	Negate the result of query_monitor_long_running_query_matching_user.
query_monitor_long_running_query_matching_command	Query	Match only queries with a ‘Command’ only matching this POSIX regex. Defaults to ‘Query’.

Replication

Name	Default Value	Description
max_replication_lag	10	Max allowed replication lag in seconds before sending an Alarm.
replication_stop_on_error	true	Controls if the failover/switchover procedures should fail if errors are encountered that may cause data loss.
replication_auto_rebuild_slave	false	If the SQL THREAD is stopped and the error code is non-zero then the slave will be automatically rebuilt.
replication_failover_blacklist		Comma-separated list of hostname: port pairs. Blacklisted servers will not be considered as a candidate during failover. replication_failover_blacklist is ignored if replication_failover_whitelist is set.
replication_failover_whitelist		Comma-separated list of hostname: port pairs. Only whitelisted servers will be considered as a candidate during failover. If no server on the whitelist is available (up/connected) the failover will fail. replication_failover_blacklist is ignored if replication_failover_whitelist is set.
replication_onfail_failover_script		This script is executed as soon as it has been discovered that failover is needed. If the script returns non-zero or does not exist, the failover will be aborted. Four arguments are supplied to the script and set if they are known, else empty: arg1=’all servers’ arg2=’failed master’ arg3=’selected candidate’, arg4=’slaves of oldmaster (the candidates)’ and passed like this: ‘scripname arg1 arg2 arg3 arg4’ The script must be accessible on the controller and executable.
replication_pre_failover_script		This script is executed before the failover happens, but after a candidate has been elected and it is possible to continue the failover process. If the script returns non-zero or does not exist, the failover will be aborted. Four arguments are supplied to the script and set if they are known, else empty: arg1=’all servers’ arg2=’failed master’ arg3=’selected candidate’, arg4=’slaves of oldmaster (the candidates)’ and passed like this: ‘scripname arg1 arg2 arg3 arg4’ The script must be accessible on the controller and executable.
replication_post_failover_script		This script is executed after the failover happens ( a new master is elected and up and running). If the script returns non-zero or does not exist, the failover will be aborted. Four arguments are supplied to the script and set if they are known, else empty.: arg1=’all servers’ arg2=’failed master’ arg3=’selected candidate’, arg4=’slaves of oldmaster (the candidates)’ and passed like this: ‘scripname arg1 arg2 arg3 arg4’ The script must be accessible on the controller and executable.
replication_post_unsuccessful_failover_script		This script is executed if the failover attempt fails. If the script returns non-zero or does not exist, the failover will be aborted. Four arguments are supplied to the script and set if they are known, else empty.: arg1=’all servers’ arg2=’failed master’ arg3=’selected candidate’, arg4=’slaves of oldmaster (the candidates)’ and passed like this: ‘scripname arg1 arg2 arg3 arg4’ The script must be accessible on the controller and executable.

Retention

Name	Default Value	Description
ops_report_retention	31	The setting of how many days to keep operational reports. Reports matching retention period are removed.

Sampling

Name	Default Value	Description
enable_icmp_ping	true	Toggles if ClusterControl shall measure the ICMP ping times to the host.
host_stats_collection_interval	30	Setting for the host (CPU, memory, etc) collection interval.
host_stats_window_size	180	Setting the window size (in seconds) to examine stats to raise/clear host stats alarms.
db_stats_collection_interval	30	Setting for database stats collection interval.
db_proc_stats_collection_interval	5	Setting for database process stats collection interval. Min allowed value is 1 second. Requires a cmon service restart.
lb_stats_collection_interval	15	Setting for load balancer stats collection interval.
db_schema_stats_collection_interval	108000	Setting for schema stats monitoring interval.
db_deadlock_check_interval	0	How often to check for deadlocks. Specified in seconds. Deadlock detection will affect CPU usage on database nodes.
log_collection_interval	600	Controls the interval between logfile collections.
db_hourly_stats_collection_interval	5	Controls how many seconds are between every individual sample in the hourly range statistics.
monitored_mountpoints		The list of mount points to be monitored.
monitor_cpu_temperature	false	Monitor CPU temperature.
log_queries_not_using_indexes	false	Set the query monitor to detect queries not using indexes.
query_sample_interval	1	Controls the query monitor interval in seconds, -1 means no query monitoring.
query_monitor_auto_purge_ps	false	If enabled, the P_S table events_statements_summary_by_digest will be auto-purged (TRUNCATE TABLE) every hour.
schema_change_detection_address		Checks will be executed (using SHOW TABLES/SHOW CREATE TABLE) to determine if the schema has changed. The checks are executed on the address specified and are of the format HOSTNAME:PORT. The schema_change_detection_databases must also be set. A diff of a changed table is created.
schema_change_detection_databases		Comma separated list of databases to monitor for schema changes. If empty, no checks are made.
schema_change_detection_pause_time_ms	0	Pause time in ms between each SHOW CREATE TABLE. The pause time will affect the duration of the detection process.
enable_is_queries	true	Specifies whether queries to the information_schema will be executed or not. Queries to the information_schema may not be suitable when having many schema objects (100s of databases, 100s of tables in each database, triggers, users, events, sprocs). If disabled, the query that would be executed will be logged so it can be determined if the query is suitable in your environment.

Swapping

Name	Default Value	Description
swap_warning	20	Warning alarm threshold for swap usage.
swap_critical	90	Critical alarm threshold for swap usage.
swap_inout_period	0	The interval for swap I/O alarms (<= 0 disables).
swap_inout_warning	10240	The number of pages swapped I/O in the specified interval (swap_inout_period, by default 10 minutes) for warning.
swap_inout_critical	102400	The number of pages swapped I/O in the specified interval (swap_inout_period, by default 10 minutes) for critical.

System

Name	Default Value	Description
cmon_config_path	/etc/cmon.d/cmon_x.cnf	The config file path. This configuration value is read-only.
os	debian/redhat	The OS type. Possible values are ‘debian’ or ‘redhat’.
libssh_timeout	30	The network timeout value for SSH connections.
sudo	sudo -n 2>/dev/null	The command used to obtain superuser privileges.
ssh_port	22	The port for SSH connections to the nodes.
local_repo_name		The used local repository names for cluster deployment.
frontend_url		The URL sent in the emails to direct the recipient to the ClusterControl web interface.
purge	7	How long ClusterControl shall keep data. Measured in days, jobs, job messages, alarms, collected logs, operational reports, database growth information older than this will be deleted.
os_user_home	/home/user	The HOME directory of the user used on nodes.
cmon_mail_sender		The used email sender for sent emails.
plugin_dir		The path of the plugins directory.
use_internal_repos	false	Setting which disabled the 3rd party repository to be set up.
cmon_use_mail	false	Setting to use the ‘mail’ command for e-mailing.
enable_html_emails	true	Enables sending of HTML emails.
send_clear_alarm	true	Toggles the email sending in case of cluster alarms being cleared.
software_packagedir		This is the storage location of software packages, i.e, all necessary files to successfully install a node, if there is no yum/apt repository available, must be placed here. Applies mainly to MySQL Cluster or older Codership/Galera installations.

Threshold

Name	Default Value	Description
ram_warning	80	Warning alarm threshold for RAM usage.
ram_critical	90	Critical alarm threshold for RAM usage.
diskspace_warning	80	Warning alarm threshold for disk usage.
diskspace_critical	90	Critical alarm threshold for disk usage.
cpu_warning	80	Warning alarm threshold for CPU usage.
cpu_critical	90	Critical alarm threshold for CPU usage.
cpu_steal_warning	10	Warning alarm threshold for CPU steal.
cpu_steal_critical	20	Critical alarm threshold for CPU steal.
cpu_iowait_warning	50	Warning alarm threshold for CPU IO Wait.
cpu_iowait_critical	60	Critical alarm threshold for CPU IO Wait.
slow_ssh_warning	6	A Warning alarm will be raised if it takes longer than the specified time to set up an SSH connection (secs).
slow_ssh_critical	12	A Critical alarm will be raised if it takes longer than the specified time to set up an SSH connection (secs).

Conclusion

As you can see, there are many parameters to change if you need to adapt ClusterControl to your workload or business. It could be a time-consuming task to review all the values and change them accordingly, but at the end of the day, it will save time as you can make the most out of all the ClusterControl features.

Subscribe to our newsletter

You’ll get two emails every month full of fresh database ops tips and strategic considerations.

Comparing DevOps tooling approaches: Terraform, Ansible, Chef, Puppet, and DIY scripting

Why Cloud Repatriation Matters Now More Than Ever

Automating Day 2 operations: Scaling, upgrades and maintenance

Clustercontrol adds PostgreSQL Bi-Directional Logical Replication and Valkey Sentinel v8.0 in latest release