MySQL Monitoring – What's really needed

May 17, 2010 by ronald

The implementation of MySQL Monitoring is critical for any organization that uses a database and wants to avoid the inevitable disaster. There are 3 important components that all serve a key purpose to “MySQL Monitoring” in general:

Monitoring – Historical and graphical information
Alerting – Tell me when something is wrong
Dashboard – The State of NOW

Monitoring

No single option for Monitoring is significantly better than another. A short list of what’s on offer can be found at http://monitoring-mysql.com/monitoring-products. What’s important is that you have monitoring in place, so you can review past situations, compare across your servers, and better identify physical or database bottlenecks. My recommendation is Cacti, which is packaged with most popular Linux distributions and so can be installed with a single apt-get/yum/rpm command, together with the MySQL Cacti Templates. This is not the best product or combination; in my opinion it is the most common, and it covers all the essential bases.

It’s best to designate a separate web server (i.e. one that is publicly accessible) as the monitoring server rather than installing the web interface on any single DB server.

Alerting

Alerting is key to notifying you of a problem without somebody needing to watch a screen and see it happen via your monitoring. Identifying high CPU load, a disk nearing capacity, database locking, etc. often helps you deal with a developing problem before it becomes some level of disaster. Almost all companies use Nagios, a derivative such as its main fork Icinga, or products that include Nagios such as Opsview or GroundWork.
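As a minimal sketch of what an alert check does, the following maps a sampled value to the exit codes that Nagios plugins conventionally return (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN). The function names, metric name and thresholds are hypothetical and only for illustration; a real check would also collect the value itself.

```python
# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def check_threshold(value, warn, crit):
    """Return a status code for a metric where a higher value is worse."""
    if value is None:
        # The metric could not be collected at all.
        return UNKNOWN
    if value >= crit:
        return CRITICAL
    if value >= warn:
        return WARNING
    return OK


def format_result(name, value, warn, crit):
    """Return (exit_code, one-line message) as an alerting tool would expect."""
    code = check_threshold(value, warn, crit)
    return code, f"{name} {LABELS[code]} - value={value}"


if __name__ == "__main__":
    # Hypothetical example: disk usage in percent, warn at 80, critical at 90.
    code, message = format_result("disk_used_pct", 85, warn=80, crit=90)
    print(message)
    raise SystemExit(code)
```

The exit code is what the alerting system acts on; the message is only for the person who receives the alert.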

Dashboard

While both Monitoring and Alerting are necessary, they both lack a key component necessary for successful administration: timing. Both of these earlier options sample at intervals, e.g. 1 minute or generally 5 minutes (by default), but problems can happen quickly. This is why each organization needs to have a Dashboard. I don’t know of any products here unless you try to adapt a monitoring product, but what’s needed is a very lightweight and very business-centric single page of Green/Yellow/Red statuses for your environment, including databases, web servers, response time, traffic, etc. This is for the state of NOW. A Dashboard should sample every 5 to 10 seconds. I have seen larger and more successful companies with various home-grown implementations. I developed a product for one company, and you can see the screen output in a presentation at http://www.slideshare.net/ronaldbradford/10x-performance-improvements. It included the following on a single page:

5 DB servers monitoring load average, ping time, database connections, active, free, locked, replication availability and lag
5 web servers monitoring load average, ping time, apache connections
Application metrics monitoring 3 different page load times, and page size

It is often important to be able to identify a key problem and drill down to it quickly, rather than starting from the usual “the website is slow” question and repeating the same investigative tasks. You need to automate, and be more proactive in your response, especially to load and locking issues.
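The Green/Yellow/Red idea can be sketched in a few lines. This is not the product described above, only an illustration: the metric names, thresholds and the collect() callback are assumptions, and a real dashboard would render a page rather than print.

```python
import time

# Ordered from best to worst so the worst color can be found by index.
COLORS = ["GREEN", "YELLOW", "RED"]


def color_for(value, yellow_at, red_at):
    """Classify one metric where a higher value is worse."""
    if value >= red_at:
        return "RED"
    if value >= yellow_at:
        return "YELLOW"
    return "GREEN"


def summarize(samples, thresholds):
    """Return per-metric colors plus the worst color as the overall status."""
    colors = {
        name: color_for(value, *thresholds[name])
        for name, value in samples.items()
    }
    overall = max(colors.values(), key=COLORS.index, default="GREEN")
    return colors, overall


def run_dashboard(collect, thresholds, interval_seconds=5, iterations=3,
                  sleep=time.sleep):
    """Call collect() every interval_seconds and print one status line."""
    for _ in range(iterations):
        colors, overall = summarize(collect(), thresholds)
        print(overall, colors)
        sleep(interval_seconds)


if __name__ == "__main__":
    # Hypothetical thresholds: (yellow_at, red_at) per metric.
    limits = {"load_average": (4.0, 8.0), "replication_lag": (30, 120)}
    run_dashboard(lambda: {"load_average": 1.2, "replication_lag": 45},
                  limits, interval_seconds=5, iterations=2)
```

Keeping the overall status as the single worst color is what lets one glance answer "is anything wrong right now?" before drilling down.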

Advanced Monitoring

Information is only half the requirement; it is what you do with this information that determines whether you are proactive rather than reactive. If I were the DBA of a company I’d do even more than these initial 3 steps, which are a necessary base. I would also monitor, for example:

Database size and growth. This is important to be preemptive about your capacity. Example SQL at http://ronaldbradford.com/mysql-dba/#allschemas
Error Log changes
Backup timing. This is important as your DB grows as it affects recovery.
Recovery timing
Gather raw MySQL status information, because monitoring tools only capture what you ask them to, not everything. While you may not analyze it all now, you may want to look back in time in the future. Example scripts at http://ronaldbradford.com/mysql-dba/#log-stats
Hourly/Daily text reports. Producing an easily readable SHOW GLOBAL STATUS report, such as with statpack, lets me see, for example, network throughput in the DB, transaction throughput and key indicators of locking, disk access, etc. While you may have a graphical interface, it’s a lot easier to automate and grep text reports.
Proactive restrictions. The Twitter fail whale is a great example: as the system moves closer to known limits, but before it reaches them, they start limiting load. This includes, for example, disabling less critical but resource-intensive functionality, e.g. people search. They also start rejecting connections so they do not reach a crash state. This could mean proactively changing timeout values so the DB fails queries, and the web servers respond accordingly with a try-again approach.
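To illustrate why raw status captures are useful, here is a minimal sketch that turns two snapshots of cumulative SHOW GLOBAL STATUS counters into per-second rates and a grep-friendly text report. It is not statpack or the scripts linked above; the function names and the sample numbers are made up for illustration, and the snapshots are assumed to already be dictionaries of integer counters.

```python
def per_second_rates(before, after, elapsed_seconds, counters):
    """Compute per-second rates for cumulative counters between two snapshots.

    Only meaningful for counters that grow over time (e.g. Bytes_sent,
    Questions), not for point-in-time values such as Threads_connected.
    """
    rates = {}
    for name in counters:
        old, new = before.get(name), after.get(name)
        if old is None or new is None or new < old:
            # Missing value, or the counter went backwards (server restart).
            continue
        rates[name] = (new - old) / elapsed_seconds
    return rates


def text_report(rates):
    """Format rates one per line so the report is easy to grep."""
    lines = [f"{name:<24}{rate:>14.2f}/s" for name, rate in sorted(rates.items())]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical snapshots taken 60 seconds apart.
    first = {"Bytes_sent": 1000, "Questions": 500}
    second = {"Bytes_sent": 7000, "Questions": 1100}
    print(text_report(per_second_rates(first, second, 60,
                                       ["Bytes_sent", "Questions"])))
```

Because the raw snapshots are kept, a rate that nobody thought to graph today can still be computed for any past interval later.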

While these are important, if you have only limited resources, too much information can be just as much of a burden: people start ignoring the information and miss what’s important.

Monitoring MySQL with MONyog

November 25, 2009 by ronald

It just works. In the absence of any MySQL monitoring for your site, I have found no solution that gets you operational as quickly and easily. MONyog can be deployed in 60 seconds, and configured in another 60 seconds. Within 5 minutes you can have visual monitoring of your MySQL environment.

MONyog is agentless, which makes installation easy, but it also means there is no redundancy in the capture of monitoring information. It’s a static standalone executable, which is great if you need something to work out of the box. You can easily configure multiple servers in a replication topology, or different servers in your environment. You get the ability to monitor all the usual information, with a dashboard and detailed graphs. While MONyog does let you customize the rules for the graphs and the presentation order, that’s about it. At this time you can’t, for example, change the colors, graph anything beyond what MONyog monitors, or restrict certain options in the GUI to different users; however, I hope these suggestions are offered in future releases.

MONyog includes some nice features that are overlooked in other products. You can monitor the MySQL error log (if configured appropriately), a feature whose absence elsewhere is a common complaint of end users. You can also see the process list, and when configured you can also perform query gathering and analysis.

MONyog is a well-priced commercial product with a free trial download that requires no registration, so there is no barrier to accessing and evaluating it. Given the solution and its ease of deployment, there is no excuse not to evaluate this product. If you have no monitoring, you can now have it quickly and easily. I find a number of clients that simply have no monitoring. There really is no excuse, as it’s critical information you need to have and record for a successful business.

You can get it from www.webyog.com.

Monitoring MySQL Product Options

October 16, 2009 by ronald

I’ve had plenty of comments about specific products in response to Monitoring MySQL Options, which was posted before I provided the completed list. Here are the results from my survey to give everybody a more complete list.

Nagios	25	xxxxxxxxxxxxxxxxxxxxxxxxx
MONyog	8	xxxxxxxx
Cacti	4	xxxx
Munin	3	xxx
MySQL Enterprise Monitor/Merlin	3	xxx
Hyperic	2	xx
KontrolBase	2	xx
Zabbix	2	xx
Big Brother	1	x
iGlass	1	x
MyDBA	1	x
MySQL AR	1	x
pacemaker	1	x
Panopta	1	x
Opsview	1	x
Monit	1	x
Tivoli	1	x

NOTE: Some answers included multiple products, these are all counted separately in the above figures.

A few products in this list are not listed at Monitoring MySQL.

If you want to list what you use, please continue to use the MySQL Alert Monitoring Survey. Thanks to all those that replied.

Monitoring MySQL options

October 15, 2009 by ronald

My recent poll What alert monitoring do you use? showed that 25% of the 58 respondents bravely stated they had no MySQL monitoring. I see about 1 in 3 (~33%) in my consulting, so this is consistent.

There is no excuse to not have some MySQL Monitoring on your production system. At worst, you should be logging important MySQL information for later analysis. I use my own Logging and Analyzing scripts for every client for an immediate assessment regardless of what’s available. I combine that with my modified statpack to give me immediate text-based analysis, broken down into hourly chunks for quick reference. These help me in troubleshooting, but they are not a complete solution.

The most popular options I see, which are also reflected in the results, are:

Cacti with MySQL Cacti Templates – Open Source
Nagios used in conjunction with Cacti for alert monitoring – Open Source
MONyog – Commercial with Free trial

There is a good list, including some products I did not know. My goal is to get this information included in the Monitoring-MySQL information site.

I have some additional information on Cacti and MONyog, and I’ll be sharing this information in upcoming posts.

What do you monitor in MySQL?

September 3, 2009 by ronald

If you are unfamiliar with what to monitor in MySQL, start by looking at what popular monitoring products monitor. For example, the following is the list of MySQL Cacti Plugin measurements.

Innodb Buffer Pool Activity

Pages Created
Pages Written
Pages Read

Innodb Buffer Pool Pages

Pool Size
Database Pages
Free Pages
Modified Pages

Innodb File I/O

File Reads
File Writes
Log Writes
File Fsyncs

Innodb Pending I/O

Aio Log Ios
Aio Sync ios
Buffer Pool Flushes
Chkp Writes
Ibuf Aio Reads
Log Flushes
Log Writes
Normal Aio Reads
Normal Aio Writes

Innodb Insert Buffer

Inserts
Merged
Merges

Innodb Log

Log Buffer Size
Log Bytes Written
Log Bytes Flushed
Unflushed Log

Innodb Row Operations

Rows Read
Rows Deleted
Rows Updated
Rows Inserted

Innodb Semaphores

Spin Rounds
Spin Waits
OS Waits

Innodb Transactions

Innodb Transactions
Current Transactions
History List
Read Views

MySQL Binary/Relay Logs

Binlog Cache use
Binlog Cache Disk Use
Binary Log Space
Relay Log Space

MySQL Command Counters

Questions
SELECT
DELETE
INSERT
UPDATE
REPLACE
LOAD
DELETE MULTI
INSERT SELECT
UPDATE MULTI
REPLACE SELECT

MySQL Connections

Max Connections
Max Used Connections
Aborted Clients
Aborted Connects
Threads Connected
Connections

MySQL Files and Tables

Table Cache
Open Tables
Open Files
Opened Tables

MySQL Network Traffic

Bytes Received
Bytes Sent

MySQL Processlist

State Closing Tables
State Copying to Tmp Table
State End
State Freeing Items
State Init
State Locked
State Login
State Preparing
State Reading From Net
State Sending Data
State Sorting Result
State Statistics
State Updating
State Writing to Net
State None
State Other

MySQL Query Cache

Queries In Cache
Hits
Inserts
Not Cached
Lowmem Prunes

MySQL Query Cache Memory

Query Cache Size
Free Memory
Total Blocks
Free Blocks

MySQL Replication

Slave Running
Slave Stopped
Slave Lag
Slave Open Temp Tables
Slave Retried Transactions

MySQL Select Types

Select Full Join
Select Full Range Join
Select Range
Select Range Check
Select Scan

MySQL Sorts

Sort Rows
Sort Range
Sort Merge Passes
Sort Scan

MySQL Table Locks

Table Locks Immediate
Table Locks Waited
Slow Queries

MySQL Temporary Objects

Created Tmp Tables
Created Tmp Disk Tables
Created Tmp Files

MySQL Threads

Thread Cache Size
Threads Created
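
Many of the non-InnoDB items above appear to correspond directly to SHOW GLOBAL STATUS counters, so you can read them yourself before installing any product. The sketch below assumes the status output was saved as tab-separated text, one "name value" pair per line (for example from the mysql client in batch mode); the function names and the small mapping are my own illustration, not part of the Cacti templates.

```python
# A few graph item names from the list above, mapped to the MySQL
# status variables they appear to correspond to.
GRAPH_ITEM_TO_STATUS = {
    "Select Full Join": "Select_full_join",
    "Select Scan": "Select_scan",
    "Sort Merge Passes": "Sort_merge_passes",
    "Table Locks Waited": "Table_locks_waited",
    "Slow Queries": "Slow_queries",
    "Created Tmp Disk Tables": "Created_tmp_disk_tables",
    "Threads Created": "Threads_created",
    "Bytes Sent": "Bytes_sent",
}


def parse_status(text):
    """Parse tab-separated 'Variable_name<TAB>Value' lines into a dict."""
    status = {}
    for line in text.splitlines():
        parts = line.strip().split("\t")
        if len(parts) != 2:
            # Skip blank lines and variables with an empty value.
            continue
        name, value = parts
        status[name] = int(value) if value.isdigit() else value
    return status


def pick_measurements(status, mapping=GRAPH_ITEM_TO_STATUS):
    """Return {graph item name: value} for the variables that are present."""
    return {
        label: status[variable]
        for label, variable in mapping.items()
        if variable in status
    }


if __name__ == "__main__":
    # Hypothetical captured output, not real server data.
    sample = "Select_scan\t42\nSlow_queries\t3\nThreads_created\t17\n"
    for label, value in pick_measurements(parse_status(sample)).items():
        print(f"{label}: {value}")
```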