MySQL Cluster Certified

April 28, 2007 by ronald

Jonathon Coombes recently blogged in MySQL Cluster Certified that he passed the MySQL Cluster DBA Certification as was the first Australian. Lucky for him I passed the exam after my presentation on the second day of the conference. I guess us Australian’s are leading the world!

As Jonathon said it was rather hard, certainly more difficult then the other DBA exams but nothing for an experienced Cluster DBA.

MySQL Conference – YouTube

April 27, 2007 by ronald

MySQL Conference 2007 Day 4 rolled quickly into the second keynote Scaling MySQL at YouTube by Paul Tuckfield.

The introduction by Paul Tuckfield was; “What do I know about anything, I was just the DBA at PayPal, now I’m just the DBA at youTube. There are only 3 DBA’s at YouTube.”

This talk had a number of great performance points, with various caching situations. Very interesting.

Scaling MySQL at YouTube

Top Reasons for YouTube Scalability

The technology stack:

Python
Memcache
MySQL Replication

Caching outside the database is huge.

It a display of numbers of hits per day it was said “I can neither confirm or deny the interpretation will work here (using an Alexa graph)”. This is not the first time I’ve heard this standard “Google” response. They must get lessons by lawyers in what you can say.

Standardizing on DB boxes (but they crash almost daily)

4x2ghz opteron core
16G RAM
12x10k scsi
LSI hardware raid 10
Replication played a big part in fixing
Get a reliable H/W supplier

Replication Lessons

You don’t worry about it when a replicas fail.
One thing that sucks, Innodb doesn’t recover very fast. It does that durability think, but it takes hours to finish recovering (was it going to finish)
How many backups can you restore. When you switch you a replica, are you sure it’s right?
Did you test recovery, did you test your backups.
replication was key to trying different H/W permutations to identify incompatible H/W (combinations of controllers/disks)
we got good at re-parenting/promoting replicas, really fast
we built up ways to clone databases as fast as possible
Excellent way to test tuning changes or fixes (powerful place to test things)
Keep “intentional lag”/Stemcell replicas – Stop SQL thread, keeps a server a few hours or a day behind. Say if you drop a table you have a online backup.
When upgrading, always mysqldump then reload, rather then upgrade database.
Don’t care about CPU’s. I want as much memory as possible, I want as many spindles as possible.
For YouTube 2-3 second lag is acceptable.

If you db fits in ram, great otherwise

Cache is king
Writes should be cached by raid controller (buffered really) not the OS
Only the db should cache reads (not raid, not Linux buffer cache)

Only DB should cache reads

Hit in db cache means lower caches went unused.
Miss in db cache can only miss in other caches since they’re smaller.
Caching reads is worse then useless. It’s serialized writes.
Avoiding serialization in reads reaps compounds benefits under high concurrency

An important lesson learned. Do no cache reads in F/S and Raid Controller.

Caching Lessons
Overcoming Mystery Serialization

Use O_DIRECT
vm.swappiness=1-5
if you’re >80% buys — your not doing I/O concurrently look at other figures e.g. 80% busy 8 I/O’s, next configuration 80%, only 4 I/O’s
Mirror in H/W strip in S/W

Scale Out

Writes are parallel to master, but serialized to replicas. We need true horizontal partitioning.
We want true independent masters
EMD – Even More Databases — Extreme Makeover Database
Slave transactions must serialize to preserve commit order (this is why replication is always way slower)
The oracle caching algorithm (that’s a small o) — predicting the future
Replication lags: one IO bound thread. You do know the future, commands are coming up serially.
Write a script to do reads, before updates coming up (because they are cache hits).
The diamond. For golive, play shards binlogs back to original master for fallback.

MySQL Conference – Get Behind Dorsal Source

April 27, 2007 by ronald

In a community session yesterday at MySQL Conference 2007, I first heard about Dorsal Source. A collaboration between Solid DB and Proven Scaling that allows for community people to upload patches to MySQL, get it compiled across multiple platforms, and have a downloadable distribution available on H/W individual contributors will never have access to.

That’s a great idea. I also hope we get the opportunity to get compiling of patches into multiple versions, as well to get builds of a lot of patches together. Personally, I’m running 3 versions just to diagnose one problem. 5.0.36 with a custom binary change, 5.0.37 so I have SHOW PROFILE, and 5.0.33 so I have microslow patch.

With new patches becoming available from the community, I hope I can see builds that combine all known patches that Dorsal Source may have.

I think this is going to be a great project.

MySQL Conference – PHP on Hormones

April 27, 2007 by ronald

MySQL Conference 2007 Day 4 started early again at 8:20 am with PHP on Hormones by the father of PHP Ramus Lerdorf.

A very funny man, one of the best insightful talks of the conference (rather scary actually). Here are some opening comments.

In his own words as Keynote speaker. “I’m here because I’m old”.
Php 1 from 1994 started after seeing Mozilla in 1993. Because it was just me using it, I could change the language any time.
In 2005 the code looks like this (in comparison on 1995) — I’m not sure if this is worth 10 years of development
I wrote PHP to avoid programming
It’s changed to be more OO because people expect that. Universities teach this.
Hey, I was fixing bugs in my sleep. Iwould wake up, and in my mail box there would be bug fixes to bugs I didn’t even know I had.

Why do people contribute?

Self-interest
self expression
hormones
Improve the world

The slide included a great Chemical equation of “The Neuropeptide oxytocin” — Nature’s trust hormone

People need to attract other people, it makes you feel good, it comes out when you interact with people.

It’s not what people think about you, but rather what they think about themselves.

PHP was my baby, giving up control, just because I started it, doesn’t mean I have a bigger say in it.
Systems that harness network effects and get better the more people use them in a way that caters to their own self-interest. — Web 2.0
Once you build a framework your done, the users build the site, they drive the content.
The same people that work on open source projects, are the same people that use websites.
- Self-interest
- self expression
- hormones
- Improve the world

1. Performance
It your sites falls apart your done.

Benchmark
- http_load
- Callgrind inside valgrind
- XDebug

valgrind –tool=callgrind

Excellent tool to see where time is spent in the code. You have to run a profiler.
Example of using Drupal. It turns out 50% of time was spent in the them, it had 47 SQL queries, 46 Selects.
Went from 4 per second to 80 per second, without any code changes. Some performance options, and some caching.
Guaranteed you can double the speed of your website by using a profiler.

2. Security
Critical problem areas.

404 pages
Search page
PHP_SELF
$_GET, $_POST, $_COOKIE
$_SERVER
Lots of stupidity in IE (e.g. Always send a charset)

The web is broken you can all go home now.

People are venerable because people run older versions of browsers, and their data is not secure, and you can’t secure their data.

What can happen??
9 out of 10 of you have cross-site scripting hole on your site

Remote Greasemonkey
Profile Hacks
JS Trojans

Added a PHP logo to the MySQL User Website, it’s really the PHP website
IBM webpage, on article about security.

Tool to find holes, banks, insurance companies, CIA, even Yahoo where I work.

You know if they have been to bankofamerica.com, you can tell if they are a customer, you can tell if they are logged, you can then see their cookie credentials.

You don’t know if any sites have these problems.

JS trojan, iframe that captures
reconfigures your wireless router, moves it outside your DMZ, then uses traditional techniques to attack your machine (that you thought was secure inside a firewall)

You should never ever click on a link. It sort of defeats the purpose of the web.

Never use the same browser instance to do personal stuff and browsing.

So what are we doing about this?
There isn’t much we (PHP) can do to secure sites developed.
Built a filter extension in 5.2, back in 5.1.

http://php.net/filter *** YOU MUST IMPLEMENT THIS
filter.default=special_chars

3. APIs are Cool!

Two lines to grap the Atom feed from flickr of photos just uploaded.
That’s all I have to add to my code.

The really make you want to use the servers. It’s so easy.

API drives passion, drive people to use your site.
You can add a lot of cool things to your sites.

What to do

Avoid Participation Gimmicks
Get their Oxytocin flowing
Solve One Problem
Clean and Intuitive UI
API’s
Make it work

A full copy of the slides can be found at http://talks.php.net/show/mysql07key

MySQL Conference – Google

April 27, 2007 by ronald

MySQL: The Real Grid Database

Introduction

Can’t work on performance problems until we solve the availability
We want MySQL to fix our problems first.

The problem

Deploy a DBMS for a workload with
- too many queries
- to many transactions
- to much data

A well known solution

deploy a grid database

-use many replicas to scale read performance
-shard your data over many master to scale write performance
-sharding is easy, resharding is hard

availability and manageability trump performance

– make it easy to run many severs
– unbretable aggregate perfomance

we describe problems that matter to us.

The grid database approach

Deploy a large number of small servers
use highly redundant commodity components
added capacity has a low incremental cost
not much capacity lost when a server fails
support many servers with a few DBAs

Managability
Make it easy to do the tasks that must be done. Reduce, Reduce.
Make all tasks scriptable
Why does it mater, support hundreds of servers, spend time solving more interesting problems. You generally have lots of problems to solve.

Underutilize your severs
Require less maintenance
Requre less tuning
tolerate load spikes better
tolerate bad query plans better

In a Perfect World
Short running queries
uses kill mistake and runaway queries
accounts new use to many connections
query plans are good
new apps increase database workload by a small amount
only appropiate date is stored in the database

Reality

Long running transactions, create replication delays everywhere
servers with round robin DNS aliases make queries hard to find
applications create more connections where the database is slow
some storage engines use sampling to get query plan statistics
new applications create new database performance problems
applications use the database as long as rows are never deleted
many long running queries on replicas

Solutions

Improve your ability to respond because prevention is impossible
Need tools to make monitoring easier
determine what is happening across servers
detemine what happened in the past

Mantra

Monitor everything you can, and archive as long as possible. (vmstat 5 secs, iostat, mysql error logs)
You will need these to reconstruct failures
save as much as possible
script as much as possible

Monitoring Matters

Display what is happening

-which table, account or statemsns caused most of the load
-many fast queries can be as much a problem as one slow query

Record what happending

–archivce show status counters somweere
– query data from the archive
— visualise data from the archivce

record queries that have been run

— archive show processlist output (do every 30 seconds)
— support queries on this archive

All of this much scale to an environment with many servers

Monitoring Tools

Display counters and rate change for counters
aggregate values over many servers
visualize and rang results
display results over time

Google mpgrep tools

New Commands
We changed mysql, three new commands
SHOW USER _STATISTICS
SHOW TABLE STATISTICS
SHOW INDEX STATISTICS

Per Account Activity
USER_STATISTICS
seconds executing commands
number of rows fetched and changed
total connections
number of select/updates/other/commits/rollback/binlog bytes written.

TABLE STATISTICS
number of rows fetched/changed

INDEX STATISTICS
display number of rows fetched per index
helps find indexes that are never used

available in code.google.com in 4.0, porting to 5.0

MySQL High Availability

Great options
- Cluster
- Replication
- Middelware — e.g. continum
- DRBD
We need some features right now
we are committed to innodb and mysql replication

*a lot of appplicaton code works on this
*our tools and processed support this

We favor commodity hardware

There are all great features but we are much more limited in what we can use.
Management want to know we don’t loose transactions, not loose some transactions.

Desired HA Functionality

Zero transaction loss on failures of a master
minimal downtime on failures of a master
reasonable cost in performance and dollars
fast and automatic failover to local or remove server
no changes to our programming model
- does it support MVCC
replication and reporting are concurrent on a slave

MVCC must have update concurrent with query.

Failures happen everywhere
OS – kernal oom or panic (older 2.4 32 bit systems)
mysqld – caused also by code we added
disk, misdirected write, corrupt write (love innodb checksums)
file system – inconsisted after unplanned hardware reboot (use ext2)
server – bad RAM
lan, switch – lose
Rack – reboot
Data center – power loss, overheading, lightning, fire
People – things get killed or rebooted by mistake ( a typo can take out the wrong server, when names differ by a character or a digit)

ext2 and 4.0 are great, there are the same generation.
Trying not to use RAID, not battery backed raid etc, we try work around with software solutions. We do use RAID 0, but we also try software solution.
When we have the right HA solution, we won’t need RAID.

Mark. “Yes, Google programmers have bugs. Not me personally, it was my predecessor.”

HA Features we want in MySQL
Synchronous replication as an option
a product that watches a master and initiates a failover
archives of the master binlogs stored elsewhere
state stored in the filesytstem t obe consistent after a crash
. innodb and mysql dictionaries can get out of sync
.replicatoin state on a slave can get out of sync

We could not wait
Features we added to MySQL 4.0.26
We can do things a lot faster
. we have more developers lying around
. Our needs as specific, not a general product solution

Transactional replications for slaves
semi-synchronous replication
mirrored binlogs
fast and automated failover

Transactional Replication
Replication state on a slave is stored in files
slave sql thread commits to storage engines and then updates a file
a crash between the two can make replication state inconsistent
transactional replication
MySQL can solve this in the future by storing replication state in tables

Semi-synchronous replication
Block return from commit on a master until at least one slave has acknowledged receipt of
slave io thread acknowledges receipt after buffering the changes
modified mysql replication protocol to support acknowledgments
conifuration options
where to the master uses it
where a slave used it
how long the maser waits for an acknowledgement

can run a server with some semi-sync replication slaves and some regulare replication salves
this can be worked with any storage engines that supports commit, but we only use innodb

* This is how we guarantee to management for Zero Transaction Loss.

Latency single stream 1ms, multi-stream 10ms. This is acceptable for us.

The MySQL Replication Protocol

The current replication protocol is efficient
a slaves makes one request

Replication Acknowledgment

Mirrored Binlogs
mysql does not provide a way to maintain a copy of a master’s binlog on a replica. By copy we me a file of same name and equivalent byte for byte.
Hierarachial replication works much better where a slave can disconnect from one replication proxy and reconnect to another with adjusting binlog offsets.
Hot backups taken before a failover and difficult to use after a failover

Mirrored Binlog Implementions
Slave IO threads write their own relay log and a copy of the bin log
all events but the rotate log event are written

After failover, start a new binlog on new master

Fast Failover

Slaves use a hostname, rather then an IP
You can’t enable the binlog dynamically (in 4.0)
Added new SQL STATEMENTS that does
disconnect users with SUPER privilege
disable new connections
enable the bin log
enable connections from all users

Automatic failover
Something must decided that a master has failed
Something must choose the new master

Q: What keeps up from moving to 5.0?
A: Queries don’t parse (Joins)

Data sets, 8GB servers, 50-100GB’s

Quote – 26 April 2007

April 27, 2007 by ronald

“The web is broken you can all go home now.”

Ramus Lerdorf — Father of PHP — MySQL Conference 2007

Quote – 25 April 2007

April 26, 2007 by ronald

“Don’t complain, do something about it”

Baron Schwartz – Creator of MySQL Toolkit — MySQL Conference 2007

MySQL Roadmap

April 26, 2007 by ronald

Here are some notes from the MySQL Server Roadmap session at the MySQL Conference 2007.

MySQL: Past and Future

2001: 3:23
2003: 4.0 UNION query Cache Embedded
2004: 41. Subqueries
2005: 5.0 Stored Procedures, Triggers, Views
Now: 5.1.17 Partitioning, Events, Row-based replication
2007?: 6.0 Falcon, Performance, Conflict detection
2008?: 6.1 Online Backup, FK Constraints

2007 Timeline

Q1: 5.1 Beta, 5.1 Telco Production Ready, Monitoring Service 1.1, MySQL 6.0 Alpha, Community GA
Q2: MySQL 6.0 Beta, New Connectors GA
Q3: 5.1 RC, 6.0 Beta, MS 2.0, Enterprise Dashboard beta
Q4: 5.1 GA, 6.0 Beta

Where are we today?

We are by fare the most populate open source database
The Enterprise world is moving online and MySQL is well-positioned for that trend, But:
- Transactional scalability
- Manageability
- Specific online features

MySQL Server Vision – The Future

Always Online — 24×7, Online backup,online analytics, online schema changes
Dynamic Scale-out — online partitioning, add node, replication aides,
Reliable — fault-tolerant, easy disagnosis, stable memory, ultimately self-healing
High-performance — Interactive web, real-time response, apps, 10,000-100,000 clients
Ease of use — Portable, Best for development, multiple connectors, easy tuning
Modularity and Ubiquity — Storage engines, plug ins

How can you help?

Bug finding and fixing — Community Quality Contributor
Feature/patch contribution
But, to expedite your patch

The goal: “Be the Best Online Database for Modern Applications”

Quote – 25 April 2007

April 25, 2007 by ronald

“What ever advice you got, keep it to yourself, your not the target market.”

Red Hat & One Laptop Per Child UI Designer to bunch of suits – MySQL Conference 2007

MySQL Conference – For Oracle DBAs and Developers

April 25, 2007 by ronald

I have just completed my presentation at the MySQL Conference 2007 on MySQL for Oracle DBAs and Developers.

Not mentioned in my slides, but referenced during the presentation was what I consider the most important page to document from the MySQL Manual — 5.2.1. Option and Variable Reference

You can download a PDF copy of my presentation here.

MySQL Conference – Building a Vertical Search Engine in a Day

April 25, 2007 by ronald

Moving into the user sessions on the first day at MySQL Conference 2007, I attended Building a Vertical Search Engine in a Day.

Some of my notes for reference.

Web Crawling 101

Injection List – What is it seed URL’s you are starting from
Fetching the pages
Parsing the content – words and links
Updating the crawl DB
Whitelist
Blacklist
Convergence — avoiding the honey pots
Index
Map-reduce — split a large problem into little pieces, process in parallel, then combine results

Focused content == vertical crawl

20 Billion Pages out there, a lot of junk
Bread-first would take years and cost millions of lives

OPIC + Term Vectors = Depth-first

OPIC is “On-line Page Importance Calculation”. Fixing OPIC Scoring Paper
Pure OPIC means “Fetch well-linked pages first”
We modify it to “fetch pages about MySQL first”

Nutch & Hadoop are the technologies that run on a 4 server cluster. Sample starting with www.mysql.com in 23 loops, 150k pages fetched, 2M URL’s found .

Serving up the results

Generating the index
Setting up Nutch with Tomcat (can also run with Resin) Introduction to Nutch, Part 2: Searching, Running Nutch with Mac OSX
Single searcher vs. multiple searchers
Optimizing the user interface

MySQL Conference – RedHat Keynote – One Laptop Per Child

April 25, 2007 by ronald

Our third keynote at MySQL Conference 2007 was titled Building the Ultimate Database Container with RHEL, MySQL, and Virtualization by Michael Evans.

The presentation was on Red Hat & One Laptop Per Child. His initial Quote was “Thinking Past Platforms: The Next Challenge for Linux”, By Doc Sears, 2007-04-16 http://www.linuxjournal.com/node/1000210

OLPC

A Non profit idea from Nicholas Negroponte.
Aim is to build & distribute inexpensive laptop systems to primary & secondary school students worldwide.
Sell to young children in developing countries.

In summary at presentation to Red Hat — “Non-profit, run by a professor, we make hardware and sell to governments.”

The overall dynamics have attracted a lot of interesting people in the world.

The ability and goal is to make the device together, bringing all H/W and S/W people together.

The people that get behind this project have the ethos — “I’m willing to jump into this to change the world.”

This is the first time for a new opportunity in the last 10 years.

The sugar user interface is a completely new experience.

When the UI designer was presenting to a room of head executives. “What ever advice you got, keep it to yourself, your not the target market.”

One key point — No backward compatibility needs.

More information at www.laptop.org. Wikipedia Reference. Some videos at You Tube Inside One Laptop per Child: Episode one and Slightly better demo of the OLPC User Interface.

MySQL Conference – The next keynote with Guy Kawasaki

April 25, 2007 by ronald

Without missing a beat at MySQL Conference 2007, we moved from Marten’s keynote to The Art of Innovation by Guy Kawasaki.

Extremely fun and entertaining. His 10 points.

1. Make Meaning

“To change the world”
To a VC, do not say “you want to make money”, that is understood. You will attract the wrong team.

2. Make Mantra

Not a Mission statement (50-60 words long), but 2 or 3 words.
- Wendy’s – “Healthy fast food”
- Mike – “Authentic Athletic Performance”
- FedEx – “Peace of Mind”
- eBay – “Democratize commerce”
Create a mantra — Why do you exist?

If you get stuck try the Dilbert mission statement generator.

3. Jump to the next curve

Not 50% or 100% better, but “Do things 10x better”

4. Roll the DICEE

“Create great stuff”
- Deep: Fanning (Reef) Sandal that open beer bottles
- Intelligent: BF-104 Flashlight (Panasonic) (takes 3 sizes of batteries)
- Complete: Lexus
- Elegant: Nano (Apple)
- Emotive: Harley Davidson (They generate strong emotions)

5. Don’t worry, be crappy

Get it out there.

6. Polarize people

People love it or hate it.

7. Let a hundred flowers blossom

People that are not your target market are using it.
Take the money, ask the people why you are buying, ask what you can do better.

8. Churn baby, churn.

Ok to ship stuff with crappy stuff in it, but important to continually revised and improve.

9. Niche thyself
With a nice graph.

Vertical — Ability to provide unique product or service
Horizontal –Value to customer

bottom right — Price
top left — Stupid
bottom left — Dotcom
top right — X You need to be High and to the right.

Fandango — It’s either Fandango, or Clubbin.
Breitling Emergency – watch
Smart car – park perpendicular
LG Kimchi refrigerator

You need to be like the President of the United States – You need to high and to the right. Got a great laugh from the crowd.

10. Follow the 10/20/30 rule

Innovative, you need to pitch for what you want.

The optimum number of slides in 10 slides.
Given the slides in 20 minutes.
Use 30 point font

11. Don’t let the bozos grind you down

A bonus to our friends in the community.

“I think there is world market for five computers”
“This telephone has too many shortcomings to be seriously considered as a means of communication. The device is inherently of no value to us.” –Western Union 1876
“There is no reason why anyone would want a computer in their home.” — Digital Equipment Corp 1977
“It’s too far to drive, and I don’t see how it can be a business.” – Guy Kawasaki – Bozo (The company was Yahoo)

Guy commenting on his lost opportunity with Yahoo — “It only covers the first billion, it’s the second billion that pisses me off.”

Read more about Guy at his website Guy Kawasaki.

The Art of Innovation. If you a copy of slides, send an email to [email protected]

MySQL Conference – PBXT Recognized

April 25, 2007 by ronald

As I mentioned in MySQL Conference – Rewarding the Community, Paul McCullagh, the creator of the Community Transactional Storage Engine PBXT won the Community Code Contributor of the Year award.

A photo for Paul. Great work.

MySQL Conference – Opening Keynote with Marton Mickos

April 25, 2007 by ronald

It’s an early start this morning at 8:20am at MySQL Conference 2007 with CEO Mårten Mickos keynote talk Welcome and State of MySQL AB.

Here are some of the key points that impressed on me.

“The Participatory & Disruptive spirit of the Dolphin.”.

Open Source disrupts inefficient models and produces new wealth.

Architecture of Participation.

GPL and FOSS licenses
Google
del.ico.us

MySQL Architecture of Participation
You have the forge, the planet, community contributer program and many more.

Production of Amateurs

You Tube
Second Life
Wikipedia

Some really great Quotes:

“Fail often to succeed soon.” IDEO company slogan

“Noah’s Ark was built by amateurs, but the Titanic by professionals.”

Innovation Happens Here

MySQL Monitoring & Advisory Service

In his presentation of MySQL Network Enterprise Dashboard, If you were quick you would have noticed the MySQL Version 6.0.0-alpha-pb518.log

Leading Applications

Open Source ISVs
Web
Web 2.0
On-Demand
Hosting
Telco & VOIP
Security
Enterptise 2.0
Hub & Spoke

We want to be known as The Online Database.

Drawn to the Lamp

Microsoft
Oracle
IBM
Sun
HP
Dell
Unisys

They all have an Open Source strategy, the develop open source products, they use and partner with MySQL.

He also mentioned MySQL Enterprise Connection Alliance – MECA.

Global Distributed Organization

70% work from home
100 major locations in nearly 30 different countries.
There is a great focus on culture – global culture.

Disruptive Businesses

A smarter way to produce the goods
A smarter way to distribute them
Serve the underserved
Keep it simple
Innovate with technology to give customers more convenience
Make money

Gartner: 66% of enterprises are deploying MySQL or planning to
750,000 newsletter subscribers
MySQL installed in over 11 million installations

Going Forward

Scalability
Availability
Manageability

MySQL – The best online database in the world.

MySQL Conference – Rewarding the Community

April 25, 2007 by ronald

At MySQL Conference 2007, CEO Mårten Mickos in his opening keynote Welcome and State of MySQL AB rewarded the community. Those that contributed to “The best database in the world”.

2007 MySQL Applications of the Year
#1 in Online Video
#1 in 3G Mobile Entertainment
#1 in Creative Software

And the Winners- YouTube, Amp’d mobile, and Adobe

2007 MySQL Partners of the Year
#1 reseller of MySQL Enterprise to govt
#1 in MySQL Enterprise integration
#1 in Open Source

And the Winners – Carasoft, HP, and RedHat

2007 Community Members of the Year
Quality Contributor
Community Code Contributor
Community Advocate

And the Winners

Martin Freibe
Paul McCullagh
Sheeri Kritzer

MySQL 6.0

April 25, 2007 by ronald

If you were quick you may have noticed at the MySQL Conference 2007 keynote Welcome and State of MySQL AB talk this morning, Robin Schumacher in his quick demo of MySQL Monitoring & Advisory Service showed the MySQL Network Enterprise Dashboard, you would have noticed the MySQL Version 6.0.0-alpha-pb518.log

Did anybody else spot it?

Opening Keynote at MySQL Conf

April 25, 2007 by ronald

It’s an early start this morning at 8:20am at MySQL Conference 2007 with CEO Mårten Mickos keynote talk Welcome and State of MySQL AB.

His spirit is evident with his opening slide “The Participatory & Disruptive spirit of the Dolphin.”. Stay tuned for more, it’s going to be a big day today.

MYSQL Conference – Scaling and High Availablilty Architectures Tutorial

April 24, 2007 by ronald

My first tutorial today at MySQL Conference 2007 is Scaling and High Availablilty Architectures by Jeremy Cole and Eric Bergen of Proven Scaling.

Basic Tenets

While not discussed, the premise is to Cache Everything. MemCache is a key component to any scalable system.

Lifetime of a scalable system

Using the analogy from a newborn child Jeremy stepped us through the categories Newborn, Toddler, Teenager, Late teens to 20s, Adult.

In Late teens to 20s, is where most systems die a slow death, he termed “the awkward stage”. This is where scalability is critical, and a meltdown for example can ruin you. Downtime is also just not acceptable for your user community.

When your Adult you need to perfect the ability to deploy incremental changes to your application seamlessly.

As the system grows, optimizations changes that may have worked are now affecting your system. It’s important to revisit during each stage.

Partitioning

Most applications mainly implement a horizontal partitioning model. Different components of your systemcan be scaled by a “partition key”. The different models include fixed “Hash key” partitioning, Dynamic “directory” partitioning, Partition by “group” and partition by “user”.

The Dynamic “directory” is a lot harder to implement, but is ultimately more scalable.

One of Partitioning Difficulties, is inter-partition interactions. Ultimately the solution is duplicating meta-data or duplicating data. Overall reporting is also more difficult. What if we want average for users per location, if we partition by users. Most systems user driven and partition by user. A newer strategy is to partition by group.

For implementing a Fixed Hash Key partitioning.

Divide data into B buckets
Divide the B buckets over M machines

You define 1024 physical buckets (can then the easily dividable) 0-1023 (user_id % 1024). Coded then by range to physical machines, 0-255, 256-511, 512-767, 768-1023. The plus side is very easy to implement, you can always derive where something is. The biggest problems is scalability, e.g. going from 4 machines to 5. You also don’t have any fine grained control over buckets.

For Dynamic Directory partitioning you maintain a database of mappings to partitions. A user can be easily moved at a later date in a much finer grain. MySQL Cluster is designed for this type of application. It is not necessary however, a well configured Innodb Hardware solution with memcache can easily provide the same functionality. The only writes are new users, or update partition keys, with a lot of reads.

HiveDB

This open source product implements a “standard” partition-by-key MySQL system written in Java.
Many organizations have a somewhat similar built system, but this is an example of something that’s been open sourced.

More information at www.hivedb.org.

The Hive API language should be the only code that should be re-written to be application development language (e.g. PHP,Ruby) when needed.

High Availability

The obvious goals.

Avoid downtime due to failures.
No single point of failure.
Extremely fast failover.
No dependency of DNS changes.
No Dependency on code changes.
Painless and seamless failover.
Fail-back must be just as painless.

The overall objective is speed.

Understanding MySQL Replication is important to understanding HA options.

MySQL Replication is Master-Slave One Way asynchronous replication.

Slave requests binary logs from last position.
Master sends binary logs up to current time.
Master keeps sending binary logs in real-time.

Dual Master provides an easy configuration to fail over, it doesn’t provide benefits in throughput. Can help solve online schema changes without downtime. Assuming existing queries will perform both pre and post schema. (set-sql-bin-log=0 for the session is the tip). There are a number of caveats.

Ultimately for High Availability you have a trade off, data loss (minuet) to scalability.

MySQL Conference & Expo

April 24, 2007 by ronald

The MySQL Conference has started. It will be a long week, still yet to prepare my own presentation for tomorrow. Old friendships already renewed, plenty of faces to names already, and we have yet to hit the first session.

Today is tutorial day, this morning I’m with Paul McCullagh Mr PBXT in Scaling and High Availablilty Architectures by Jeremy Cole and Eric Bergen of Proven Scaling

SHOW PROFILE

April 20, 2007 by ronald

I’ve been playing more with the SHOW PROFILE command available as part of MySQL Community 5.0.37. Thanks to the author Jeremy Cole. This command can provide some interesting insight into the workings of MySQL. It does however like most new releases of software enable users to make suggestions for new features and functionality. Here is my wish list. Some I figure are practical, some are wish list, but if you never ask you never know.

The Unit of measure for duration is Second. It would be great if it could be tuned for display, say millisecond. If you look at my first Example, all figures are effectively represented in milli-second or even micro-second granularity.
I would like to see a total for the query. Again in Example 1, you have to add up all the figures to determine this query took 8ms.
Again in Example 1, perhaps a percentage of total time for each line duration may be helpful.
More descriptive status descriptions (this is part of the MySQL internal code and not the patch)
SET PROFILING=1; can only be set on a current session, making it impossible to easily monitor a JDBC multi-connection test. There needs to be a way to enable for a session, other then current interactive one you are viewing, but also be able to see results. You can’t do a SHOW PROFILE via a JDBC connection!
I’d like to see a threshold, so queries running under threshold are discarded, much like a long-query-time option. This enables you to run a large number of SQL Statements and only profiles for longer running ones are logged
I’d like to see a level of logging to file, again like a slow query log, so you can simply gather information on a working system and review at some later time. Combined with the previous point, you now have microsecond slow query log with explicit details.

One major benefit of the SHOW PROFILE command is I can accurately get a figure for how long a query it taking (time in milliseconds). You just have to sum all figures (See wish list point 2).

By default, the source details are not provided, you need to specify the SOURCE operand, which helps in both comparing with any debugging output and also trouncing through the code. As in Example 1, I needed to find why 95% of time was in a step with the most descriptive line of ‘end’.

Example 1

mysql> show profile SOURCE,MEMORY for query 4;
+--------------------+------------+-----------------------+---------------+-------------+
| Status             | Duration   | Source_function       | Source_file   | Source_line |
+--------------------+------------+-----------------------+---------------+-------------+
| Opening tables     | 0.00013200 | open_tables           | sql_base.cc   |        2106 |
| System lock        | 0.00001800 | mysql_lock_tables     | lock.cc       |         153 |
| Table lock         | 0.00000600 | mysql_lock_tables     | lock.cc       |         162 |
| init               | 0.00001300 | mysql_select          | sql_select.cc |        2073 |
| optimizing         | 0.00004800 | optimize              | sql_select.cc |         617 |
| statistics         | 0.00002500 | optimize              | sql_select.cc |         773 |
| preparing          | 0.00005200 | optimize              | sql_select.cc |         783 |
| executing          | 0.00002200 | exec                  | sql_select.cc |        1407 |
| Sending data       | 0.00000500 | exec                  | sql_select.cc |        1925 |
| end                | 0.00786600 | mysql_select          | sql_select.cc |        2118 |
| query end          | 0.00001400 | mysql_execute_command | sql_parse.cc  |        5085 |
| freeing items      | 0.00000700 | mysql_parse           | sql_parse.cc  |        5973 |
| closing tables     | 0.00001900 | dispatch_command      | sql_parse.cc  |        2120 |
| logging slow query | 0.00001000 | log_slow_statement    | sql_parse.cc  |        2178 |
| cleaning up        | 0.00000500 | dispatch_command      | sql_parse.cc  |        2143 |
+--------------------+------------+-----------------------+---------------+-------------+
15 rows in set (0.01 sec)

Example 2

We were experiencing increased latency in JDBC with a particular query, With a recommendation from MySQL Support we tried SET SQL_BUFFER_RESULT=1;

mysql> show profile source for query 14;
+------------------------------+------------+-------------------------+---------------+-------------+
| Status                       | Duration   | Source_function         | Source_file   | Source_line |
+------------------------------+------------+-------------------------+---------------+-------------+
| Opening tables               | 0.00006025 | open_tables             | sql_base.cc   |        2106 |
| System lock                  | 0.00004875 | mysql_lock_tables       | lock.cc       |         153 |
| Table lock                   | 0.00000400 | mysql_lock_tables       | lock.cc       |         162 |
| init                         | 0.00001600 | mysql_select            | sql_select.cc |        2073 |
| optimizing                   | 0.00005675 | optimize                | sql_select.cc |         617 |
| statistics                   | 0.00001250 | optimize                | sql_select.cc |         773 |
| preparing                    | 0.00005175 | optimize                | sql_select.cc |         783 |
| Creating tmp table           | 0.00001275 | optimize                | sql_select.cc |        1206 |
| executing                    | 0.00006025 | exec                    | sql_select.cc |        1407 |
| Copying to tmp table         | 0.00000400 | exec                    | sql_select.cc |        1547 |
| converting HEAP to MyISAM    | 0.04820900 | create_myisam_from_heap | sql_select.cc |        9914 |
| Copying to tmp table on disk | 0.04049075 | create_myisam_from_heap | sql_select.cc |        9968 |
| Sending data                 | 1.29302000 | exec                    | sql_select.cc |        1925 |
| end                          | 0.09398425 | mysql_select            | sql_select.cc |        2118 |
| removing tmp table           | 0.00004975 | free_tmp_table          | sql_select.cc |        9856 |
| end                          | 0.00089125 | free_tmp_table          | sql_select.cc |        9884 |
| query end                    | 0.00001850 | mysql_execute_command   | sql_parse.cc  |        5085 |
| freeing items                | 0.00000825 | mysql_parse             | sql_parse.cc  |        5973 |
| closing tables               | 0.00003425 | dispatch_command        | sql_parse.cc  |        2120 |
| logging slow query           | 0.00001325 | log_slow_statement      | sql_parse.cc  |        2178 |
| cleaning up                  | 0.00000675 | dispatch_command        | sql_parse.cc  |        2143 |
+------------------------------+------------+-------------------------+---------------+-------------+
21 rows in set (0.00 sec)

Looking at the lines helped to indicate that the temporary table was being flushed to disk, indicating we need to Add SET SESSION tmp_table_size=20*1024*1024;

mysql> show profile source for query 18;
+----------------------+------------+-----------------------+---------------+-------------+
| Status               | Duration   | Source_function       | Source_file   | Source_line |
+----------------------+------------+-----------------------+---------------+-------------+
| Opening tables       | 0.00006050 | open_tables           | sql_base.cc   |        2106 |
| System lock          | 0.00001250 | mysql_lock_tables     | lock.cc       |         153 |
| Table lock           | 0.00000400 | mysql_lock_tables     | lock.cc       |         162 |
| init                 | 0.00000775 | mysql_select          | sql_select.cc |        2073 |
| optimizing           | 0.00005475 | optimize              | sql_select.cc |         617 |
| statistics           | 0.00001225 | optimize              | sql_select.cc |         773 |
| preparing            | 0.00005075 | optimize              | sql_select.cc |         783 |
| Creating tmp table   | 0.00001350 | optimize              | sql_select.cc |        1206 |
| executing            | 0.00006125 | exec                  | sql_select.cc |        1407 |
| Copying to tmp table | 0.00000375 | exec                  | sql_select.cc |        1547 |
| Sending data         | 0.29110925 | exec                  | sql_select.cc |        1925 |
| end                  | 0.08023800 | mysql_select          | sql_select.cc |        2118 |
| removing tmp table   | 0.00001525 | free_tmp_table        | sql_select.cc |        9856 |
| end                  | 0.05971400 | free_tmp_table        | sql_select.cc |        9884 |
| query end            | 0.00001925 | mysql_execute_command | sql_parse.cc  |        5085 |
| freeing items        | 0.00000425 | mysql_parse           | sql_parse.cc  |        5973 |
| closing tables       | 0.00004625 | dispatch_command      | sql_parse.cc  |        2120 |
| logging slow query   | 0.00000800 | log_slow_statement    | sql_parse.cc  |        2178 |
| cleaning up          | 0.00000300 | dispatch_command      | sql_parse.cc  |        2143 |
+----------------------+------------+-----------------------+---------------+-------------+
19 rows in set (0.00 sec)

Google's worse nightmare

April 9, 2007 by ronald

Today while waiting at the airport, I took a look at the news stand, and right there on the cover of Fast Company were two words Google, and Wikipedia. Given Wikipedia is a poster boy of MySQL it was an immediate purchase just to see what was being said.

So the title of the cover was Google’s worse nightmare – Wikipedia’s Jimmy Wales has his sights set on the search business.

Now, often I use Wikipedia to search for things directly rather then using Google. I’ve found it usually to be more accurate, particular on topics I know it will contain. References to search users being disappointed, Google and Yahoo tied with a 2.3 of 5 in user satisfaction hits about home for me as week, and that’s exactly the ideas behind www.wikia.com.

The article states “Google doesn’t have an effective method of locking in its customers the way earlier info-tech leaders did”. Well I’d agree in relation to searching, but Google is rapidly releasing new products, new features and acquiring good ideas. It has the wealth (money and gene pool) to make an impact.

Some more on www.wikia.com — Wikia offers free MediaWiki hosting for your community to build a free content wiki-based website. Find out why you should use Wikia and request a new wiki in any language or see our list of wikis to visit an existing one.

Today, it has 69 categories, I’m sure tomorrow it will be more. today’s feature page is The Marvel Database, others like Wookieepedia – The Star Wars encyclopedia and 100’s of entries alone in the Games Category make it something to look at one of day’s when I have free time.

And best of all it runs the LAMP Stack. Linux, Apache, MySQL, PHP.

Get behind a new exciting site

April 9, 2007 by ronald

As I write this blog I have over 90 draft blog posts. That’s 9-0. Why do I have so many posts? The main reason is I want to say something, and I’ve either not completed it, or researched it sufficiently to consider the entry complete.

This frustrates me as sometimes I just want to get the word out on something, or of my opinion, or of something great I’ve discovered. I do it for me, I don’t really care if anybody actually reads my stuff, but I’m surprised sometimes when I get comments how people actually get to see my blog.

JotThat is a surprisingly simple yet brilliant idea. It’s quite simply a site for making Jots, making quick notes, making a passing comment, noting a thought, something you want to either remember or something you want to say in a simple Jot form.

What makes JotThat in my eyes? Well it’s simple, and I strive for simple. It’s extremely easy to use. It’s built on MySQL, so that gains my attention. It’s something just a little different from everything you see each day.

I especially like the new feature of being able to email your Jots to your account. I was at the train station tonight, then later at dinner with some great input from somebody. I wanted to write it down (I forget things a lot more these days), so instead of adding it to notes in my PDA, I simply emailed it to my JotThat account. Easy as!

The site is still in early stages, but please get behind a project. I always like to get referrals from people on everything from websites, products, even services like a good dentist. I recommend you check this one out, as well as supporting the MySQL open source community.

Google Maps for Personal Use

April 6, 2007 by ronald

Just as I talked yesterday about Google Maps for Traffic Updates, today Google Maps released a new feature today called My Maps.

I’ve just started using it, my first map is incomplete but you can see the workings at Travel Locations

MySQL LOAD DATA Trick

April 5, 2007 by ronald

I leaned a new trick today with LOAD DATA INFILE. I’m migrating some data from an external source, and the Date Format is not the MySQL required YYYY-MM-DD, it was DD-MMM-YY. So how do you load this into a Date Field.

$ echo "02-FEB-07" > /tmp/t1.psv $ mysql -umysql USE test; DROP TABLE IF EXISTS t1; CREATE TABLE t1(d1 DATE); # echo "02-FEB-07" > /tmp/t1.psv LOAD DATA LOCAL INFILE '/tmp/t1.psv' INTO TABLE t1 (@var1) SET d1=STR_TO_DATE(@var1,'%d-%M-%y'); SELECT * FROM t1; EXIT

The trick is to bind the appropriate column within the file being loaded to a variable, @var1 in my example and use the SET syntax to perform a function on the variable. Rather cool.

A good tip to know.

Google Maps for Traffic Updates

April 5, 2007 by ronald

Saw a new button on Google Maps today called Traffic. Seems they are highlighting traffic congestion.

Some real life examples.
San Francisco
New York.

Below is a screen print I took today of the SF area. It was all green in Redwood City, so I had to look a little further.