Clustering eZ Publish

Equipment and software used

This article is based on the cluster setup currently running on ez.no. There are two back-end servers running in this setup. Both are Dell PowerEdge 1U dual 2.8GHz Xeon with 2GB RAM and SCSI hard discs.

The software running on the servers is:

  • Debian GNU/Linux stable
  • Apache 1.3.33 (latest stable release at the time)
  • MySQL 4.0.18
  • PHP 4.3.10
  • APC CVS version
  • eZ Publish 3.5.1 stable

The setup contains two servers and two directors as shown on the image below. The directors route incoming requests to the correct server. We use Keepalived for our directors. Keepalived periodically polls the two servers to check their health (and was originally written for the Linux Virtual Server project).

Every eZ publish installation comes with a URL which returns the text 'eZ publish is alive' if everything is working fine. The health check URL for ez.no is http://ez.no/ezinfo/is_alive.

The second director is a backup for the main director. It will automatically take over if the main director fails. This setup provides good failover and load balancing. If you want a simpler setup to get load balancing you can use a round robin Round_robin DNS setup, however this does not provide proper failover functionality.

The servers do the actual work of serving the pages. Each server is identically configured, with one exception: only one server runs a MySQL® server. Alternately, you can use MySQL's master/slave replication or the new cluster functionality.

When we were planning our cluster, we considered using MySQL Cluster for this setup but it required too much memory: the entire cluster database, complete with indexes and all, needs to stay in RAM. We would have required more than 3GB of RAM per server just for the ez.no database. Because the cluster needs to have redundant data in memory we would need to keep one copy per server. If you have more servers this becomes more efficient since the database can be spread across more servers. So if you have a smaller database, lots of RAM or more cluster nodes, MySQL cluster could be a good alternative for you.

To make eZ Publish run well in a cluster we need to ensure that all data is properly synchronized between the servers. In eZ Publish, most data is stored in the database, however some data is also stored on the filesystem. Data stored on the filesystem includes binary files, images, compiled templates and cache files.

Important data like files and images are synchronized via rsync when changed. Cache files and easily re-generated files are deleted from the other server when it changes. To make this work we patched PHP to automatically log every file created or modified.

To make ez.no sync files between the nodes in the cluster we have made some patches to PHP and eZ Publish that automatically log any new, modified or removed files. Since eZ Publish is using Imagemagick to scale and convert images there are also patches to apply to eZ Publish in order to log the changes to images. The log file created by this patch is then parsed and used by rsync to synchronize the servers.

The script used to rsync the changed files from one server to another is available together with the patches. If you want more servers in this cluster setup then you need to alter the rsync script to sync the files to all other servers in the setup instead of just one. Currently this script is written with a master / slave setup in mind.

This cluster setup is not ideal. If a page is visited on server A which causes an image variation to be created, e.g. the first time an article is shown with an embedded image, and server B gets a hit on the same page before the image variation has been synced from server A to B then the image shown on server B will be broken. The reason for this is that the image is not found on server B at time the page is rendered and hence the cached version of the page will be wrong. The fix for this problem is simply to clear the cache for the page which has the problem image, or to re-publish the object. This problem will be solved for eZ Publish in the future.

In order to make the MySQL server run a bit faster we have have enabled the MySQL query cache and created a larger key buffer. The configuration for this is:

Once the cluster is up and running we ran some tests with using the Apache Bench (ab) benchmarking tool which comes with Apache. Some of the tests performed are shown below. These tests only measure the maximum amount of requests the cluster can handle in an ideal situation. The diagram below shows the number of pages the cluster can server per second with different combinations of servers enabled and with different concurrent users.

The detailed output from the benchmark tests are shown below.

One server, concurrency 1

One server, concurrency 2

Two servers, concurrency 5

Two servers, concurrency 40

Two servers, concurrency 20

Two servers, concurrency 18

By running eZ Publish in a clustered environment you get both failover and load balancing. From the numbers in these tests we see that this cluster with 2 back-end servers can serve a maximum of 64 pages per second. With the ez.no setup this means 753Kb/sec + the bandwidth needed for images and CSS files.

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 18 2025 03:07:32
Script start
Timing: Jan 18 2025 03:07:32
Module start 'layout'
Timing: Jan 18 2025 03:07:32
Module start 'content'
Timing: Jan 18 2025 03:07:32
Module end 'content'
Timing: Jan 18 2025 03:07:32
Script end

Main resources:

Total runtime0.0138 sec
Peak memory usage2,048.0000 KB
Database Queries3

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0061 589.0078152.6094
Module start 'layout' 0.00610.0026 741.617239.4141
Module start 'content' 0.00870.0035 781.031394.0313
Module end 'content' 0.01220.0016 875.062534.3047
Script end 0.0138  909.3672 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.002216.2356140.0002
Check MTime0.00107.3372140.0001
Mysql Total
Database connection0.00117.842110.0011
Mysqli_queries0.002719.456230.0009
Looping result0.00000.079310.0000
Template Total0.00139.710.0013
Template load0.00075.260810.0007
Template processing0.00064.394110.0006
Override
Cache load0.00053.649610.0005
General
dbfile0.00021.495780.0000
String conversion0.00000.050040.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 1
 Number of unique templates used: 1

Time used to render debug report: 0.0001 secs