Content Storage - architectural Question

Author Message

Gabriel Ambuehl

Friday 18 April 2003 8:02:20 am

I've been looking at the implementation of the content module and the whole attribute stuff strikes me as being kinda inefficient by cramming everything into the same table.

Why doesn't content just create a new table for each class with fields for each of the attributes? (And a meta table to describe field much like it is now). Another field containing a readily assembled XML representation of the row would be great for caching, I suppose.

Seems to me that would be a much more efficient way to store the data AND it would make it much much easier for modules to operate on the data. It could pose a few problems for searching though (mostly because you need to search through all tables instead of one).

I mean don't get me wrong, I love the functionality of the content module, it just runs too slow. Most of the design is really elegant and easily the best open source CMS code I've ever looked on but this strikes me as rather weird. Maybe there are reasons to this I don't get so please enlighten me.

Visit http://triligon.org

Paul Borgermans

Saturday 19 April 2003 3:56:24 am

Don't worry about the database structure and performance too much: its only a very minor part of the processing (as opposed to the 2.2.x series < 2.2.8) . The ezp 3.0 database is heavily normalised as far as i can tell and according to schoolbooks, one should always do so. It also leads to the flexibility with content classes without altering table structures on the fly.

Most of the time (up to 80%) is spent processing templates. This area will be improved as you can read elsewhere.

Paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans

Paul Borgermans

Saturday 19 April 2003 4:00:22 am

>Another field containing a readily assembled XML
>representation of the row would be great for caching, I
>suppose.

This is a matter for caching actually. I guess it all boils down to a sound database structure (not that it is subject to improvements), where the performance issues are mainly addressed by the various cache levels. As mysql 4 has query caching, this will increase speed too.

Paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans

Gabriel Ambuehl

Saturday 19 April 2003 4:54:48 am

I have mixed views on normalization. The theory not to have any data more than one time is sound but in practive normalizing all the tables will very often give you bad performance as you need to do way too many queries.

When I develop databases, I normally don't care for textbook advice and instead will often have data more than one time grouped together to allow very few queries to retrieve as most apps will have orders of magnitude more accesses compared to writes which obviously get more expensive when you need to update several rows at once. In a way, normalization is opposed to caching data like you read it. In many of my apps there will be ten times more data that's simply there readily assembled for caching purposes than unique data.

I just thought it would make it much easier for other modules to use content's really nice rendering infrastructure if content could read other tables as well as they could fill/pull data from the data without going thru the complete API.

I wanted to implement a calendar for one. That would be easier to implement with one row per event instead of having a calender container group and a lot of calendar event instances.

Visit http://triligon.org

Sergiy Pushchin

Wednesday 23 April 2003 1:01:42 am

When you fetch children of node with all their attributes, if all classes have separate tables we will need for each child object send query to database. But now we can fetch objects in one query. (Actualy not one but still less then for each one). It is one smallest point.

--SP
P.S. School books are not that bad as you think :)
Chears.

Gabriel Ambuehl

Wednesday 23 April 2003 1:32:27 am

Your professors must have been better at chosing them than mine then. One of the reasons why I switched away from CS major :-(

Visit http://triligon.org

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 18 2025 15:02:15
Script start
Timing: Jan 18 2025 15:02:15
Module start 'layout'
Timing: Jan 18 2025 15:02:15
Module start 'content'
Timing: Jan 18 2025 15:02:16
Module end 'content'
Timing: Jan 18 2025 15:02:16
Script end

Main resources:

Total runtime0.8717 sec
Peak memory usage4,096.0000 KB
Database Queries68

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0045 589.1484152.6406
Module start 'layout' 0.00450.0023 741.789139.4609
Module start 'content' 0.00680.8632 781.2500615.8984
Module end 'content' 0.86990.0017 1,397.148416.1406
Script end 0.8716  1,413.2891 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00320.3649160.0002
Check MTime0.00130.1477160.0001
Mysql Total
Database connection0.00070.084810.0007
Mysqli_queries0.807592.6345680.0119
Looping result0.00070.0835660.0000
Template Total0.845297.020.4226
Template load0.00180.209320.0009
Template processing0.843396.746020.4217
Template load and register function0.00020.020110.0002
states
state_id_array0.00080.088310.0008
state_identifier_array0.00090.107720.0005
Override
Cache load0.00160.1808320.0000
Sytem overhead
Fetch class attribute can translate value0.00060.066030.0002
Fetch class attribute name0.00130.147580.0002
XML
Image XML parsing0.00310.357030.0010
class_abstraction
Instantiating content class attribute0.00000.002290.0000
General
dbfile0.00350.4070230.0002
String conversion0.00000.000740.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
6content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
7content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
3content/datatype/view/ezimage.tpl<No override>extension/sevenx/design/simple/templates/content/datatype/view/ezimage.tplEdit templateOverride template
2content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 20
 Number of unique templates used: 6

Time used to render debug report: 0.0001 secs