Forums / Setup & design / Reporting - how to get custom data out of ezpub database?

Reporting - how to get custom data out of ezpub database?

Author Message

chris mol

Saturday 01 March 2008 7:49:17 pm

I work for a small company that is considering using ezpub as our enterprise content management tool. It has all the cms functionality we need and more. However, we have some concerns around ezpub's OO database and extracting data for reporting.

My company's business is to book events online. We book about 50,000 events per year, with a user base of 20,000 users that belong to 35,000 organizations that then roll up into about 50 clients. We store a lot of data, probably close to 500,000 records.

We are in talks with a local web shop to customize ezpub as our web scheduling application.

Our daily business depends heavily on the ability of the IT dept to deliver reports including all the data points listed above. We have some concerns that nearly all our custom data points will be stored in what amounts to 4-5 tables in ezpub (ezcontentclass, ezcontentobject, etc.).

Can anyone provide input regarding extracting data from the ezpub db for custom reporting (BI suites like Pentaho, MS, etc.) ? Is it even possible? If so, does it take a lot of effort in an writing ETLs to separate my data from the ezpub object data?

This is a huge issue for us and I would appreciate any input from users who have experience using the ezpub database for custom reporting.

Thanks.

Felix Laate

Sunday 02 March 2008 2:12:55 am

Hi Chris,

so the ezp-database is kind of abstract, BUT there are excellent ways to make a proxy that produces the output you need.

Say e.g. that you plan to use Pentaho. It supports many data sources, amongst them XML-based data sources. Then you could quite easily make a "view" with e.g. the layout-module that produces the XML you need. It then works pretty much like any feed.

Felix

Publlic Relations Manager
Greater Stavanger
www.greaterstavanger.com

Piotrek Karaś

Sunday 02 March 2008 3:46:30 am

That is all possible plus more - you can extend eZ Publish to handle data with eZ API rather than though presentation layer. Also, the content model is not that difficult once you've learned how eZ Publish handles content, so then you can directly pull from the DB in any way that suits you.

One question, though, to Felix and other experienced developers is: should those 500000 records be attempted to be managed with that model? Would you go for that? Or would you choose some sort of integration or extension with dedicated DB tables and interfaces? Looks like data management rather than content management project to me.

What do you think?

--
Company: mediaSELF Sp. z o.o., http://www.mediaself.pl
eZ references: http://ez.no/partners/worldwide_partners/mediaself
eZ certified developer: http://ez.no/certification/verify/272585
eZ blog: http://ez.ryba.eu

Felix Laate

Monday 03 March 2008 12:49:31 pm

Hi again,

>> should those 500000 records be attempted to be managed with that model? Would you go >> for that? Or would you choose some sort of integration or extension with dedicated DB
>> tables and interfaces? Looks like data management rather than content management
>> project to me.

Obviously, with that amount of data, a joint solution (ezp CMS + separate database) would be a good one. My suggestion (XML view of CMS-data) is not that efficient of course, but on the other side, it's quite easy to set up.

If you want more control and a more efficient solution, then I would opt for an separate extension based on the API.

Anyhow, I think the ezp-approach is a good one for projects like this. Where you have the classic needs for a CMS in combination with the need to provide access to and from just about any kind of database-systems. You need flexibility most of all. And that's, IMHO, what ezp is all about.

Felix

Publlic Relations Manager
Greater Stavanger
www.greaterstavanger.com

Björn Dieding@xrow.de

Monday 03 March 2008 4:50:25 pm

Hi,

the main problem of the 500,000 records stored in the database is the content object tree table. Due it`s architecture and design it can`t deliver certain fetches very effective ( path like 'mytree/%' ). A better model has been already developed for the eZ components.

So your keys to success are:
* Only store the least necessary data in the content object related tables
* Get a cool hardware for the db, buy a lot of ram, tweak your db(maybe your model holds the demand just by doing this)
* Get technology from the components as an early adopter.

I say get a good eZ partner and build a proof of concept with them. Since your model isn`t complex proving that it would not brake shouldn`t be expensive..

For reporting I would let the reporting tool directly access the db and create it`s reports.

If one knows it better, prove me wrong :-).

Looking for a new job? http://www.xrow.com/xrow-GmbH/Jobs
Looking for hosting? http://hostingezpublish.com
-----------------------------------------------------------------------------
GMT +01:00 Hannover, Germany
Web: http://www.xrow.com/

eZ debug

Timing: Jan 18 2025 22:21:23
Script start
Timing: Jan 18 2025 22:21:23
Module start 'content'
Timing: Jan 18 2025 22:21:24
Module end 'content'
Timing: Jan 18 2025 22:21:24
Script end

Main resources:

Total runtime1.0625 sec
Peak memory usage4,096.0000 KB
Database Queries204

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0055 589.3828180.8125
Module start 'content' 0.00550.9501 770.1953654.4375
Module end 'content' 0.95560.1069 1,424.6328341.0234
Script end 1.0625  1,765.6563 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00400.3782210.0002
Check MTime0.00160.1471210.0001
Mysql Total
Database connection0.00060.053810.0006
Mysqli_queries0.988092.98762040.0048
Looping result0.00170.16272020.0000
Template Total1.033397.220.5166
Template load0.00190.182420.0010
Template processing1.031397.065120.5157
Template load and register function0.00020.014410.0002
states
state_id_array0.00060.060910.0006
state_identifier_array0.00130.118220.0006
Override
Cache load0.00170.1601420.0000
Sytem overhead
Fetch class attribute can translate value0.00120.117650.0002
Fetch class attribute name0.00130.120780.0002
XML
Image XML parsing0.00150.144750.0003
class_abstraction
Instantiating content class attribute0.00000.001390.0000
General
dbfile0.00420.3966400.0001
String conversion0.00000.000630.0000
Note: percentages do not add up to 100% because some accumulators overlap

CSS/JS files loaded with "ezjscPacker" during request:

CacheTypePacklevelSourceFiles
CSS0extension/community/design/community/stylesheets/ext/jquery.autocomplete.css
extension/community_design/design/suncana/stylesheets/scrollbars.css
extension/community_design/design/suncana/stylesheets/tabs.css
extension/community_design/design/suncana/stylesheets/roadmap.css
extension/community_design/design/suncana/stylesheets/content.css
extension/community_design/design/suncana/stylesheets/star-rating.css
extension/community_design/design/suncana/stylesheets/syntax_and_custom_tags.css
extension/community_design/design/suncana/stylesheets/buttons.css
extension/community_design/design/suncana/stylesheets/tweetbox.css
extension/community_design/design/suncana/stylesheets/jquery.fancybox-1.3.4.css
extension/bcsmoothgallery/design/standard/stylesheets/magnific-popup.css
extension/sevenx/design/simple/stylesheets/star_rating.css
extension/sevenx/design/simple/stylesheets/libs/fontawesome/css/all.min.css
extension/sevenx/design/simple/stylesheets/main.v02.css
extension/sevenx/design/simple/stylesheets/main.v02.res.css
JS0extension/ezjscore/design/standard/lib/yui/3.17.2/build/yui/yui-min.js
extension/ezjscore/design/standard/javascript/jquery-3.7.0.min.js
extension/community_design/design/suncana/javascript/jquery.ui.core.min.js
extension/community_design/design/suncana/javascript/jquery.ui.widget.min.js
extension/community_design/design/suncana/javascript/jquery.easing.1.3.js
extension/community_design/design/suncana/javascript/jquery.ui.tabs.js
extension/community_design/design/suncana/javascript/jquery.hoverIntent.min.js
extension/community_design/design/suncana/javascript/jquery.popmenu.js
extension/community_design/design/suncana/javascript/jScrollPane.js
extension/community_design/design/suncana/javascript/jquery.mousewheel.js
extension/community_design/design/suncana/javascript/jquery.cycle.all.js
extension/sevenx/design/simple/javascript/jquery.scrollTo.js
extension/community_design/design/suncana/javascript/jquery.cookie.js
extension/community_design/design/suncana/javascript/ezstarrating_jquery.js
extension/community_design/design/suncana/javascript/jquery.initboxes.js
extension/community_design/design/suncana/javascript/app.js
extension/community_design/design/suncana/javascript/twitterwidget.js
extension/community_design/design/suncana/javascript/community.js
extension/community_design/design/suncana/javascript/roadmap.js
extension/community_design/design/suncana/javascript/ez.js
extension/community_design/design/suncana/javascript/ezshareevents.js
extension/sevenx/design/simple/javascript/main.js

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
5content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
7content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
4content/datatype/view/ezimage.tpl<No override>extension/sevenx/design/simple/templates/content/datatype/view/ezimage.tplEdit templateOverride template
2content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
1pagelayout.tpl<No override>extension/sevenx/design/simple/templates/pagelayout.tplEdit templateOverride template
 Number of times templates used: 20
 Number of unique templates used: 6

Time used to render debug report: 0.0001 secs