Forums / Extensions / eZ Find / Indexing external database content

Indexing external database content

Author Message

Kedar Deshpande

Friday 14 August 2009 12:07:08 pm

Moving the post into the right forum section.

I am trying to index content from an external database and see the updatesearchindexsolr.php file has fields that allow connections to a DB but filling these in dont seem to work at all.

array( 'db-host' => "localhost",
         'db-user' => "root",
         'db-password' => "root",
         'db-database' => "cccms",
         'db-driver' => "mysql",
         'db-type' => "mysql",
         ...

Could someone let me know if indeed this is the way to provide the parameters for external DB indexing and also how to specify which tables within the external DB to index.

thanks.

Gaetano Giunta

Friday 14 August 2009 2:00:15 pm

Is this an external eZ Publish database that you want to index or an altogether different database from another application?

Principal Consultant International Business
Member of the Community Project Board

Jean-Bernard Valentaten

Tuesday 08 September 2009 5:26:00 am

Hi there,
I'm having exactly the same request as the original poster but cannot seem to find a solution to the problem.
I have an eZPublish based website with loads of content that is being perfectly indexed by eZFind.
Additionally I have two external datasources (databases) containing arbitrary data (non eZPublish-structured) that need to be indexed by eZFind and should be searchable in the eZPublish-based website.
So far I have already written extensions that provide fetch-operators for the external data. That step wasn't too hard to take, but how can I get that data indexed?
The eZFind overview page (http://ez.no/ezfind) states that it is possible to index external data, wherever it may come from, but I can't find any documentation about that aspect.
Does anyone have had experience considering external data?
Any hint is appreciated :)

/* No comment */

Carlos Revillo

Tuesday 08 September 2009 5:40:35 am

As a starting point, i would suggest this patch
http://issues.ez.no/IssueView.php?Id=15222&activeItem=1.
But i think it will only work if there is some kind of relation between your content objects and the data of the other database...

On the other hand, and please correct me if i'm wrong, i don't think that eZ Find can index external database if this database is not an ez one.

If you see the code at updatesearchindexsolr.php script, you will see that it starts by fetching eZ nodes for get attributes content and send them to the index.

But, if you have something like.
- and ezpublish database with their nodes and objects
- another database
- some kinda relation that tells you that your object_id 1000 needs data from some external dabase,

then you can index this data via the above patch.

Relating to doc, if you read carrefully, it says

eZ Find can not only search multiple eZ Publish sites, but external sites as well. The search index can be populated with information from all different types of websites using external plugins and XML importers

This is more logical. You can populate data to the index usign a custom php code, or maybe a developed module if your other site is Joomla, Drupal, OpenCMS... whatever.

Maybe the title of the article should be changed to avoid confussions...

Then, with ezfind you can get those results and know where they came from...

Best Regards.

Jean-Bernard Valentaten

Tuesday 08 September 2009 6:11:08 am

Hi Carlos,

thank you for the quick response. As you've already stated, using that patch I'll need a relation between the external data and some sort of content object. As far as I could see in the documentation of the patch that relation would be a specific content class, which I do not have as the data in question is totally outside of eZPublish, making it virtually impossible to trigger the virtual attribute the patch brings along. On the other hand I might just create dummy classes with the sole purpose of triggering the virtual attributes.
I'll try that one but if anyone has another idea, please keep posting :)

/* No comment */

Carlos Revillo

Tuesday 08 September 2009 6:21:12 am

Is the other database used by a CMS like Drupal or something?

If is not the case, maybe you can give a try to Zend Lucene Search [1] or usign Search component from eZ Components [2]

[1] http://ganeshhs.com/zend-framework/zend-search-lucene-part1
[2] http://www.ezcomponents.org/docs/api/trunk/introduction_Search.html

Jean-Bernard Valentaten

Tuesday 08 September 2009 7:13:00 am

Hi Carlos,

you just saved my day, thanks alot mate :)
The eZ Components Search is the winner, exactly what I was looking for *yay*

Cheers,
Jean

/* No comment */

Carlos Revillo

Tuesday 08 September 2009 7:26:37 am

Glad it helped. Good luck :).

eZ debug

Timing: Jan 18 2025 02:45:36
Script start
Timing: Jan 18 2025 02:45:36
Module start 'content'
Timing: Jan 18 2025 02:45:36
Module end 'content'
Timing: Jan 18 2025 02:45:36
Script end

Main resources:

Total runtime0.1895 sec
Peak memory usage2,048.0000 KB
Database Queries141

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0089 588.9766180.7734
Module start 'content' 0.00890.0063 769.7500103.1016
Module end 'content' 0.01530.1742 872.8516532.3750
Script end 0.1894  1,405.2266 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00371.9390200.0002
Check MTime0.00160.8363200.0001
Mysql Total
Database connection0.00060.290210.0006
Mysqli_queries0.138473.03801410.0010
Looping result0.00120.64171390.0000
Template Total0.173791.710.1737
Template load0.00080.438410.0008
Template processing0.172991.235710.1729
Override
Cache load0.00050.283310.0005
Sytem overhead
Fetch class attribute can translate value0.00110.588710.0011
XML
Image XML parsing0.00030.161410.0003
General
dbfile0.00472.4999200.0002
String conversion0.00000.004730.0000
Note: percentages do not add up to 100% because some accumulators overlap

CSS/JS files loaded with "ezjscPacker" during request:

CacheTypePacklevelSourceFiles
CSS0extension/community/design/community/stylesheets/ext/jquery.autocomplete.css
extension/community_design/design/suncana/stylesheets/scrollbars.css
extension/community_design/design/suncana/stylesheets/tabs.css
extension/community_design/design/suncana/stylesheets/roadmap.css
extension/community_design/design/suncana/stylesheets/content.css
extension/community_design/design/suncana/stylesheets/star-rating.css
extension/community_design/design/suncana/stylesheets/syntax_and_custom_tags.css
extension/community_design/design/suncana/stylesheets/buttons.css
extension/community_design/design/suncana/stylesheets/tweetbox.css
extension/community_design/design/suncana/stylesheets/jquery.fancybox-1.3.4.css
extension/bcsmoothgallery/design/standard/stylesheets/magnific-popup.css
extension/sevenx/design/simple/stylesheets/star_rating.css
extension/sevenx/design/simple/stylesheets/libs/fontawesome/css/all.min.css
extension/sevenx/design/simple/stylesheets/main.v02.css
extension/sevenx/design/simple/stylesheets/main.v02.res.css
JS0extension/ezjscore/design/standard/lib/yui/3.17.2/build/yui/yui-min.js
extension/ezjscore/design/standard/javascript/jquery-3.7.0.min.js
extension/community_design/design/suncana/javascript/jquery.ui.core.min.js
extension/community_design/design/suncana/javascript/jquery.ui.widget.min.js
extension/community_design/design/suncana/javascript/jquery.easing.1.3.js
extension/community_design/design/suncana/javascript/jquery.ui.tabs.js
extension/community_design/design/suncana/javascript/jquery.hoverIntent.min.js
extension/community_design/design/suncana/javascript/jquery.popmenu.js
extension/community_design/design/suncana/javascript/jScrollPane.js
extension/community_design/design/suncana/javascript/jquery.mousewheel.js
extension/community_design/design/suncana/javascript/jquery.cycle.all.js
extension/sevenx/design/simple/javascript/jquery.scrollTo.js
extension/community_design/design/suncana/javascript/jquery.cookie.js
extension/community_design/design/suncana/javascript/ezstarrating_jquery.js
extension/community_design/design/suncana/javascript/jquery.initboxes.js
extension/community_design/design/suncana/javascript/app.js
extension/community_design/design/suncana/javascript/twitterwidget.js
extension/community_design/design/suncana/javascript/community.js
extension/community_design/design/suncana/javascript/roadmap.js
extension/community_design/design/suncana/javascript/ez.js
extension/community_design/design/suncana/javascript/ezshareevents.js
extension/sevenx/design/simple/javascript/main.js

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1pagelayout.tpl<No override>extension/sevenx/design/simple/templates/pagelayout.tplEdit templateOverride template
 Number of times templates used: 1
 Number of unique templates used: 1

Time used to render debug report: 0.0002 secs