Indexing any other web site


Kevin Gaudin

Thursday 05 June 2008 1:21:18 am

Is eZ Find capable of indexing other web sites, i.e. non-eZ Publish sites?

This would be useful for a real enterprise search engine, since we can't always rely on an eZ Publish-only strategy.

Twitter: @kevingaudin

Ivo Lukac

Tuesday 17 June 2008 8:06:00 am

AFAIK it is not possible, at least not without extra Java coding against Solr.

For cases like this you can try Nutch: http://lucene.apache.org/nutch/ . It is a nice tool for indexing web sites, and it can be integrated with eZ Publish through a simple custom search operator.
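As a rough illustration of what such an integration has to do, here is a minimal Python sketch that builds a Solr select query and reads the hits back. It assumes Nutch has pushed its crawl into a Solr core; the base URL, the core name "nutch" and the field names "url" and "title" are assumptions for the example, not something eZ Find or Nutch guarantees in your setup. A custom search operator in eZ Publish would do the equivalent in PHP.

```python
import json
from urllib.parse import urlencode


def build_select_url(base_url, query, rows=10):
    """Build a Solr /select URL for a full-text query.

    base_url is the (assumed) Solr core URL that Nutch indexes into,
    e.g. "http://localhost:8983/solr/nutch".
    """
    params = {
        "q": query,          # user search terms
        "rows": rows,        # maximum number of hits to return
        "wt": "json",        # ask Solr for a JSON response
        "fl": "url,title",   # assumed field names in the index
    }
    return f"{base_url.rstrip('/')}/select?{urlencode(params)}"


def parse_hits(response_text):
    """Extract (title, url) pairs from a Solr JSON response body."""
    docs = json.loads(response_text)["response"]["docs"]
    return [(doc.get("title", ""), doc.get("url", "")) for doc in docs]
```

Fetching the URL returned by build_select_url (for example with urllib.request.urlopen) and passing the response body to parse_hits gives a list of results that a template could then render next to the regular eZ Find hits.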

Greetz

http://www.linkedin.com/in/ivolukac
http://www.netgen.hr/eng/blog
http://twitter.com/ilukac

