eZ Find: indexing binary files

Author Message

Matthieu Sévère

Wednesday 02 December 2009 7:32:36 am

Hello community !

How do I configure eZ Find to index binary files such as PDFs? Is this related to the SolrFieldMapSettings setting and the ezfSolrDocumentFieldBase class?

It seems to map a given datatype to a raw result.

Thanks for your help :-)

--
eZ certified developer: http://ez.no/certification/verify/346216

Paul Borgermans

Thursday 03 December 2009 9:01:25 am

Indexing binary files with eZ Find requires external converters to extract plain text.

You may use for example:

http://projects.ez.no/eztika

For PDFs, configure xpdf's pdftotext for best results (a wrapper script is provided in the eztika extension).
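
To give an idea of what such an external converter does, here is a minimal Python sketch that shells out to pdftotext and returns the plain text. It is not the wrapper script shipped with eztika, only an illustration; the function names `build_pdftotext_command` and `extract_text` are made up for this example, and it assumes pdftotext is installed and on the PATH.

```python
import shutil
import subprocess
from typing import List


def build_pdftotext_command(pdf_path: str, binary: str = "pdftotext") -> List[str]:
    """Return the argv that asks pdftotext to write UTF-8 text to stdout.

    Hypothetical helper for illustration, not part of eZ Find or eztika.
    """
    # "-enc UTF-8" selects the output encoding; the trailing "-" means stdout.
    return [binary, "-enc", "UTF-8", pdf_path, "-"]


def extract_text(pdf_path: str, binary: str = "pdftotext") -> str:
    """Run pdftotext on pdf_path and return the extracted plain text."""
    # Fail early with a clear error if the converter is not installed.
    if shutil.which(binary) is None:
        raise FileNotFoundError(f"{binary} not found on PATH")
    result = subprocess.run(
        build_pdftotext_command(pdf_path, binary),
        capture_output=True,
        check=True,  # raise CalledProcessError if the conversion fails
    )
    return result.stdout.decode("utf-8", errors="replace")
```

Calling `extract_text("manual.pdf")` would return the text that then gets handed to the indexer; the file name here is only a placeholder.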

hth

Paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans
