Indexing Binary Files - Excel and Powerpoint

Mindshare Interactive Campaigns

Wednesday 05 July 2006 12:25:25 pm

Hey everyone - terribly sorry, but we've found a bug in our script (well, sort of). If you use system() instead of passthru(), you only get back the last line of whatever file you're indexing. passthru() gives you all the output, but it also results in a segmentation fault if you try to index a copy-protected PDF file with xpdf. exec() has the same problem.
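
For anyone comparing these functions: per the PHP manual, system() and exec() both return only the last line of output, while exec() can also collect every line when it is given an array as its second argument, plus the exit status as its third. Below is a minimal sketch of that pattern. It is not the script discussed in this thread: runAndCapture() is a hypothetical helper name, the printf command is only a stand-in for a real text extractor, and it assumes PHP 7+ on a Unix-like shell. It does not fix the xpdf crash; it only makes a failed extractor visible through a non-zero exit status.

```php
<?php
// Hypothetical helper for illustration only (not from the original script).
function runAndCapture(string $command): array
{
    $lines = [];
    $status = 0;
    // exec() appends each line of the command's stdout to $lines and stores
    // the exit code in $status; its return value is just the last line.
    $lastLine = exec($command, $lines, $status);

    return ['status' => $status, 'lines' => $lines, 'last' => $lastLine];
}

// Stand-in command that prints three lines; a real call would run the
// text extraction tool on the binary file instead.
$result = runAndCapture('printf "one\ntwo\nthree\n"');
echo count($result['lines']), ' lines captured, exit status ', $result['status'], "\n";
```

Checking the status before indexing the captured lines means a crashed or failed extraction can be skipped or logged rather than indexed as empty text.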

We're working on a solution but haven't found one yet.

http://www.mindshare.net

Mindshare Interactive Campaigns

Thursday 31 August 2006 8:26:12 am

OK, we finally have a fully functional solution that is now working on both a 3.6 site and a 3.8 site, both of which have tons of binary files. You can read the full article here:

http://ez.no/community/articles/indexing_multiple_binary_file_types

http://www.mindshare.net

Andy Caiger

Tuesday 18 May 2010 1:23:08 am

Is there any more up-to-date information on how to debug binary file searching/indexing, especially indexing of PDF files?

I'm using eZ Publish 4.2 and PDF files are not being indexed, but I can't get any error information. How can I do some debugging and find out what's going on?

EAB - Integrated Internet Success
Offices in England, France & China.
http://www.eab.co.uk http://www.eab-china.com http://www.eab-france.com
