Cannot extract text from openoffice documents via apache


Massimo Sanna

Tuesday 11 September 2007 3:28:53 am

Hi there,
I'm implementing a website which should feature around 13 GB of content in PDF, DOC and ODT format.
To index all this content, I'm using pdftotext and OpenOffice from the command line, with eZ Publish 3.10rc1 and eZ Find 1.0beta3. A custom parser launches a small Python script that simply writes the content of every readable OpenOffice document to stdout.
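
For readers unfamiliar with this kind of helper, here is a minimal sketch of a script that prints the text of an ODT file. It is not the script used in this setup, which drives OpenOffice itself; as an assumption for illustration, it reads content.xml directly from the ODT zip archive with the Python standard library, so it only covers ODT and ignores details such as footnotes or tables. The name odt_to_text is hypothetical.

```python
import sys
import zipfile
import xml.etree.ElementTree as ET

# Namespace used by OpenDocument for text elements.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
BLOCK_TAGS = {"{%s}p" % TEXT_NS, "{%s}h" % TEXT_NS}


def odt_to_text(path):
    """Return paragraph and heading text of an ODT file, one block per line."""
    # An ODT file is a zip archive; the document body lives in content.xml.
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    lines = []
    for element in root.iter():
        if element.tag in BLOCK_TAGS:
            # itertext() also collects text inside nested spans and links.
            text = "".join(element.itertext()).strip()
            if text:
                lines.append(text)
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical usage: python odt_to_text.py file1.odt file2.odt
    for name in sys.argv[1:]:
        print(odt_to_text(name))
```

Because a script like this never starts OpenOffice, it would not depend on the permissions OpenOffice needs at startup, at the cost of handling only one format.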

I'm able to run the php-cli scripts to reindex the website as root, but when I upload a new file to the website I get the following error in the index log file:

09/11/2007 [10:26]  filename: var/ezwebin_site/storage/original/application/b9b6c03340525f50463500b08883e1b2.odt
09/11/2007 [10:26]  creation of executable memory area failed: Permission denied
Error (<class uno.com.sun.star.uno.RuntimeException at 0xb7f3e0bc>) :exception type not found: bad_allocpure virtual method called
St9type_infoSt8bad_castSt10bad_typeidN10__cxxabiv117__class_type_infoEN10 __cxxabiv120__si_class_type_infoEN10__cxxabiv121__vmi_class_type_ infoEPKePeePKdPddPKfPffPKyPyyPKxPxxPKmPmmPKlPllPKjPjjPKiPiiPKtPttPKsP ssPKhPhhPKaPaaPKcPccPKwPwwPKbPbbPKvPvvN10__cxxabiv123__fundamental _type_infoEN10__cxxabiv117__array_type_infoEN10__cxxabiv120__function_type_ infoEN10__cxxabiv116__enum_type_infoEN10__cxxabiv1.__pbase_type_info

I tried logging in as apache on the server, and in fact launching the same script from the command line gave me the same error. After setting full ownership and write permissions on /var/www, I was able to run the Python script from the command line, but it still fails with the same error when eZ Publish launches it.
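
Since the script works from a shell but fails under eZ Publish, one way to narrow this down is to record the environment the script actually runs in and compare the two cases. The sketch below is only an assumption about what is worth checking (effective user id, HOME, temp directory, working directory); it does not claim to identify the cause of the error above. The function name describe_runtime is hypothetical.

```python
import os
import sys
import tempfile


def describe_runtime():
    """Collect facts about the process environment that often differ
    between an interactive shell and a process started by the web server."""
    home = os.environ.get("HOME")
    tmp_dir = tempfile.gettempdir()
    return {
        "euid": os.geteuid(),
        "home": home,
        # HOME may be unset under a web server; treat that as not writable.
        "home_writable": bool(home) and os.access(home, os.W_OK),
        "tmp_dir": tmp_dir,
        "tmp_writable": os.access(tmp_dir, os.W_OK),
        "cwd": os.getcwd(),
    }


if __name__ == "__main__":
    # Write to stderr so the report does not mix with text sent to stdout.
    for key, value in sorted(describe_runtime().items()):
        sys.stderr.write("%s=%r\n" % (key, value))
```

Calling this at the top of the extraction script, once from the shell and once through an upload, would show whether the two runs really see the same user, HOME and temp directory.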

Does anybody have any ideas on how to fix the permissions? I don't know where else to look :-(
Max
