Rss feeds with malformed character '&'

Author Message

Lucie Foulhac

Monday 06 July 2009 12:56:18 am

Hi,

I use eZ Find to index rss feeds with Solr DataImportHandler. Everything goes well during the full import :

http://<host>:<port>/solr/dataimport?command=full-import

But if a rss feed is malformed with the caracter '&' instead '&amp;', I have an error and none of my rss feeds are indexed.

The error returned is :

"The entity name must immediately follow the '&' in the entity reference"

I saw that the 1.4 version of Solr will introduce a control on error behavior (abort, skip, continue) : http://issues.apache.org/jira/browse/SOLR-842

Does anybody ever had this problem and how to solve it?

Thanks,
Lucie

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 18 2025 10:33:12
Script start
Timing: Jan 18 2025 10:33:12
Module start 'layout'
Timing: Jan 18 2025 10:33:12
Module start 'content'
Timing: Jan 18 2025 10:33:13
Module end 'content'
Timing: Jan 18 2025 10:33:13
Script end

Main resources:

Total runtime0.6611 sec
Peak memory usage4,096.0000 KB
Database Queries48

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0065 589.1953152.6406
Module start 'layout' 0.00650.0035 741.835939.4766
Module start 'content' 0.01000.6496 781.3125402.5625
Module end 'content' 0.65960.0014 1,183.87508.1328
Script end 0.6610  1,192.0078 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00320.4881160.0002
Check MTime0.00130.1978160.0001
Mysql Total
Database connection0.00090.133610.0009
Mysqli_queries0.624194.4125480.0130
Looping result0.00040.0673460.0000
Template Total0.621994.120.3109
Template load0.00200.308320.0010
Template processing0.619893.763120.3099
Template load and register function0.00010.022710.0001
states
state_id_array0.00160.248710.0016
state_identifier_array0.00060.097720.0003
Override
Cache load0.00160.2453130.0001
Sytem overhead
Fetch class attribute can translate value0.00070.101410.0007
Fetch class attribute name0.00050.075910.0005
XML
Image XML parsing0.00010.015710.0001
class_abstraction
Instantiating content class attribute0.00000.000810.0000
General
dbfile0.00050.0807100.0001
String conversion0.00000.001040.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
1content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
2content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
1content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 6
 Number of unique templates used: 5

Time used to render debug report: 0.0001 secs