RSS-bug : cron-log says "Invalid RSS document"


Valentin Svelland

Tuesday 09 December 2008 1:07:16 pm

I'm experiencing some problems with my rssimport.php cronjob. The RSS feed on my site has stopped, and I've only found a couple of other reports of "Invalid RSS document" in this forum. Does anyone know the solution? Here is the cron log:

Running cronjobs/unpublish.php

Running cronjobs/rssimport.php
RSSImport Nyheter "Oppdrettstorsk": Starting.
RSSImport Nyheter "Oppdrettstorsk": Invalid RSS document.
RSSImport Fiskeribladet Fiskaren: Starting.
RSSImport Fiskeribladet Fiskaren: Object ( 891 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 892 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 890 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 889 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 888 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 887 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 885 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 886 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 882 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: Object ( 883 ) with URL:  already exists
RSSImport Fiskeribladet Fiskaren: End. 0 objects added

Running cronjobs/indexcontent.php
Starting processing pending search engine modifications
Done

Running cronjobs/hide.php

Running cronjobs/subtreeexpirycleanup.php

Running cronjobs/internal_drafts_cleanup.php
Cleaning up internal drafts...
Cleaned up 0 internal drafts

------------------------
I made eZ run on www.eigersund.kommune.no, bjerkreim.kommune.no, lund.kommune.no and sokndal.kommune.no. Municipalities should use open source!

Quoc Huy Nguyen Dinh

Saturday 01 January 2011 2:59:59 pm

OK, this thread is very old, but for those looking for the solution: I've just encountered this with eZ 4.4. The cause is that the RSS feed I'm using has the same <link> (pointing to the main website of the feed) for every item, which makes eZ think all items are the same. Each item in the feed does have a different <guid>, though, which makes sense here. So I edited cronjobs/rssimport.php and, at about line 210, swapped the two if conditions so that <guid> is preferred when it exists.
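
To illustrate the idea (this is not the actual code from rssimport.php, which is PHP), here is a minimal Python sketch of "prefer <guid>, fall back to <link>" when deciding whether a feed item has already been imported. The function names `item_key` and `new_items`, the `seen` set, and the sample feed are all hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET


def item_key(item):
    """Return the identifier for a feed item: <guid> if present and
    non-empty, otherwise <link>. Returns None if neither is usable."""
    for tag in ("guid", "link"):
        text = item.findtext(tag)
        if text and text.strip():
            return text.strip()
    return None


def new_items(rss_xml, seen):
    """Return the titles of items whose key is not yet in `seen`,
    and record their keys in `seen` (hypothetical stand-in for the
    'already exists' check in the import job)."""
    root = ET.fromstring(rss_xml)
    fresh = []
    for item in root.iter("item"):
        key = item_key(item)
        if key is None or key in seen:
            continue  # would be logged as "already exists"
        seen.add(key)
        fresh.append(item.findtext("title"))
    return fresh


# Made-up feed where every item shares the same <link>
# but carries its own <guid>.
FEED = """<rss version="2.0"><channel>
<title>Example</title><link>http://example.com/</link>
<item><title>A</title><link>http://example.com/</link><guid>a-1</guid></item>
<item><title>B</title><link>http://example.com/</link><guid>b-2</guid></item>
</channel></rss>"""

if __name__ == "__main__":
    seen = set()
    print(new_items(FEED, seen))  # both items are treated as distinct
    print(new_items(FEED, seen))  # a second run finds nothing new
```

If the key were taken from <link> first, the second item in this sample would be treated as a duplicate of the first, which matches the "already exists ... 0 objects added" pattern in the log above.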
