Forums / Developer / Import rss as literal html

Import rss as literal html

Author Message

michael depetrillo

Tuesday 15 July 2008 10:14:42 am

Hello everyone

I need to import my rss feed as literal html.

I changed the setEZXMLAttribute method in cronjobs/rssimport.php.

The rss feed is importing OK, but when I go to front-end I do not see all the HTML tags.

If I go into back-end with editor enabled, I still do not see all the HTML tags.

If I go into back-end with editor disabled, I see all the HTML. I can then hit save and the editor and front-end will display the correct HTML.

What piece am I missing here?

The feed I am working with is - http://www.cnbc.com/id/20040302/rssCmp/97305/device/rss/rss.xml

function setEZXMLAttribute( $attribute, $attributeValue, $link = false )
{
    //include_once( 'kernel/classes/datatypes/ezxmltext/handlers/input/ezsimplifiedxmlinputparser.php' );
	
    $contentObjectID = $attribute->attribute( "contentobject_id" );
	
	// echo $attributeValue ."\n";
	
	// ADDED FOR LP
	$contentClassID = $attribute->attribute('contentclassattribute_id');
	if ($contentClassID == 206) {
		
		$inputData = "<?xml version=\"1.0\" encoding=\"utf-8\"?>\n";
		$inputData .= "<section xmlns:image=\"http://ez.no/namespaces/ezpublish3/image/\"\n";
		$inputData .= "         xmlns:xhtml=\"http://ez.no/namespaces/ezpublish3/xhtml/\"\n";
		$inputData .= "         xmlns:custom=\"http://ez.no/namespaces/ezpublish3/custom/\">\n";
		$inputData .= "<paragraph>\n<literal class=\"html\">";
		$inputData .= strip_tags($attributeValue, 			"<span><a><p><h1><h2><h3><h4><h5><ul><li><br><table><tr><td><th><tbody><tfoot><hr><img><embed><object>");
		$inputData .= "</literal></paragraph>";
		$inputData .= "</section>";

		$domString = $inputData;
			
	// END ADDED FOR LP
	} else {
		
		$parser = new eZSimplifiedXMLInputParser( $contentObjectID, false, 0, false );
	
		$attributeValue = str_replace( "\r", '', $attributeValue );
		$attributeValue = str_replace( "\n", '', $attributeValue );
		$attributeValue = str_replace( "\t", ' ', $attributeValue );
	
		$document = $parser->process( $attributeValue );
		if ( !is_object( $document ) )
		{
			$cli = eZCLI::instance();
			$cli->output( 'Error in xml parsing' );
			return;
		}
		$domString = eZXMLTextType::domString( $document );
	}
	
	// echo $domString;
	
    $attribute->setAttribute( 'data_text', $domString );
    $attribute->store();
}

Guillaume Kulakowski

Tuesday 15 July 2008 1:37:46 pm

Hello Michael,

I use eZ for a planet : http://planet.fedora-fr.org.

For that, I store RSS content in Text block.
For a valid xHTML content, I use a tidy and a cleaner parser.

You can inspirate of my code :
http://trac.llaumgui.com/browser/ez_publish/myutils/trunk/cronjobs/planet.php (look at setEZTXTAttribute)

My blog : http://www.llaumgui.com (not in eZ Publish ;-))
eZC on RHEL : http://blog.famillecollet.com/pages/Config-en
eZC on Fedora : just "yum install php-channel-ezc"

michael depetrillo

Thursday 17 July 2008 12:12:37 pm

What does the disabled editor due to the HTML before it saves it to a dom document?

Or I could ask

What does the editor due to the HTML from the dom document before it displays it?

eZ debug

Timing: Jan 18 2025 15:51:10
Script start
Timing: Jan 18 2025 15:51:10
Module start 'content'
Timing: Jan 18 2025 15:51:10
Module end 'content'
Timing: Jan 18 2025 15:51:10
Script end

Main resources:

Total runtime0.7524 sec
Peak memory usage4,096.0000 KB
Database Queries194

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0065 587.7109180.8359
Module start 'content' 0.00660.6251 768.5469548.6172
Module end 'content' 0.63170.1206 1,317.1641336.7500
Script end 0.7523  1,653.9141 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00390.5201210.0002
Check MTime0.00140.1807210.0001
Mysql Total
Database connection0.00070.093510.0007
Mysqli_queries0.687991.42681940.0035
Looping result0.00190.25891920.0000
Template Total0.723296.120.3616
Template load0.00190.258620.0010
Template processing0.721395.864120.3606
Template load and register function0.00010.019110.0001
states
state_id_array0.00060.075510.0006
state_identifier_array0.00120.161520.0006
Override
Cache load0.00160.2069240.0001
Sytem overhead
Fetch class attribute can translate value0.00110.149530.0004
Fetch class attribute name0.00110.139640.0003
XML
Image XML parsing0.00120.160730.0004
class_abstraction
Instantiating content class attribute0.00000.001040.0000
General
dbfile0.00190.2511280.0001
String conversion0.00000.001130.0000
Note: percentages do not add up to 100% because some accumulators overlap

CSS/JS files loaded with "ezjscPacker" during request:

CacheTypePacklevelSourceFiles
CSS0extension/community/design/community/stylesheets/ext/jquery.autocomplete.css
extension/community_design/design/suncana/stylesheets/scrollbars.css
extension/community_design/design/suncana/stylesheets/tabs.css
extension/community_design/design/suncana/stylesheets/roadmap.css
extension/community_design/design/suncana/stylesheets/content.css
extension/community_design/design/suncana/stylesheets/star-rating.css
extension/community_design/design/suncana/stylesheets/syntax_and_custom_tags.css
extension/community_design/design/suncana/stylesheets/buttons.css
extension/community_design/design/suncana/stylesheets/tweetbox.css
extension/community_design/design/suncana/stylesheets/jquery.fancybox-1.3.4.css
extension/bcsmoothgallery/design/standard/stylesheets/magnific-popup.css
extension/sevenx/design/simple/stylesheets/star_rating.css
extension/sevenx/design/simple/stylesheets/libs/fontawesome/css/all.min.css
extension/sevenx/design/simple/stylesheets/main.v02.css
extension/sevenx/design/simple/stylesheets/main.v02.res.css
JS0extension/ezjscore/design/standard/lib/yui/3.17.2/build/yui/yui-min.js
extension/ezjscore/design/standard/javascript/jquery-3.7.0.min.js
extension/community_design/design/suncana/javascript/jquery.ui.core.min.js
extension/community_design/design/suncana/javascript/jquery.ui.widget.min.js
extension/community_design/design/suncana/javascript/jquery.easing.1.3.js
extension/community_design/design/suncana/javascript/jquery.ui.tabs.js
extension/community_design/design/suncana/javascript/jquery.hoverIntent.min.js
extension/community_design/design/suncana/javascript/jquery.popmenu.js
extension/community_design/design/suncana/javascript/jScrollPane.js
extension/community_design/design/suncana/javascript/jquery.mousewheel.js
extension/community_design/design/suncana/javascript/jquery.cycle.all.js
extension/sevenx/design/simple/javascript/jquery.scrollTo.js
extension/community_design/design/suncana/javascript/jquery.cookie.js
extension/community_design/design/suncana/javascript/ezstarrating_jquery.js
extension/community_design/design/suncana/javascript/jquery.initboxes.js
extension/community_design/design/suncana/javascript/app.js
extension/community_design/design/suncana/javascript/twitterwidget.js
extension/community_design/design/suncana/javascript/community.js
extension/community_design/design/suncana/javascript/roadmap.js
extension/community_design/design/suncana/javascript/ez.js
extension/community_design/design/suncana/javascript/ezshareevents.js
extension/sevenx/design/simple/javascript/main.js

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
3content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
5content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
1content/datatype/view/ezxmltags/literal.tpl<No override>extension/community/design/standard/templates/content/datatype/view/ezxmltags/literal.tplEdit templateOverride template
1content/datatype/view/ezimage.tpl<No override>extension/sevenx/design/simple/templates/content/datatype/view/ezimage.tplEdit templateOverride template
2content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
1pagelayout.tpl<No override>extension/sevenx/design/simple/templates/pagelayout.tplEdit templateOverride template
 Number of times templates used: 14
 Number of unique templates used: 7

Time used to render debug report: 0.0001 secs