eZFind facets problem with ezselection datatype indexed as text

Author Message

Bartek Modzelewski

Wednesday 13 January 2010 12:18:42 am

Hi,

We were struggling with strange problem with facets on ezselection attributes. Options were indexed properly, but in factes some of available options in part "Refine with facets" had cut last letter, but only if the last letter was "e":

$search_extras.facet_fields:
array(1) {
  [0]=>
  array(5) {
    ["field"]=>
    string(12) "facet_fields"
    ["count"]=>
    int(3)
    ["queryLimit"]=>
    array(3) {
      ["homm"]=>
      string(18) "attr_gender_t:homm"
      ["enfant"]=>
      string(20) "attr_gender_t:enfant"
      ["femm"]=>
      string(18) "attr_gender_t:femm"
    }
    ["nameList"]=>
    array(3) {
      ["homm"]=>
      string(4) "homm"
      ["enfant"]=>
      string(6) "enfant"
      ["femm"]=>
      string(4) "femm"
    }
    ["countList"]=>
    array(3) {
      ["homm"]=>
      int(3)
      ["enfant"]=>
      int(1)
      ["femm"]=>
      int(1)
    }
  }
}

Why ?! Of course options should be as in content class: "Homme", "Enfant" & "Femme".
Second our problem was that option names were lowercase, without any spaces or special chars like slash.

Finally we have added into ezfind.ini [SoldFieldMapSettings]:

DatatypeMap[ezselection]=string

so string is used instead of default text.

Can you just confirm me that it's the only solution? There is no need to be afraid of any sideeffect ?

Thanks

Baobaz
http://www.baobaz.com

Paul Borgermans

Wednesday 13 January 2010 7:34:13 am

Mapping ezselection to string is the right solution with eZ Find 2.1

Declaring it as text means it is subjected to transformations like stemming (which removes the "s" at the end) and tokenizing

Paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans

H-Works Agency

Tuesday 26 January 2010 3:14:13 am

I am having a hard time with faceting on a ezselection attribute with multiple select.

I want to return possible "ezselection" values with number of results in each :

  • Should i use ezfind facets ?
  • Should i use a foreach on attributes values ?

The problem when i use facets is that it returns every combination of values.

For example :

I have those ezselection attribute values (with multi-selection enabled) : funk, rock, electro...etc

I want my facet query to return this :

  • funk (2)
  • electro (3)
  • rock (3)
  • ...etc.

Even if some object have funk + electro selected i don't want to have facets like :

  • funk(2)
  • electro (3)
  • funk electro (1)
  • electro rock (1)
  • ...etc

Which is what is happening now when using facets with ezselection mapped to 'string'. It show all combination a values used.

Does anyone has a clue ? Thanks in advance.

EZP is Great

Bartek Modzelewski

Wednesday 27 January 2010 5:32:08 am

Hi Martin,

I thought that we won't need multiple select in our project, but finally we need and we've got exactly the same problem. No solution for now.

Baobaz
http://www.baobaz.com

Paul Borgermans

Wednesday 27 January 2010 6:22:05 am

Hi

The problem is actually in the ezselection datatype metaData() implementation, which conctenates the multiple selections

I'll see if I can get this fixed in the kernel, if not a new datatype handler in ez find is needed

You can file a bug on this one in the ez find issue tracker

Paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans

H-Works Agency

Wednesday 27 January 2010 10:31:33 am

Ok Paul thank you a lot.

Until a fix i will loop class attribute values and won't display number of results.

EZP is Great

Matthieu Sévère

Monday 08 March 2010 7:16:29 am

Hello Paul,

I am having exactly the same problem. I didn't see any commit about that an the kernel and in ezfind. Can you tell me if this issue is resolved ?

Thanks !!

--
eZ certified developer: http://ez.no/certification/verify/346216

Sander van den Akker

Tuesday 14 June 2011 12:13:13 pm

Any updates on this?

eZ Publish certified developer
http://auth.ez.no/certification/verify/392313

Gaetano Giunta

Tuesday 14 June 2011 1:17:08 pm

Gonna post soon an alternate ezselection indexer that allows proper faceting. It should also help with non-applying-text-transforms, even though I did nit test it for that. Stay tuned...

Principal Consultant International Business
Member of the Community Project Board

Sander van den Akker

Wednesday 15 June 2011 1:22:24 am

Great!

eZ Publish certified developer
http://auth.ez.no/certification/verify/392313

Jordan Hirsch

Tuesday 12 July 2011 3:13:51 pm

Gaetano,

Please count me as another enthusiastic vote for a new ezselection indexer.  I'm also stuck between having truncated lower-case terms (as facets) when I index my ezselections as text fields, or having multiple selections crammed together into 1 string when I index them as strings.

Really looking forward to getting around this issue!

Has anyone posted a bug about this?  And/or was there a kernel fix released?

Me: http://jordan.teamhirsch.com
My blog: http://wiredformusic.blogspot.com
My other company: http://thinkimprov.com
eZ Certification: http://auth.ez.no/certification/verify/402488
eZ Award: http://ez.no/company/news/ez_awards_2007_prize_winners

Jordan Hirsch

Thursday 14 July 2011 11:50:27 am

We have a solution!  My talented co-worker David Sayre did the majority of the work on this, and we're happy to share it with everyone.

Basically we created a new class that handles the indexing of ezselection datatype attributes, and passes their values as an array to Solr rather than a string with spaces in it.

Wasn't sure of the best way to put this out there other than pasting it here, so please let me know if there's a better one.

NB:  I had to completely blow away the contents of /usr/local/solr/java/solr/data/index/ before this would take affect, but YMMV.

To implement:

  1. Add the following to your ezfind.ini override file:
  2. Add this file to your custom extension's "classes" folder as ezfsolrdocumentfieldselection.php
 <?php
//
//
// ## BEGIN COPYRIGHT, LICENSE AND WARRANTY NOTICE ##
// SOFTWARE NAME: eZ Find
// SOFTWARE RELEASE: 2.3.0
// COPYRIGHT NOTICE: Copyright (C) 1999-2010 eZ Systems AS
// SOFTWARE LICENSE: GNU General Public License v2.0
// NOTICE: >
//   This program is free software; you can redistribute it and/or
//   modify it under the terms of version 2.0  of the GNU General
//   Public License as published by the Free Software Foundation.
//
//   This program is distributed in the hope that it will be useful,
//   but WITHOUT ANY WARRANTY; without even the implied warranty of
//   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
//   GNU General Public License for more details.
//
//   You should have received a copy of version 2.0 of the GNU General
//   Public License along with this program; if not, write to the Free
//   Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston,
//   MA 02110-1301, USA.
//
//
// ## END COPYRIGHT, LICENSE AND WARRANTY NOTICE ##
//

/**
 * Based off of the ezfSolrDocumentFieldDummyExample class.
 *
 * This class addresses an issue with indexing the ezselection datatype
 * for multi-select fields.
 *
 * By default, ezselection fields are stored as "text" fields in Solr.  However, this leads to undesired transformation
 * of the values, e.g. "Europe" gets stored in Solr as "europ".
 *
 * A workaround for this is to tell Solr to store ezselection data in "string" fields.  This works fine for single-select
 * ezselection fields.  However, an issue arises with multi-select ezselection fields, namely the values get strung together,
 * e.g. "Europe USA" would get stored for an item with "Europe" and "USA" both selected.
 * This is a result of the implementation of the "metaData" function in kernel/classes/datatypes/ezselection/ezselectiontype.php.
 *
 * This class gets around that problem by passing an array() of values to Solr rather than a space-joined string for multiple selections.
 *
 * ======================
 * To tell eZ Find to use this class:
 *
 * 1)  Place this file in extension/[your_extension]/classes/
 * 2)  Add the following to your ezfind.ini override:
 *        [SolrFieldMapSettings]
 *        CustomMap[ezselection]=ezfSolrDocumentFieldSelection
 *
 * ======================
 *
 * More information:  http://share.ez.no/forums/extensions/ez-find/ezfind-facets-problem-with-ezselection-datatype-indexed-as-text/
 * ======================
 * @author David Sayre, Jordan Hirsch via Beaconfire - http://www.beaconfire.com
**/

class ezfSolrDocumentFieldSelection extends ezfSolrDocumentFieldBase
{
    /**
     * Contains the definition of subattributes for this given datatype.
     * This associative array takes as key the name of the field, and as value
     * the type. The type must be picked amongst the value present as keys in the
     * following array :
     * ezfSolrDocumentFieldName::$FieldTypeMap
     *
     * WARNING : this definition *must* contain the default attribute's one as well.
     *
     * @see ezfSolrDocumentFieldName::$FieldTypeMap
     * @var array
     */
    public static $subattributesDefinition = array( self::DEFAULT_SUBATTRIBUTE => 'string' );
    // public static $subattributesDefinition = array( self::DEFAULT_SUBATTRIBUTE => 'text',
                                                    //self::DEFAULT_SUBATTRIBUTE => 'string' );

    /**
     * The name of the default subattribute. It will be used when
     * this field is requested with no subfield refinement.
     *
     * @see ezfSolrDocumentFieldSelection::$subattributesDefinition
     * @var string
     */
    const DEFAULT_SUBATTRIBUTE = 'data_type_string';

    /**
     * @see ezfSolrDocumentFieldBase::__construct()
     */
    function __construct( eZContentObjectAttribute $attribute )
    {
        parent::__construct( $attribute );
    }

    /**
     * @see ezfSolrDocumentFieldBase::getData()
     */
    public function getData()
    {
        // Extract data from the attribute, and format it as described in the doc link above.
        $data = array();
        $contentClassAttribute = $this->ContentObjectAttribute->attribute( 'contentclass_attribute' ); //get class attribute
        $metaData = array();
        $classAttributeContent = self::getSelectClassContent($contentClassAttribute); //parse selection options into array
        $selectedValue = $idString = explode( '-', $this->ContentObjectAttribute->attribute( 'data_text' ) );

       if ( count( $selectedValue ) > 0)
           {
            $count = 0;
               $optionArray = $classAttributeContent['options'];
               foreach ( $selectedValue as $id )
               {
                   foreach ( $optionArray as $option )
                   {
                       $optionID = $option['id'];
                       if ( $optionID == $id )
                           $metaData[] = $option['name'];
                   }
               }
        }

        //failsafe
        if( !is_array( $metaData ) ) $metaData = array( $metaData );

        $fieldName = self::getFieldName( $contentClassAttribute, self::DEFAULT_SUBATTRIBUTE );
        $data[$fieldName] = $metaData;
        ob_flush();
        return $data;
    }

    /**
     * @see ezfSolrDocumentFieldBase::getFieldName()
     */
    public static function getFieldName( eZContentClassAttribute $classAttribute, $subAttribute = null )
    {
        if ( $subAttribute and
             $subAttribute !== '' and
             array_key_exists( $subAttribute, self::$subattributesDefinition ) and
             $subAttribute != self::DEFAULT_SUBATTRIBUTE )
        {
            // A subattribute was passed
            return parent::generateSubattributeFieldName( $classAttribute,
                                                          $subAttribute,
                                                          self::$subattributesDefinition[$subAttribute] );
        }
        else
        {
            // return the default field name here.
            return parent::generateAttributeFieldName( $classAttribute,
                                                       self::$subattributesDefinition[self::DEFAULT_SUBATTRIBUTE] );
        }
    }

    /**
     * @see ezfSolrDocumentFieldBase::getFieldNameList()
     */
    public static function getFieldNameList( eZContentClassAttribute $classAttribute, $exclusiveTypeFilter = array() )
    {
        // Generate the list of subfield names.
        $subfields = array();

        //   Handle first the default subattribute
        $subattributesDefinition = self::$subattributesDefinition;
        if ( !in_array( $subattributesDefinition[self::DEFAULT_SUBATTRIBUTE], $exclusiveTypeFilter ) )
        {
            $subfields[] = parent::generateAttributeFieldName( $classAttribute, $subattributesDefinition[self::DEFAULT_SUBATTRIBUTE] );
        }
        unset( $subattributesDefinition[self::DEFAULT_SUBATTRIBUTE] );

        //   Then hanlde all other subattributes
        foreach ( $subattributesDefinition as $name => $type )
        {
            if ( empty( $exclusiveTypeFilter ) or !in_array( $type, $exclusiveTypeFilter ) )
            {
                $subfields[] = parent::generateSubattributeFieldName( $classAttribute, $name, $type );
            }
        }
        return $subfields;
    }

     /*!
      * Copy from kernel/datatype/ezselection/ezselection.php
     * Returns the content data for the given content class attribute.
     * //do NOT collide function or variables with existing
    */
    static function getSelectClassContent(eZContentClassAttribute $selectClassAttrib )
    {
        $dom = new DOMDocument( '1.0', 'utf-8' );
        $xmlString = $selectClassAttrib->attribute( 'data_text5' );
        $optionArray = array();
        if ( $xmlString != '' )
        {
            $success = $dom->loadXML( $xmlString );
            if ( $success )
            {
                $options = $dom->getElementsByTagName( 'option' );

                foreach ( $options as $optionNode )
                {
                    $optionArray[] = array( 'id' => $optionNode->getAttribute( 'id' ),
                                            'name' => $optionNode->getAttribute( 'name' ) );
                }
            }
        }

        if ( count( $optionArray ) == 0 )
        {
            $optionArray[] = array( 'id' => 0,
                                    'name' => '' );
        }
        $attrValue = array( 'options' => $optionArray,
                            'is_multiselect' => $selectClassAttrib->attribute( 'data_int1' ) );
        return $attrValue;
    }


    /**
     * @see ezfSolrDocumentFieldBase::getClassAttributeType()
     */
    static function getClassAttributeType( eZContentClassAttribute $classAttribute, $subAttribute = null )
    {
        if ( $subAttribute and
             $subAttribute !== '' and
             array_key_exists( $subAttribute, self::$subattributesDefinition ) )
        {
            // If a subattribute's type is being explicitly requested :
            return self::$subattributesDefinition[$subAttribute];
        }
        else
        {
            // If no subattribute is passed, return the default subattribute's type :
            return self::$subattributesDefinition[self::DEFAULT_SUBATTRIBUTE];
        }
    }
}
?>

Me: http://jordan.teamhirsch.com
My blog: http://wiredformusic.blogspot.com
My other company: http://thinkimprov.com
eZ Certification: http://auth.ez.no/certification/verify/402488
eZ Award: http://ez.no/company/news/ez_awards_2007_prize_winners

Gaetano Giunta

Monday 18 July 2011 6:48:38 am

PS: seems I missed to post my solution to this thread. You can find it at: http://share.ez.no/forums/extensions/ez-find/facets-and-ezselection

Principal Consultant International Business
Member of the Community Project Board

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 18 2025 02:17:27
Script start
Timing: Jan 18 2025 02:17:27
Module start 'layout'
Timing: Jan 18 2025 02:17:27
Module start 'content'
Timing: Jan 18 2025 02:17:28
Module end 'content'
Timing: Jan 18 2025 02:17:28
Script end

Main resources:

Total runtime1.2532 sec
Peak memory usage4,096.0000 KB
Database Queries99

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0104 588.4219152.6875
Module start 'layout' 0.01040.0047 741.109439.5234
Module start 'content' 0.01511.2365 780.6328925.3203
Module end 'content' 1.25160.0016 1,705.953140.0703
Script end 1.2532  1,746.0234 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00370.2977160.0002
Check MTime0.00140.1101160.0001
Mysql Total
Database connection0.00150.118510.0015
Mysqli_queries0.717157.2227990.0072
Looping result0.00150.1174970.0000
Template Total0.790563.120.3952
Template load0.00210.165620.0010
Template processing0.788462.908120.3942
Template load and register function0.00010.009610.0001
states
state_id_array0.00090.072410.0009
state_identifier_array0.00230.187420.0012
Override
Cache load0.00210.1659940.0000
Sytem overhead
Fetch class attribute can translate value0.00060.046370.0001
Fetch class attribute name0.00160.1291190.0001
XML
Image XML parsing0.00390.309270.0006
class_abstraction
Instantiating content class attribute0.00010.0051240.0000
General
dbfile0.00190.1488490.0000
String conversion0.00000.001040.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
13content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
19content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
3content/datatype/view/ezxmltags/literal.tpl<No override>extension/community/design/standard/templates/content/datatype/view/ezxmltags/literal.tplEdit templateOverride template
1content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
2content/datatype/view/ezxmltags/emphasize.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/emphasize.tplEdit templateOverride template
11content/datatype/view/ezimage.tpl<No override>extension/sevenx/design/simple/templates/content/datatype/view/ezimage.tplEdit templateOverride template
4content/datatype/view/ezxmltags/li.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/li.tplEdit templateOverride template
3content/datatype/view/ezxmltags/ul.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/ul.tplEdit templateOverride template
1content/datatype/view/ezxmltags/ol.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/ol.tplEdit templateOverride template
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 59
 Number of unique templates used: 11

Time used to render debug report: 0.0001 secs