Find duplicate items in an array (how to)

Author Message

Jorge estévez

Thursday 11 September 2008 5:24:21 am

Hello,

I have this bunch of products with each one of the having a unique code (text line), I would like to search all of the products and see if the unique id repeats itself

So, I should:

Get all products
  Add eah ID to an array
  Search the array to find if there is a duplicate ID item in it

How do I "Search the array to find if there is a duplicate item in it", this could be a slow programming algorithm..

Any hints,

Thanks

Diseño Web Cuba
Web Design Cuba
www.elfosdesign.com

Piotrek Karaś

Thursday 11 September 2008 8:53:44 am

If you worked directly with MySQL, you could do with something like this:

SELECT COUNT(real_id), real_id, duplicate_id 
FROM table 
GROUP BY duplicate_id

and probably even limit the results to only duplicates.

In PHP, I'm not sure if it is fast, but should work:

<?php
$IDArray = array(...); // here are all the IDs, including duplicates
$IDCount = array_count_values( $IDArray );
$duplicateArray = array();
foreach( $IDCount as $id => $count )
{
    if( $count > 1 )
    {
        array_push( $duplicateArray, $id );
    }
}
$duplicateArray = ... // these are duplicate ids...
?>

http://pl.php.net/manual/pl/function.array-count-values.php

--
Company: mediaSELF Sp. z o.o., http://www.mediaself.pl
eZ references: http://ez.no/partners/worldwide_partners/mediaself
eZ certified developer: http://ez.no/certification/verify/272585
eZ blog: http://ez.ryba.eu

Mark Marsiglio

Saturday 13 September 2008 11:22:46 am

This function might be useful -

http://ez.no/doc/ez_publish/technical_manual/4_0/reference/template_operators/arrays/unique

however, are you interested in the duplicates? Or the array with duplicates removed?

http://www.thinkcreative.com
Turning Ideas Into Strategic Solutions

Jorge estévez

Saturday 13 September 2008 12:21:12 pm

Hi!

What I would like to do is to allow my client to make a click on a link that will search all products unique codes and tell him wich items have the same code (an error while populating the site)...

So I must search all products codes, select all that have same ID's and show them to my customer.

This algorithm should be run on demand as it will slow down the site, I cannot aford to do such a thing each time an item is added.

Unique will eliminate duplicate entryes within the array THIS IS NOT WHAT I WOULD LIKE TO DO!

Unique could be used before running the "duplicate search code" to see if it has duplicate codes (comparing the array|count before with an array|count|unique)

any hints
thanks

Diseño Web Cuba
Web Design Cuba
www.elfosdesign.com

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 31 2025 04:33:33
Script start
Timing: Jan 31 2025 04:33:33
Module start 'layout'
Timing: Jan 31 2025 04:33:33
Module start 'content'
Timing: Jan 31 2025 04:33:35
Module end 'content'
Timing: Jan 31 2025 04:33:35
Script end

Main resources:

Total runtime1.4831 sec
Peak memory usage4,096.0000 KB
Database Queries62

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0057 588.2500151.2266
Module start 'layout' 0.00580.0026 739.476636.6797
Module start 'content' 0.00841.4739 776.1563989.8672
Module end 'content' 1.48230.0007 1,766.023411.8125
Script end 1.4830  1,777.8359 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.00320.2173160.0002
Check MTime0.00130.0895160.0001
Mysql Total
Database connection0.00080.056010.0008
Mysqli_queries1.388193.5937620.0224
Looping result0.00070.0473600.0000
Template Total1.453698.020.7268
Template load0.00210.145020.0011
Template processing1.451597.869220.7257
Template load and register function0.00150.102110.0015
states
state_id_array0.00140.091410.0014
state_identifier_array0.00130.087220.0006
Override
Cache load0.00190.1252320.0001
Sytem overhead
Fetch class attribute can translate value0.00070.047330.0002
Fetch class attribute name0.00210.142260.0004
XML
Image XML parsing0.00150.099330.0005
class_abstraction
Instantiating content class attribute0.00000.000960.0000
General
dbfile0.00160.1097220.0001
String conversion0.00000.000540.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1node/view/full.tplfull/forum_topic.tplextension/sevenx/design/simple/override/templates/full/forum_topic.tplEdit templateOverride template
4content/datatype/view/ezxmltext.tpl<No override>extension/community_design/design/suncana/templates/content/datatype/view/ezxmltext.tplEdit templateOverride template
8content/datatype/view/ezxmltags/paragraph.tpl<No override>extension/ezwebin/design/ezwebin/templates/content/datatype/view/ezxmltags/paragraph.tplEdit templateOverride template
3content/datatype/view/ezxmltags/literal.tpl<No override>extension/community/design/standard/templates/content/datatype/view/ezxmltags/literal.tplEdit templateOverride template
2content/datatype/view/ezimage.tpl<No override>extension/sevenx/design/simple/templates/content/datatype/view/ezimage.tplEdit templateOverride template
1content/datatype/view/ezxmltags/line.tpl<No override>design/standard/templates/content/datatype/view/ezxmltags/line.tplEdit templateOverride template
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 20
 Number of unique templates used: 7

Time used to render debug report: 0.0001 secs