I'm interested in comparing image files, in order to find duplicates. Because there are several thousand files, an NxN comparison is out of the question. Also, it is quite possible that some of the images may have been resized or tweaked in other manners, so the comparison will not be an exact one. Finally, the images are mostly in JPEG format, so they will have to be expanded into pixels before any comparison can be done. Here is some pseudo-perl that describes my current thinking: Walk tree of JPEG files, using File::Find Expand file image to pixmap form create a signature ($key) for image $tag{$key}++ if ($got{$key} ne ''); $got{$key} .= " $File::Find::name ($key)\n"; foreach $key (keys(%tag)) { push (@val, $got{$key}); } foreach $val (sort(@val)) { printf("%s\n", $val); } Actually, the code above assumes that the signatures will be exact, which is unlikely... Anyway, I'm looking for suggestions on: * expanding JPEGs into pixmaps * signature creation * non-exact signature comparisons Ideas, anyone? -r -- Rich Morin: rdm@cfcl.com, +1 650-873-7841, http://www.ptf.com/~rdm Prime Time Freeware: info@ptf.com, +1 408-433-9662, http://www.ptf.com MacPerl: http://www.macperl.com, http://www.ptf.com/ptf/products/MPPE MkLinux: http://www.mklinux.apple.com, http://www.ptf.com/ptf/products/MKLP ===== Want to unsubscribe from this list? ===== Send mail with body "unsubscribe" to macperl-request@macperl.org