
Re: [MacPerl] Searching a VERY large text file

To: mac-perl@iis.ee.ethz.ch
Subject: Re: [MacPerl] Searching a VERY large text file
From: Greenblatt & Seay <g-s@navix.net>
Date: Thu, 5 Feb 1998 18:00:19 -0500
In-Reply-To: <34d9f825.795149@mailhost.tornado.be>
References: <v03110700b0ff45a483dc@[193.192.223.4]><v03110700b0ff45a483dc@[193.192.223.4]>

>>Is there some way of compiling MacPerl scripts into machine language or
>>converting them into C/C++ routines that I could then read into
>>CodeWarrior... Maybe this would increase speed - then again maybe not??
>
>On Thu, 5 Feb 1998 10:36:55 +0000, Bart Lateur replied:
>
>I think that, if you read in the data via sysread/read in blocks of,
>say, 32k, and scan those, you might get a decent speed increase.
>
>But the code won't look as nice: you have to consider (and deal with)
>the possibility of missing a possible match on the edge between two
>blocks.
>
>	Bart.
>


I agree with Bart.  The following subroutine shows how I'd code the search.
The line after the 'else' statement avoids the problem of missing a
possible match on the edge between two blocks.


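sub search_file {
  # $pathname IS EXPECTED TO BE SET BY THE CALLER BEFORE THIS SUB IS CALLED
  my $find_text="Put text to search here.";
  my $length_to_read=32768; # READ 32K AT A TIME -- INCREASE IF YOU HAVE THE MEMORY
  my $find_this=lc($find_text);
  my $overlap=length($find_text) - 1; # BYTES CARRIED OVER BETWEEN BLOCKS
  my $offset = 0;   # FILE POSITION OF THE FIRST BYTE HELD IN $read_results
  my $read_results="";
  open(CHECKFILE, $pathname) || die "Could not open $pathname.\n";
  # THE 4TH ARGUMENT OF read() IS AN OFFSET INTO THE BUFFER, NOT INTO THE FILE,
  # SO EACH NEW BLOCK IS APPENDED AFTER THE TAIL KEPT FROM THE PREVIOUS ONE.
  while (read(CHECKFILE, $read_results, $length_to_read, length($read_results))){
     # index() ON LOWERCASED COPIES GIVES A LITERAL, CASE-INSENSITIVE SEARCH
     my $i=index(lc($read_results),$find_this);
     if ($i >= 0) {
        $found_it++;
        $found_loc= $offset + $i + 1;
        print "Found $find_text at byte $found_loc in... \"$pathname\" \n";
        last;
     } else {
        # KEEP ONLY THE TAIL SO A MATCH ON THE EDGE OF TWO BLOCKS IS NOT MISSED
        my $keep_from=length($read_results) - $overlap;
        $keep_from=0 if $keep_from < 0;
        $offset = $offset + $keep_from;
        $read_results=substr($read_results,$keep_from);
     }
  }
  close(CHECKFILE);
}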
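
To see the block-edge handling in isolation, here is a minimal sketch. It assumes a Perl newer than the MacPerl of this thread (5.8 or later, for in-memory filehandles), and the helper name find_in_blocks and its parameters are hypothetical, made up for illustration. The block size is set to a tiny 8 bytes so that the search text straddles two blocks.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of the subroutine above): returns the 1-based
# byte position of the first case-insensitive match of $find_text, or 0.
sub find_in_blocks {
    my ($fh, $find_text, $block_size) = @_;
    my $needle  = lc($find_text);
    my $overlap = length($needle) - 1;   # longest partial match a block can end with
    my $offset  = 0;                     # file position of $buffer's first byte
    my $buffer  = '';

    # read()'s 4th argument is an offset into $buffer, so each new block is
    # appended after the tail kept from the previous one.
    while (read($fh, $buffer, $block_size, length($buffer))) {
        my $i = index(lc($buffer), $needle);
        return $offset + $i + 1 if $i >= 0;

        # Drop everything except the last $overlap bytes before reading on.
        my $keep_from = length($buffer) - $overlap;
        $keep_from = 0 if $keep_from < 0;
        $offset += $keep_from;
        $buffer  = substr($buffer, $keep_from);
    }
    return 0;
}

# An in-memory filehandle stands in for the large file.
my $data = "aaaaaaNEEDLEbbbb";            # "NEEDLE" occupies bytes 7..12
open(my $fh, '<', \$data) or die "Could not open in-memory file: $!";
my $pos = find_in_blocks($fh, 'needle', 8);
close($fh);
print "Found at byte $pos\n";             # prints "Found at byte 7"
```

The first 8-byte block ends in "NE", so a search that looked at each block separately would miss the match. Carrying the last length-minus-one bytes forward is enough, because a partial match at the end of a block can never be longer than that.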


David Seay
http://www.mastercall.com/g-s




