
Re: [FWP] Matching regexp's at a stream of data?

To: Sven Neuhaus <sn@neopoly.de>
Subject: Re: [FWP] Matching regexp's at a stream of data?
From: abigail@foad.org
Date: Thu, 3 Aug 2000 02:12:11 -0400
Cc: fwp@technofile.org
In-Reply-To: <20000803044310.A17126@neopoly.de>; from sn@neopoly.de on Thu, Aug 03, 2000 at 04:43:10AM +0200
References: <20000802072049.A15975@neopoly.de> <9h8h5gzkg2NN092yn@efn.org> <20000803005419.10181.qmail@foad.org> <20000803044310.A17126@neopoly.de>

On Thu, Aug 03, 2000 at 04:43:10AM +0200, Sven Neuhaus wrote:
> On Wed, Aug 02, 2000 at 08:54:19PM -0400, abigail@foad.org wrote:
> > local $_;
> > sysread $fh => $_, $maxchars or return;
> > do {s/^pattern1/replacement1/ and next;   # Note the anchor.
> >     s/^pattern2/replacement2/ and next;
> >     ...
> > } while defined substr $_ => 0, 1, "" and
> >        (sysread $fh => $_, length, 1 or length);
> 
> That's probably too slow, isn't it?
> 
> I was thinking when I have a max match size of, say, 400 bytes, the
> algorithm looks at 600 bytes then slides the window 200 bytes further.
> There must be some overlap or you will miss some matches. Testing 
> every byte is too slow, though (haven't benchmarked it, but I'd be
> fairly surprised if it weren't).

What makes you think that? Sure, reading 200-byte chunks means less
I/O, but you lose the anchor in the regex, making the regexes
potentially a lot slower.

I would be very surprised if one method were faster than the other without
taking the actual data and regexes into consideration.
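
To compare the two approaches on real data, something like the following
could serve as a starting point. It is only a minimal sketch under stated
assumptions: the names fill, scan_anchored and scan_windowed are made up
for illustration, it reports match offsets instead of doing replacements,
it uses buffered read rather than sysread, and it assumes no match is
ever longer than $max bytes.

```perl
use strict;
use warnings;

# Hypothetical helper: top the buffer up to $want bytes from $fh.
sub fill {
    my ($fh, $buf, $want) = @_;
    while (length($$buf) < $want) {
        my $got = read($fh, $$buf, $want - length($$buf), length($$buf));
        last unless $got;               # EOF (0) or error (undef)
    }
}

# Per-byte approach: test the anchored pattern at the front of the
# buffer, then drop one byte and refill.
sub scan_anchored {
    my ($fh, $re, $max) = @_;
    my ($buf, $pos, @hits) = ('', 0);
    while (1) {
        fill($fh, \$buf, $max);
        last unless length $buf;
        push @hits, $pos if $buf =~ /\A$re/;
        substr($buf, 0, 1, '');
        $pos++;
    }
    return @hits;
}

# Windowed approach: look at $max + $step bytes, accept only matches
# that start in the first $step bytes, then slide by $step.
sub scan_windowed {
    my ($fh, $re, $max, $step) = @_;
    my ($buf, $base, @hits) = ('', 0);
    while (1) {
        fill($fh, \$buf, $max + $step);
        last unless length $buf;
        while ($buf =~ /(?=$re)/g) {    # unanchored: every start position
            last if $-[0] >= $step;     # belongs to a later window
            push @hits, $base + $-[0];
        }
        pos($buf) = undef;              # reset /g position before sliding
        substr($buf, 0, $step, '');
        $base += $step;
    }
    return @hits;
}
```

Both subs take an open filehandle and a qr// pattern. Timing them against
each other (for instance with the core Benchmark module) on the actual
data and regexes is what would settle the question.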

Abigail

