[Date Prev][Date Next][Thread Prev][Thread Next] [Search] [Date Index] [Thread Index]

[MacPerl-Forum] Getting the text from a HTMLfile. / match problem ?

To: comp-infosystems-www-authoring-cgi@moderators.isc.org
To: Forum Macperl <macperl-forum@macperl.org>, Web Macperl <macperl-webcgi@macperl.org>
Subject: [MacPerl-Forum] Getting the text from a HTMLfile. / match problem ?
From: Jimmy Lantz <webmaster@ostas.lu.se>
Date: Tue, 30 May 2000 11:46:07 +0200
Newsgroups: comp.infosystems.www.authoring.cgi,alt.perl,comp.lang.perl.misc
Organization: Department of East Asian languages

Hi, 
I need to get the text from a HTML file and I wonder if I should use the
HTML Parser
for this or if I should process it just by reading the text with a lot
of comments <!--kkkkkkk -->.
Which is the best way to go ?
(BTW. I'm working with MacPerl)


I need to get the text from the following
<html>
<body>
Some html
some html some more htmlcode.
Then find the following comment:
<!--VALUE: PARTS="4" PART1="3" PART2="0" PART3="2" PART4="1" -->
Get the values, read it into scalars 
$PARTS = '4';
$PART1 = '3';
$PART2 = '0';
$PART3 = '2';
$PART4 = '1';

then get the the rest of the comments 
which looks like this <!--VALUE: NAME="some_name" -->
TEXT TEXT TEXT<br>
TEXT <b>TEXT</b> TEXT<br>
<!--END -->

read it into scalars like this:
$some_name = 'TEXT TEXT TEXT<br>
TEXT <b>TEXT</b> TEXT<br>';



Some html
some html some more htmlcode.
</body>
</html>

I'm not good at matching expressions, is there a good tutorial 
somewhere or could someone give me some pointers on how to proceed on this?
I 'm very gratefull for any help.
Yours sincerely
Jimmy Lantz

==== Want to unsubscribe from this list?
==== Send mail with body "unsubscribe" to macperl-forum-request@macperl.org

Prev by Date: [MacPerl-Forum] HTML Parser for MacPerl?
Next by Date: [MacPerl-Forum] Reading several lines from a file into a scalar? (Getting the text from a HTMLfile # 2.)
Prev by thread: [MacPerl-Forum] Reading several lines from a file into a scalar? (Getting the text from a HTMLfile # 2.)
Next by thread: [MacPerl-Forum] HTML Parser for MacPerl?
Navigation: Date Index | Thread Index | Search | Other lists at bumppo.net