List Info

Thread: html parser help




html parser help
user name
2006-06-05 19:42:53
I don't know of any commercial tool for that, maybe there
is. Some time
ago I implemented a small solution for doing the same, but
in PHP. I
used the Tidy library, also available as a .NET class, to
clean up the
html, and then used regular expressions to take out the
markup.
The main challenge however was to determine the email
formatting, this
is whether it was text or not, and handling attachments.
Hth, 
lizet

-----Original Message-----
From: Discussion of building .NET applications targeted for
the Web
[mailtoOTNET-WE
BDISCUSS.DEVELOP.COM] On Behalf Of Jeff
Sent: Monday, June 05, 2006 3:12 PM
To: DOTNET-WEBDISCUSS.DEVELOP.COM
Subject: [DOTNET-WEB] html parser help

Does anyone know of a .net component that can be used to
take an email
message and strip out all the funky html and nasty ms code
that
outlook/express imbeds into an email? We are trying to
display this into
a
multi line text box and it just looks nasty.  There has to
be some easy
way
to display an email body into a text box and easily strip
out all this
nasty
stuff.  Anyone have any ideas?? Trying to create this logic
ourselves
given
all the code variatations, and our limited time frame is not
an option.



Thanks.








===================================
This list is hosted by DevelopMentor(r)  http://www.develop.com

View archives and manage your subscription(s) at
http://discuss.develop.com


===================================
This list is hosted by DevelopMentorŪ  http://www.develop.com

View archives and manage your subscription(s) at http://discuss.develop.com

[1]

about | contact  Other archives ( Real Estate discussion Medical topics )