Thread: Search URL's in HTML - but NOT in tags




user name
2006-09-14 09:32:05
Hi everybody,

I have been looking for a few days now for a regular expression that
gives me all the URLs in a piece of HTML, but not those that appear
inside href attributes.
This HTML string demonstrates what I want:
<p>This a <a href="www.test.com">test</a>:<br />This URL www.second.test.com may only match</p>

I was able to create this one:
\b((www|ftp|http|https|file)(([A-Za-z0-9$_.+!*(),;/?:&~=-])|%[A-Fa-f0-9]){2,}(#([a-zA-Z0-9][a-zA-Z0-9$_.+!*(),;/?:&~=%-]*))?([A-Za-z0-9$_+!*();/?:~-])|(\w+[-\w\._%\d/\#]+))\b

But the values of the href attributes must not be matched.
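
To make the requirement concrete, here is a rough Python sketch of the
behaviour I am after. It is not a single regular expression: it first
splits the HTML on tags and then searches only the text in between. The
URL pattern in it is a deliberately simplified stand-in (an assumption,
not the pattern above), the function name is made up for illustration,
and the tag matcher assumes that no ">" occurs inside an attribute value.

```python
import re

# Simplified, hypothetical URL pattern (NOT the pattern from this post):
# a scheme followed by "://", or a leading "www.", then anything up to
# whitespace, angle brackets or quotes.
URL = re.compile(r"\b(?:(?:https?|ftp|file)://|www\.)[^\s<>\"']+")

# Crude tag matcher; assumes attribute values never contain '>'.
TAG = re.compile(r"<[^>]*>")

def urls_outside_tags(html):
    """Return URLs found in the text between tags, never inside a tag."""
    found = []
    # Splitting on a pattern without capturing groups drops the tags
    # themselves, so only the text chunks between them are searched.
    for chunk in TAG.split(html):
        found.extend(URL.findall(chunk))
    return found

html = ('<p>This a <a href="www.test.com">test</a>:<br />'
        'This URL www.second.test.com may only match</p>')
print(urls_outside_tags(html))  # ['www.second.test.com']
```

Trailing punctuation directly after a URL would still be included by
this simplified pattern, so it only shows which parts of the string
should be searched, not the final URL pattern.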

Any help is appreciated, thanks in advance.

Best regards,
Maarten


--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the
Google Groups "Regex" group.
To post to this group, send email to regex@googlegroups.com
To unsubscribe from this group, send email to
regex-unsubscribe@googlegroups.com
For more options, visit this group at http://groups.google.com/group/regex
-~----------~----~----~----~------~----~------~--~---
