Hi Maarten,
You can try adding a negative lookbehind at the front of the
regex:
(?<!href=)
Rick Quatro
Carmen Publishing
585-659-8267
www.frameexpert.com
> Hi everybody,
>
> I am looking a few days now for a regular expression
which gives me all
> the URL's, but not those which are in href-attributes.
> This HTML string demonstrates what I want:
> <p>This a <a
href="www.test.com">test</a>:<br
/>This URL
> www.second.test.com may only match</p>
>
> I was able to create this one:
>
\b((www|ftp|http|https|file)(([A-Za-z0-9$_.+!*(),;/?: &~=-])|%[A-Fa-f0-9]){2,}(#([a-zA-Z0-9][a-zA-Z0
-9$_.+!*(),;/?: &~=%-]*))?([A-Za-z0-9$_+!*();/?:~-])|(\w+ [-\w\._%\d/\#]+))\b
>
> But the values of the href-attributes may not be found.
>
> HELP is needed, thx in advance.
>
> Best regards,
> Maarten
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the
Google Groups "Regex" group.
To post to this group, send email to regex googlegroups.com
To unsubscribe from this group, send email to
regex-unsubscribe googlegroups.com
For more options, visit this group at http://groups.go
ogle.com/group/regex
-~----------~----~----~----~------~----~------~--~---
|