uri regex pattern match

Question

I'm having trouble getting URLs to match with:

 sregex rex = sregex::compile("(?:ftp|http|https)+://([\\S\^<\^>]+)", sregex::icase );

It matches all the URLs, but it also includes the >> on the end of each match, which I'm trying to exclude. What am I doing wrong?

What do you intend [\\S\^<\^>]+ to do?

– JaredC, Commented Jan 14, 2013 at 3:35

Justin O Barber · Accepted Answer · 2013-01-14 04:00:10Z

I believe what you want is this:

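 sregex rex = sregex::compile("(?:ftp|http|https)://([^\\s<>]+)", sregex::icase );

The character ^ only means "not" when it is the first character of a set, where it negates the whole set. In [\\S\^<\^>]+ the ^ characters are not first, so they are just literal carets. On top of that, \S already matches every non-whitespace character, including < and >, so the set still swallows the trailing >>. (Outside a set, ^ is an anchor that matches at the beginning of the target sequence or after a line terminator.)

To exclude characters, put everything you do not want into a single negated set: [^\\s<>]+ matches one or more characters that are not whitespace, < or >. The + after the scheme group in your original pattern is also unnecessary, since the scheme appears only once per URL.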
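For illustration, here is a minimal sketch of keeping < and > out of the match with a single negated set, [^\s<>]+. It uses std::regex from the standard library rather than Boost.Xpressive so that it builds without Boost; that swap, and the helper names extract_url_tails and caret_is_literal_when_not_first, are assumptions made for this example and are not from the original post.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper: returns the text after "://" for every
// ftp/http/https URL found in `text`.
std::vector<std::string> extract_url_tails(const std::string& text) {
    // [^\s<>]+ : one or more characters that are not whitespace, '<' or '>'.
    // The single leading '^' negates the whole set.
    static const std::regex rex(R"((?:ftp|https?)://([^\s<>]+))",
                                std::regex::icase);
    std::vector<std::string> tails;
    for (std::sregex_iterator it(text.begin(), text.end(), rex), end;
         it != end; ++it) {
        tails.push_back((*it)[1].str());  // capture group 1: everything after "://"
    }
    return tails;
}

// Hypothetical helper: checks that a '^' which is not the first character
// of a set is matched as a literal caret.
bool caret_is_literal_when_not_first() {
    return std::regex_match("^", std::regex("[a^]"));
}
```

Called on a string such as "<<http://example.com/a?b=1>>", extract_url_tails should stop the capture before the closing >>, because > is excluded by the set.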