Solved

Regular Expression to extract URL from text string

2 years ago
November 9, 2022
2 replies
1190 views

bobFermi
Rookie
3 replies

Hello All,

I need to extract URL(s) from a text string using the regex function. A sample string may be something like: "This is a test https:||yahoo.com this is still a test" (I had to replace the // with || to be able to post a URL)

I found a regular expression online that works well when I test it in an online regex tester, but the Nintex regex action returns nothing. Why would this expression work in a regex tester but not in Nintex?

/(?:(?:https?|ftp|file)://|www.|ftp.)(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[-A-Z0-9+&@#/%=~_|$?!:,.])*(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[A-Z0-9+&@#/%=~_|$])/igm

MegaJerk
Scholar
832 replies
Answer
2 years ago
November 10, 2022

It might not be working for you because of the enclosing regex delimiters (the leading and trailing slashes) and the flags at the end:

/.../igm

If I format it without the opening and closing delimiters and the flags:

(?:(?:https?|ftp|file)://|www.|ftp.)(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[-A-Z0-9+&@#/%=~_|$?!:,.])*(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[A-Z0-9+&@#/%=~_|$])

And instead configure the Regex Workflow Action to ignore case:

Then I get a working match extraction:

Let me know if this solves the issue for you!

PS: Remember that you'll need to assign a Collection type Workflow Variable to the output of your Regex Extraction! It will output ALL matches to their own index in the Collection!
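
To illustrate the point outside of Nintex, here is a minimal Python sketch. It is only an analogy: Python's re module is not the engine Nintex uses, and the helper name extract_urls, the second sample URL, and the idea that the delimiters get treated as literal characters are my assumptions. It passes the pattern without the slashes and flags, turns on case-insensitive matching as an option, and collects every match into a list, much like the Collection variable does.

```python
import re

# Pattern from the thread, without the /.../ delimiters and the igm flags.
PATTERN = (
    r"(?:(?:https?|ftp|file)://|www.|ftp.)"
    r"(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[-A-Z0-9+&@#/%=~_|$?!:,.])*"
    r"(?:([-A-Z0-9+&@#/%=~_|$?!:,.]*)|[A-Z0-9+&@#/%=~_|$])"
)


def extract_urls(text, pattern=PATTERN):
    """Return every full match as a list (hypothetical helper, not a Nintex API)."""
    # The pattern contains capture groups, so re.findall would return group
    # tuples; finditer with group(0) gives the whole matched text instead.
    return [m.group(0) for m in re.finditer(pattern, text, flags=re.IGNORECASE)]


# Sample sentence from the question, plus a second made-up URL to show
# that each match lands in its own list slot.
sample = (
    "This is a test https://yahoo.com this is still a test "
    "ftp://example.org/file.txt end"
)

urls = extract_urls(sample)
print(urls)  # ['https://yahoo.com', 'ftp://example.org/file.txt']

# If the slashes and flags are left in the pattern text, they are matched as
# literal characters, so nothing in the sample matches.
literal = extract_urls(sample, "/" + PATTERN + "/igm")
print(literal)  # []
```

The second call shows why a pattern copied straight from an online tester can come back empty: the tester strips the delimiters and reads the flags for you, while an engine that takes the raw pattern text does not.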


Anonymous
0 replies
2 years ago
November 12, 2022

Using RegexBuddy:

To get the URL only (everything up to and including the last slash):

.*(?=/)/

To get the filename only:

(?<=/)[^/?#]+(?=[^/]*$)

Test String:

This is a URL with a filename https://yahoo.com/gtip/foo.mmmhttps://yahoo.com/gtip/foo.pdf
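
For anyone who wants to try these two patterns quickly, here is a small Python sketch. Python's re module stands in for RegexBuddy here, and I am assuming the input is a single URL taken from the test string above rather than the whole sentence.

```python
import re

# A single URL taken from the test string (assumption: the input is the
# URL by itself, with no surrounding text).
url = "https://yahoo.com/gtip/foo.pdf"

# Everything up to and including the last slash.
base = re.search(r".*(?=/)/", url)

# The last path segment: preceded by a slash, with no further slash after it.
filename = re.search(r"(?<=/)[^/?#]+(?=[^/]*$)", url)

print(base.group(0))      # https://yahoo.com/gtip/
print(filename.group(0))  # foo.pdf
```

Note that the first pattern starts with a greedy .* and so also picks up any text in front of the URL on the same line. It isolates the URL part only when the input is the URL alone.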
