How to protect against regex denial-of-service (ReDoS) attacks

One Reply to "How to protect against regex denial-of-service (ReDoS) attacks"

Schplurtz le déboulonné says:

July 20, 2023 at 3:12 am

Interesting article.

Your explanation is wrong, though. \w+\s* does not match “A long sentence with invalid characters that takes so much time to be matched that it potentially causes our CPU usage to increase”. It matches “A ”: \w is only a single word char, so \w+ matches as many word chars as are available (in this case just the letter A), then \s* matches as many spaces as possible (just one in this case). Then (\w+\s*)* matches the whole string: it matches as many repetitions of “at least one word char followed by 0 or more spaces” as it can. The rest of your explanation is therefore erroneous.
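To make the point above concrete, here is a small sketch that can be pasted into Node or a browser console. The input string is a shortened, hypothetical version of the article's sentence with one trailing "!" added as the invalid character; both patterns are deliberately left unanchored so no backtracking is triggered.

```javascript
// Shortened sample input (an assumption, not the article's exact string).
const input = "A long sentence with invalid characters!";

// A single pass of \w+\s* consumes one word plus the whitespace after it.
const firstPass = input.match(/\w+\s*/)[0];

// Repeating the group consumes every word up to the first invalid character.
const repeated = input.match(/(\w+\s*)*/)[0];

console.log(JSON.stringify(firstPass)); // "A "
console.log(JSON.stringify(repeated));  // "A long sentence with invalid characters"
```

With an invalid character present, the repeated group stops right before it; the slowdown only appears once the pattern is anchored with $ and the engine starts retrying every way of splitting the words.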

Too bad your solution is not a real solution either. It rapidly rejects a sequence with invalid chars, but it also rejects any sequence with valid chars! In fact, this formula will never match anything but the empty string. This is because you reference the first group from within the first group (the \1 is inside the first pair of parentheses). If you define the first group as “the first group plus the repetition of itself”, the only solution is the empty group.

A solution that works for your problem is “an optional whitespace-separated list of words, plus one word”, and it is spelled like this:
/^(\w+\s+)*\w+$/
which can be decoded as:
^: start
(…)*: repeat 0 or more times
\w+: at least one word char
\s+: at least one space char
\w+: followed by at least one word char
$: then end

It instantly matches “correct”.
It instantly matches “this is a list of word”.
It instantly does not match “this is an invalid list!”.
It instantly does not match “A long sentence with invalid characters that takes soo much time to be matched that it potentially causes our CPU usage to increase drastically!!!”.
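The four claims above can be checked with a short script. This is only a sketch: the names wordList, samples and results are made up for illustration, and the timing uses Date.now(), so it is coarse (anything that finishes in under a millisecond shows as 0).

```javascript
// Pattern proposed in this comment: zero or more "word + whitespace"
// groups, followed by one final word.
const wordList = /^(\w+\s+)*\w+$/;

// The four inputs listed above, in the same order.
const samples = [
  "correct",
  "this is a list of word",
  "this is an invalid list!",
  "A long sentence with invalid characters that takes soo much time to be matched that it potentially causes our CPU usage to increase drastically!!!",
];

// Record whether each input matches and roughly how long the test took.
const results = samples.map((text) => {
  const start = Date.now();
  const matched = wordList.test(text);
  return { matched, ms: Date.now() - start, text };
});

for (const { matched, ms, text } of results) {
  console.log(`${matched ? "match   " : "no match"} ${ms} ms  ${text.slice(0, 40)}`);
}
```

Because \w and \s never match the same character, and each repetition of the group must end in at least one whitespace char, there is only one way to split the input into words, which is why the failing inputs are rejected quickly.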
