StackOverflow2013

plurals

POWhy are C# compiled regular expressions faster than equivalent string methods?
primarykey
Id
12428776
data
AcceptedAnswerId
12428997
AnswerCount
3
ClosedDate
CommentCount
5
CommunityOwnedDate
CreationDate
2012-09-14T16:48:50.750
FavoriteCount
3
LastActivityDate
2012-09-14T20:25:14.890
LastEditDate
2017-05-23T11:54:07.150
LastEditorUserId
-1
OwnerUserId
203002
ParentId
0
PostTypeId
1
Score
18
ViewCount
4809
LastEditorDisplayName
text
Body
Every time I have to do simple containment or replacement operations on strings, where the term that I'm searching for is a fixed value, I find that if I take my sample input and do some profiling on it, using a compiled regular expression is nearly* always faster than using the equivalent method from the String class. I've tried comparing a variety of methods (<code>hs</code> is the "haystack" to search, <code>ndl</code> is the "needle" to search for, and <code>repl</code> is the replacement value; <code>regex</code> is always created with the <code>RegexOptions.Compiled</code> option): <ul> <li><code>hs.Replace( ndl, repl )</code> vs <code>regex.Replace( hs, repl )</code></li> <li><code>hs.Contains( ndl )</code> vs <code>regex.IsMatch( hs )</code></li> </ul> I've found quite a few discussions about which of the two techniques is faster (<a href="https://stackoverflow.com/questions/3186285/c-sharp-which-is-faster-string-contains-or-regex-ismatch">1</a>, <a href="https://stackoverflow.com/questions/3601465/string-split-vs-regex-split">2</a>, <a href="https://stackoverflow.com/questions/9380062/is-using-a-regular-expression-faster-than-indexof">3</a>, and loads of others), but those discussions always seem to end with one of two pieces of advice: <ol> <li>Use the string version for simple operations and regex for complex operations (which, from a raw performance perspective, doesn't even seem to be necessarily a good idea), or</li> <li>Run a test and compare the two (and for equivalent tests, the regex version seems to always perform better).</li> </ol> I don't understand how this can possibly be the case: how does the regex engine find a substring match faster than the equivalent string method? This seems to hold true whether the search space is very small or very large, whether the search term is short or long, and whether the search term occurs early or late in the search space. So, why are regular expressions faster? 
<hr> * In fact, the only case where I've managed to show that the string version is faster than a compiled regex is when searching an empty string! Every other input, from single-character strings to very long strings, is processed faster by a compiled regex than by the equivalent string method. <hr> Update: Added a clause to clarify that I'm looking at cases where the search term is known at compile time. For dynamic or one-time operations, the overhead of compiling the regular expression will tend to skew the results in favor of the string methods.
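The comparison described in the question could be set up roughly as follows. This is only a minimal sketch, not the asker's actual test code: the class name `ContainsVsRegex`, the `Time` helper, the sample haystack, needle and replacement values, and the iteration count are all assumptions made for illustration. The regex is built once from the fixed search term with `RegexOptions.Compiled`, and one warm-up call is made before timing so that JIT and regex compilation costs are not counted.

```csharp
using System;
using System.Diagnostics;
using System.Text.RegularExpressions;

public static class ContainsVsRegex
{
    // Hypothetical sample data; the question does not show its inputs.
    public const string Haystack = "the quick brown fox jumps over the lazy dog";
    public const string Needle = "lazy";
    public const string Replacement = "sleepy";

    // Fixed search term, escaped so it is matched literally, compiled once.
    public static readonly Regex Pattern =
        new Regex(Regex.Escape(Needle), RegexOptions.Compiled);

    // Runs the action once as a warm-up, then times the given number of calls.
    public static TimeSpan Time(Action action, int iterations)
    {
        action();
        Stopwatch sw = Stopwatch.StartNew();
        for (int i = 0; i < iterations; i++)
        {
            action();
        }
        sw.Stop();
        return sw.Elapsed;
    }

    public static void Main()
    {
        const int iterations = 1000000; // assumed; tune for your machine
        bool found = false;
        string replaced = "";

        Console.WriteLine("hs.Contains(ndl):        " +
            Time(() => found = Haystack.Contains(Needle), iterations));
        Console.WriteLine("regex.IsMatch(hs):       " +
            Time(() => found = Pattern.IsMatch(Haystack), iterations));
        Console.WriteLine("hs.Replace(ndl, repl):   " +
            Time(() => replaced = Haystack.Replace(Needle, Replacement), iterations));
        Console.WriteLine("regex.Replace(hs, repl): " +
            Time(() => replaced = Pattern.Replace(Haystack, Replacement), iterations));

        // Use the results so the calls cannot be treated as dead code.
        Console.WriteLine(found + " " + replaced);
    }
}
```

The timings depend heavily on the runtime version, the build configuration and the inputs, so a sketch like this is a starting point for reproducing the observation rather than evidence for it. A dedicated benchmarking harness would give more trustworthy numbers than a hand-rolled `Stopwatch` loop.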
Tags
<c#><.net><regex><string><performance>
Title
Why are C# compiled regular expressions faster than equivalent string methods?
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USCommunity
UserOwnerUserId
1. USChris Phillips
plurals
PostLinksPostIdRelatedPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
3. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
3. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POWhy are C# compiled regular expressions faster than equivalent string methods?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POWhy are C# compiled regular expressions faster than equivalent string methods?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POWhy are C# compiled regular expressions faster than equivalent string methods?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
