Regex
Table of Contents:
Slides
Intro
Consider this function:
Now, consider the function with regular expressions.
What is a Regular Expression
A regular expression (or "Reg Ex") is a sequence of characters that specifies a "match pattern" to search for strings in text, extract information, or validate input.
To create a regular expression, we use a pair of forward slashes with the match pattern inside.
The symbols and characters inside of the / /
define the characteristics of the strings that the regular expression can be used to search for. In the example above, we have:
^
requires that the matching string must start with...\w+
one or more word characters (letters, numbers, or_
).The
\w
represents a word character and the+
modifier means "one or more"
$
requires that the matching string must end before any other characters are included (no spaces, no symbols, nothing!)
What can you do with regular expressions?
Validate Strings
Like the isOnlyAlphaNumeric
function, regular expressions are often used to validate if a string matches a specific pattern.
One of the most common challenges in applications is validating proper date formats. This function uses a regular expression to ensure that a given string is in the "MM-DD-YYYY" format:
Extract Strings From Text
One cool usage of regular expressions is to "find all instances of X in a string". For example, in a big block of text, grab all of the emails:
Regular Expression Syntax
There are a lot of cool ways to use regular expressions
Here are some snippets to get started!
Name
Info
Example
Flags
g=global, I=insensitive
/cat/gi
letters
You know letters
/dog/
digits
You know digits
/12/
Character Set/Class
Brackets
/[aeiou]/
Negated Character Set
Carrot and brackets
/[^aeiou]/
O or more
*
/a*/
1 or more
+
/a+/
0 or 1
?
/a?/
Or
|
/cat|dog/
Start of string
Carrot, no brackets
/^hi/
End of string
Dollar sign
/bye$/
Quantifier exactly
Braces, one number
/AH{10}/
Quantifier at least X many
Braces, one number and comma
/OH{3,}/
Quantifier exact range
Braces, two comma separated numbers
/WO{2,4}W/
Groups
Parens with optional pipe OR operator
/(my)\s(cat|dog)/
Special Characters
[0-9]
Any single digit
[3-30]
And number in range
[a-z]
Any lowercase letter
[a-d]
Any lowercase letter in range
[A-Z]
Any uppercase letter
[E-Q]
Any uppercase letter
[a-zA-Z]
Any letter
\d
Any single digit
\D
Any NON digit
\w
Any alphanumeric (and _)
\W
Any NON alphanumeric
\s
Any whitespace character
\S
Any NON whitespace character
\b
Any word break
\B
Any NON word break
.
Any character at all
\.
An escaped period
How to Regular Expressions in JavaScript
The methods to be aware of that utilize regular expressions are:
RegExp.test(str)
- validate that a string has a matchstr.search(RegExp)
- find the index of a match in a stringstr.match(RegExp)
- find one or more matches in a stringstr.replace(RegExp, replacement)
- replace one or more matches in a string
Note that the first method is invoked on a regular expression and the input is a string while the other two are invoked on a string and the input is a regular expression.
RegExp.test(str)
and the i
flag
RegExp.test(str)
and the i
flagThe RegExp.test(str)
returns true
if there is a match between the regular expression and the given string.
By default, regular expressions are case sensitive and, unless otherwise specified, must be an exact match.
Adding the i
flag after the second /
allows for case-insensitive matches:
str.search(RegExp)
str.search(RegExp)
str.search(RegExp)
returns the index of the first match between the string and the given regular expression, or -1
if none exist.
str.match(RegExp)
and the g
flag
str.match(RegExp)
and the g
flagstr.match(RegExp)
returns an array containing matches between the string and the given regular expression.
The returned array is a little bit strange looking at first but its just an array with one value, and then the properties index
, input
, and groups
By default, regular expressions will stop searching after the first match is found. Adding the global g
flag after the second /
causes match
to return ALL matches.
str.replace(RegExp, replacement)
str.replace(RegExp, replacement)
str.replace(RegExp, replacement)
replaces matches between the string the given regular expression with the given replacement
string.
By default, replace
will only replace the first match. You can use the g
flag to replace ALL matches.
Last updated