Notes: Regular expressions

Character classes
.	any character except newline
\w \d \s	word, digit, whitespace
\W \D \S	not word, digit, whitespace
[abc]	any of a, b, or c
[^abc]	not a, b, or c
[a-g]	character between a & g
Anchors
^abc$	start / end of the string
\b	word boundary
Escaped characters
\. \* \\	escaped special characters
\t \n \r	tab, linefeed, carriage return
\u00A9	unicode escaped ©
Groups & Lookaround
(abc)	capture group
\1	backreference to group #1
(?:abc)	non-capturing group
(?=abc)	positive lookahead
(?!abc)	negative lookahead
Quantifiers & Alternation
a* a+ a?	0 or more, 1 or more, 0 or 1
a{5} a{2,}	exactly five, two or more
a{1,3}	between one & three
a+? a{2,}?	match as few as possible
ab|cd	match ab or cd

Reference: http://www.regexpal.com/
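
A few of the cheat-sheet entries above (anchors, groups, lookaround, lazy quantifiers, alternation) are easier to see in action. The following is a minimal sketch using Python's standard `re` module; the sample strings and variable names are made up for illustration, and other regex flavors may differ in small details.

```python
import re

# Anchors: ^ and $ pin the match to the start and end of the string.
whole = bool(re.search(r"^abc$", "abc"))     # True
partial = bool(re.search(r"^abc$", "xabc"))  # False

# \b marks a word boundary, so "concat" is not counted.
words = re.findall(r"\bcat\b", "cat concat cat.")  # ['cat', 'cat']

# Capture group plus backreference: \1 must repeat what group 1 matched.
doubled = re.search(r"(\w+) \1", "it was was fine").group(1)  # 'was'

# Non-capturing group: groups "ab" for the + quantifier without capturing it.
runs = re.findall(r"(?:ab)+", "ababab ab")  # ['ababab', 'ab']

# Positive lookahead: digits only when followed by "px" (px is not consumed).
pixels = re.findall(r"\d+(?=px)", "10px 20em 30px")  # ['10', '30']

# Negative lookahead: "foo" only when NOT followed by "bar".
foo_pos = re.search(r"foo(?!bar)", "foobar foobaz").start()  # 7

# Greedy vs. lazy: +? stops as early as possible.
greedy = re.search(r"<.+>", "<b>bold</b>").group()  # '<b>bold</b>'
lazy = re.search(r"<.+?>", "<b>bold</b>").group()   # '<b>'

# Alternation: match either "ab" or "cd".
either = re.findall(r"ab|cd", "abxcd")  # ['ab', 'cd']

print(whole, partial, words, doubled, runs, pixels, foo_pos, greedy, lazy, either)
```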

We can use predefined character classes:

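. matches any single character. For example, given the string abcdebcadxbc, matching with .bc finds three substrings: abc, ebc, and xbc. Matching with ..cd finds only one substring, abcd, because the two dots consume the two characters in front of cd.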
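
The dot example can be checked directly. This is a small sketch using Python's `re.findall`; the variable names are illustrative only. Note that `..cd` consumes two characters before `cd`, so the match is the four-character substring `abcd`.

```python
import re

text = "abcdebcadxbc"

# "." stands for any single character (except a newline by default).
dot_bc = re.findall(r".bc", text)
dot_dot_cd = re.findall(r"..cd", text)

print(dot_bc)      # ['abc', 'ebc', 'xbc']
print(dot_dot_cd)  # ['abcd']
```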

The examples above match specific characters. You can also use a character class to match against a set or range of characters, for example:
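
As a rough illustration of character classes, here is a short Python sketch. The sample string is invented for the example, and the classes used are the ones listed in the cheat sheet at the top of these notes.

```python
import re

sample = "bag 42 hi!"

# [abc] matches any one of a, b, or c.
any_abc = re.findall(r"[abc]", sample)        # ['b', 'a']

# [a-g] matches one character in the range a..g; + repeats it.
a_to_g = re.findall(r"[a-g]+", sample)        # ['bag']

# [^...] negates the class: anything that is not a..g and not whitespace.
not_a_to_g = re.findall(r"[^a-g\s]+", sample)  # ['42', 'hi!']

# Predefined classes: \d digit, \w word character, \S non-whitespace.
digits = re.findall(r"\d+", sample)           # ['42']
word_runs = re.findall(r"\w+", sample)        # ['bag', '42', 'hi']
non_space = re.findall(r"\S+", sample)        # ['bag', '42', 'hi!']

print(any_abc, a_to_g, not_a_to_g, digits, word_runs, non_space)
```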

Specifying one character at a time is limiting. You can also use greedy quantifiers to specify how many times a character may appear:
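
The sketch below tries each greedy quantifier from the cheat sheet against the same input, again with Python's `re` module. The sample string is an assumption chosen so that each quantifier gives a different result.

```python
import re

sample = "ct cat caaat"

# a*  : zero or more "a"
star = re.findall(r"ca*t", sample)         # ['ct', 'cat', 'caaat']
# a+  : one or more "a"
plus = re.findall(r"ca+t", sample)         # ['cat', 'caaat']
# a?  : zero or one "a"
optional = re.findall(r"ca?t", sample)     # ['ct', 'cat']
# a{2,} : two or more "a"
two_plus = re.findall(r"ca{2,}t", sample)  # ['caaat']
# a{1,3} : between one and three "a"
one_to_three = re.findall(r"ca{1,3}t", sample)  # ['cat', 'caaat']

# "Greedy" means the quantifier takes as many characters as it can.
longest = re.search(r"a+", "caaat").group()  # 'aaa'

print(star, plus, optional, two_plus, one_to_three, longest)
```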

Reference:

http://www.codeproject.com/Articles/9099/The-Minute-Regex-Tutorial

http://regex.learncodethehardway.org/book/