Cheat sheet: regular expressions in PHP

^ - Beginning of the line
$ - End of line

. - Any character except a newline (unless the /s modifier is set)
[...] - Any one character from the listed set. Other operators do not work inside the square brackets, but escape sequences such as \d or \n do. A hyphen specifies a range, from the first character to the last. For example, [a-f] means any one of the letters a, b, c, d, e, f.
[^...] - Any one character that is not in the listed set. Other operators do not work inside the square brackets, but escape sequences such as \d or \n do. A hyphen specifies a range, from the first character to the last. For example, [^0-9] means any character other than 0, 1, 2, 3, 4, 5, 6, 7, 8, 9.
\# - The character # that follows the backslash is taken literally, where # is any character except a-z and 0-9. For example, \\ means the symbol \, \. means the symbol . (a dot), \$ means the symbol $, and so on.
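
The character sets and escaping rules above can be tried out with a short sketch. The sample strings and variable names are made up for illustration; preg_match() and preg_quote() are standard PHP functions.

```php
<?php
// [a-f] matches one letter from a to f; [^0-9] matches one non-digit.
$hasHexLetter = preg_match('/[a-f]/', 'x9c');        // "c" is in the set
preg_match('/[^0-9]/', '2024-05', $firstNonDigit);   // first non-digit is "-"

// An escaped dot matches only a literal dot; a bare dot matches any character.
$literalDot = preg_match('/^a\.b$/', 'a.b');
$bareDot    = preg_match('/^a.b$/', 'axb');
$escapedDot = preg_match('/^a\.b$/', 'axb');

// preg_quote() escapes every metacharacter in arbitrary text,
// so the text can be embedded in a pattern as a literal.
$quoted      = preg_quote('1+1=2?', '/');
$quotedMatch = preg_match('/^' . $quoted . '$/', '1+1=2?');
```

Using preg_quote() is usually safer than escaping by hand whenever the literal text comes from user input.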

\b - Beginning or end of a word (a word boundary)
\B - Any position that is not a word boundary
[[:alnum:]] - alphanumeric characters
[[:digit:]] - decimal digits

[[:xdigit:]] - hexadecimal digits
[[:alpha:]] - alphabetic characters
[[:upper:]] - uppercase letters
[[:lower:]] - lowercase letters

[[:punct:]] - punctuation
[[:space:]] - whitespace characters
[[:blank:]] - tab and space characters
[[:print:]] - printable characters

[[:cntrl:]] - control characters
[[:graph:]] - printable characters, excluding the space
\xNN - the character with hexadecimal ASCII code NN (\x20 is a space, \x4A is J, \x6A is j, etc.)
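
A minimal sketch of the POSIX classes and hexadecimal codes listed above. The phone number and other inputs are invented for the example; note that a POSIX class is only valid inside a bracket expression.

```php
<?php
// Keep only the decimal digits of a string.
$digits = preg_replace('/[^[:digit:]]/', '', 'tel: +38 (044) 123-45-67');

// Check that a string consists of hexadecimal digits only.
$isHex = preg_match('/^[[:xdigit:]]+$/', '1aF9');

// Several classes can share one bracket expression.
$words = preg_split('/[[:space:][:punct:]]+/', 'one, two;three', -1, PREG_SPLIT_NO_EMPTY);

// \x20 is a space and \x4A is the letter J.
$space   = preg_match('/^a\x20b$/', 'a b');
$letterJ = preg_match('/^\x4A$/', 'J');
```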

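\t - tab character
\n - newline
\r - carriage return
\f - form feed

\v - vertical tab (in current PCRE versions, \v matches any vertical whitespace character)
\a - bell (alarm)
\e - escape
\033 - octal notation of a character

\x1A - hexadecimal notation of a character
\cX - control character (Ctrl-X)
\l - lowercase the next character (Perl only; not supported by PCRE in PHP)
\u - uppercase the next character (Perl only; not supported by PCRE in PHP)

\L - lowercase all characters up to \E (Perl only)
\U - uppercase all characters up to \E (Perl only)
\E - end of a case change or of a \Q sequence
\Q - turn off the special meaning of metacharacters up to \E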

\w - one alphanumeric character or '_'
\W - one character that is neither alphanumeric nor '_'
\s - one whitespace character

\S - one non-whitespace character
\d - one digit
\D - one non-digit
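
The shorthand classes above cover most everyday cases. The following sketch uses invented sample strings to show \d, \s and \W with three standard functions.

```php
<?php
// \d+ finds every run of digits.
preg_match_all('/\d+/', 'Order 66 shipped 2 items', $numbers);

// \s+ splits on any run of whitespace (spaces, tabs, newlines).
$tokens = preg_split('/\s+/', "alpha \t beta\ngamma");

// \W+ replaces every run of non-word characters with one underscore.
$slug = preg_replace('/\W+/', '_', 'my file (v2).txt');
```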

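\b - word boundary
\B - not a word boundary
\A - the beginning of the subject string (unlike ^, not the beginning of each line in multiline mode)
\Z - the end of the subject string, or just before a final newline (unlike $, not the end of each line in multiline mode)

\G - the position where the previous match ended, as in Perl's m//g (in PHP, the position where matching starts)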
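
The difference between ^ and \A, and between \b and \B, is easiest to see side by side. This is only an illustrative sketch with made-up input.

```php
<?php
$text = "first line\nsecond line";

// With /m, ^ matches at the start of every line ...
$caretCount = preg_match_all('/^\w+/m', $text, $perLine);
// ... while \A still matches only at the start of the whole string.
$absoluteCount = preg_match_all('/\A\w+/m', $text, $wholeString);

// \b requires a word boundary on both sides of "cat".
$standalone = preg_match('/\bcat\b/', 'concatenate a cat');
$noBoundary = preg_match('/\bcat\b/', 'concatenate');
// \B requires the opposite: "cat" strictly inside a longer word.
$inside = preg_match('/\Bcat\B/', 'concatenate');
```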

(...) - Group characters into one subpattern and capture what it matches
| - Previous or next pattern (logical "OR")
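
A small sketch of capturing groups and alternation. The date and file name are invented sample data; captured groups are numbered from 1 in the order of their opening parentheses.

```php
<?php
// Three groups capture the year, month and day separately.
preg_match('/(\d{4})-(\d{2})-(\d{2})/', 'released 2015-12-03', $date);
// $date[0] is the whole match, $date[1] to $date[3] are the groups.

// Alternation inside a group: any one of three extensions.
$isImage = preg_match('/\.(jpg|png|gif)$/', 'photo.png', $ext);
```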

* - Zero or more times
+ - One or more times
? - Zero or one time
{n} - Exactly n times

{n,} - n or more times
{n,m} - From n to m times
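
The quantifiers above always apply to the item immediately before them. A short sketch with invented inputs:

```php
<?php
// * is greedy: it takes as many "a" characters as it can.
preg_match('/a*/', 'aaab', $run);

// ? makes the preceding "u" optional.
$american = preg_match('/colou?r/', 'color');
$british  = preg_match('/colou?r/', 'colour');

// {n}, {n,} and {n,m} set exact and ranged repeat counts.
$exactlyThree = preg_match('/^\d{3}$/', '123');
$tooLong      = preg_match('/^\d{3}$/', '1234');
$atLeastTwo   = preg_match('/^\d{2,}$/', '1');
$twoToFour    = preg_match('/^\d{2,4}$/', '12345');
```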
(?<=...) - Lookbehind: the text immediately before the current position must match the subpattern. In most PCRE versions the subpattern must match a fixed number of characters.

(?<!...) - Negative lookbehind.
(?=...) - Lookahead.
(?!...) - Negative lookahead.
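
A sketch of the four lookaround assertions, written in the PCRE syntax that PHP accepts. The price strings are invented; none of the assertions consume characters, so the text they check is not part of the match.

```php
<?php
// Lookbehind: digits that come right after a dollar sign.
preg_match('/(?<=\$)\d+/', 'cost: $42', $afterDollar);

// Negative lookbehind: digit runs not preceded by "$" or another digit.
preg_match_all('/(?<![\$\d])\d+/', '$42 and 17', $plain);

// Lookahead: digits that are followed by " USD".
preg_match('/\d+(?= USD)/', 'pay 100 USD or 90 EUR', $usd);

// Negative lookahead: "foo" that is not followed by "bar".
preg_match_all('/foo(?!bar)/', 'foobar foobaz', $foo, PREG_OFFSET_CAPTURE);
```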

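i - do not distinguish between lowercase and uppercase letters.
m - treat the subject as a multiline string: ^ and $ match at the start and end of every line.
s - treat the subject as a single line: the dot also matches newlines.
x - extended syntax (whitespace and comments are allowed in the pattern)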
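
Modifiers are written after the closing delimiter. The sketch below exercises i, m, s and x on made-up inputs.

```php
<?php
// i: case-insensitive matching.
$caseless = preg_match('/php/i', 'PHP 8');

// m: ^ and $ match at the start and end of every line.
$numericLines = preg_match_all('/^\d+$/m', "10\n20\nabc");

// s: the dot also matches a newline.
$dotDefault = preg_match('/a.b/', "a\nb");
$dotAll     = preg_match('/a.b/s', "a\nb");

// x: whitespace and # comments inside the pattern are ignored.
$extended = preg_match('/
    (\d{3})   # first part
    -         # separator
    (\d{4})   # second part
/x', '555-1234', $parts);
```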

e - preg_replace() only: after performing the standard substitutions in the replacement string, evaluates it as PHP code and uses the result as the replacement. This modifier was deprecated in PHP 5.5 and removed in PHP 7.0; use preg_replace_callback() instead.
A - the pattern is anchored: it matches only at the start of the string being searched.
D - the $ metacharacter in the pattern matches only at the very end of the subject. Without this modifier, $ also matches immediately before the last character if that character is a newline (but not before any other newline). This modifier is ignored if the m modifier is set. Perl has no equivalent modifier.
S - if this modifier is set, the pattern is analyzed more thoroughly before matching. At present this is useful only for non-anchored patterns that do not begin with a single fixed character.
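
A sketch of the A and D modifiers, plus a callback-based replacement of the kind that the e modifier used to provide. The tag syntax and sample strings are assumptions made for the example.

```php
<?php
// In place of /e: compute each replacement in a callback.
$shouted = preg_replace_callback(
    '/#(\w+)/',
    function (array $m): string {
        return strtoupper($m[1]);   // $m[1] is the word after "#"
    },
    'tags: #php #regex'
);

// A: the match must start at the very beginning of the subject.
$anchoredMiss = preg_match('/\d+/A', 'abc123');
$anchoredHit  = preg_match('/\d+/A', '123abc');

// D: $ no longer matches before a trailing newline.
$dollarDefault = preg_match('/^\d+$/', "123\n");
$dollarEndOnly = preg_match('/^\d+$/D', "123\n");
```

The D modifier matters for input validation: without it, a value with a trailing newline passes a pattern that looks strict.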

U - inverts the greediness of quantifiers: they are not greedy by default, but become greedy if followed by the symbol '?'. This feature is not compatible with Perl. The U modifier can also be set inside the pattern with (?U).
X - enables additional PCRE functionality that is not compatible with Perl: any backslash in the pattern followed by a letter that has no special meaning results in an error, because such combinations are reserved for future expansion. By default, as in Perl, a backslash followed by a letter with no special meaning is treated as a literal. At present, no other features are controlled by this modifier.
u - enables additional PCRE functionality that is not compatible with Perl: the pattern and the subject are treated as UTF-8 strings. The u modifier is available in PHP 4.1.0 and higher on Unix platforms, and in PHP 4.2.3 and higher on Windows platforms.
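
The effect of U and u can be checked with a short sketch. The HTML fragment and the city name are invented sample data, and the example assumes this file is saved as UTF-8.

```php
<?php
$html = '<b>one</b> <b>two</b>';

// Default: .* is greedy and runs to the last closing tag.
preg_match('/<b>.*<\/b>/', $html, $greedy);
// U: the same quantifier becomes lazy ...
preg_match('/<b>.*<\/b>/U', $html, $lazy);
// ... and a trailing ? makes it greedy again.
preg_match('/<b>.*?<\/b>/U', $html, $greedyAgain);

// u: the dot counts UTF-8 characters instead of bytes.
$city     = 'Київ';   // 4 characters, 8 bytes in UTF-8
$byteWise = preg_match('/^.{4}$/', $city);
$charWise = preg_match('/^.{4}$/u', $city);
```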

(?# comment) - a comment in the body of the pattern.
(?:pattern) - grouping like '()', but without capturing, so no backreference is created
(?=pattern) - lookahead. For example, /\w+(?=\t)/ matches a word followed by a tab, but the '\t' character is not included in the result.
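
A sketch of the three constructs above on invented inputs. Because (?:...) does not capture, the first captured group is the one that follows it.

```php
<?php
// (?:...) groups the alternatives without taking up a group number.
preg_match('/(?:https?|ftp):\/\/(\w+)/', 'see ftp://example for files', $url);

// (?# ...) is ignored by the matcher.
preg_match('/\d+(?# the amount)px/', 'width: 120px', $size);

// Lookahead: the tab must follow the word but is not part of the match.
preg_match('/\w+(?=\t)/', "name\tvalue", $key);
```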

\NUMBER - A backreference inside the regex to one of its own earlier captured groups, where NUMBER is the number of the required group (counted by opening brackets). It matches the same text that the group actually matched, not the group's pattern. If the referenced group is itself repeated, the backreference holds the text from its last repetition.
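
Two common uses of backreferences, sketched with made-up sentences: finding a doubled word, and matching a quoted string that closes with the same quote character it opened with.

```php
<?php
// \1 must repeat exactly the word captured by the first group.
preg_match('/\b(\w+) \1\b/', 'this is is a test', $doubled);

// Group 1 captures the opening quote; \1 requires the same closing quote,
// so the apostrophe inside the double quotes does not end the match.
preg_match('/(["\'])(.*?)\1/', 'say "it\'s ok" now', $quoted);
```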
