如何在正则表达式中使用变量?

我想在正则表达式中使用一个变量，我如何在Python中做到这一点?

TEXTO = sys.argv[1]

if re.search(r"\b(?=\w)TEXTO\b(?!\w)", subject, re.IGNORECASE):
    # Successful match
else:
    # Match attempt failed

当前回答

从python 3.6开始，你也可以使用文字字符串插值，“f-strings”。在你的具体情况下，解决方案是:

if re.search(rf"\b(?=\w){TEXTO}\b(?!\w)", subject, re.IGNORECASE):
    ...do something

编辑:

由于评论中有一些关于如何处理特殊字符的问题，我想扩展我的回答:

原始字符串('r'):

在处理正则表达式中的特殊字符时，您必须了解的一个主要概念是区分字符串字面量和正则表达式本身。这里有很好的解释:

简而言之:

让我们说，而不是在TEXTO后面找到一个单词boundary \b，你想要匹配字符串\boundary。你必须写:

TEXTO = "Var"
subject = r"Var\boundary"

if re.search(rf"\b(?=\w){TEXTO}\\boundary(?!\w)", subject, re.IGNORECASE):
    print("match")

这只是因为我们使用了一个原始字符串(正则表达式前面有'r')，否则我们必须在正则表达式中写入“\\\\boundary”(四个反斜杠)。此外，如果没有'\r'， \b'将不再转换为单词边界，而是转换为退格!

re.escape:

基本上就是在任何特殊字符前加一个反斜杠。因此，如果你希望TEXTO中有一个特殊字符，你需要写:

if re.search(rf"\b(?=\w){re.escape(TEXTO)}\b(?!\w)", subject, re.IGNORECASE):
    print("match")

注意:对于任何版本> = python 3.7 : !, ", %, ', ,, /, :, ;, <, =, >, @, 和“不逃。只有在正则表达式中有意义的特殊字符仍然被转义。_从Python 3.3开始就没有转义。这里)

花括号:

如果要在使用f-字符串的正则表达式中使用量词，则必须使用双花括号。让我们假设你想匹配TEXTO后面恰好有2个数字:

if re.search(rf"\b(?=\w){re.escape(TEXTO)}\d{{2}}\b(?!\w)", subject, re.IGNORECASE):
    print("match")

2019-04-23 12:06:52

其他回答

你可以尝试使用格式语法sugarer的另一种用法:

re_genre = r'{}'.format(your_variable)
regex_pattern = re.compile(re_genre)

2019-04-18 09:06:16

我同意以上所有观点，除非:

sys。argv[1]有点像Chicken\d{2}-\d{2}一个\s*重要的\s*锚

sys.argv[1] = "Chicken\d{2}-\d{2}An\s*important\s*anchor"

你不会想要使用re.escape，因为在这种情况下，你希望它表现得像一个正则表达式

TEXTO = sys.argv[1]

if re.search(r"\b(?<=\w)" + TEXTO + "\b(?!\w)", subject, re.IGNORECASE):
    # Successful match
else:
    # Match attempt failed

2015-03-28 13:37:34

from re import search, IGNORECASE

def is_string_match(word1, word2):
    #  Case insensitively function that checks if two words are the same
    # word1: string
    # word2: string | list

    # if the word1 is in a list of words
    if isinstance(word2, list):
        for word in word2:
            if search(rf'\b{word1}\b', word, IGNORECASE):
                return True
        return False

    # if the word1 is same as word2
    if search(rf'\b{word1}\b', word2, IGNORECASE):
        return True
    return False

is_match_word = is_string_match("Hello", "hELLO") 
True

is_match_word = is_string_match("Hello", ["Bye", "hELLO", "@vagavela"])
True

is_match_word = is_string_match("Hello", "Bye")
False

2022-09-21 19:15:20

下面是你可以使用的另一种格式(在python 3.7上测试)

regex_str = r'\b(?< \ \w)%s\b(

我发现当你不能使用{}变量(这里替换为%s)时，它很有用。

2021-04-03 13:54:33

我需要搜索彼此相似的用户名，Ned Batchelder说的非常有用。然而，当我使用re.compile创建我的re搜索项时，我发现我有更清晰的输出:

pattern = re.compile(r"("+username+".*):(.*?):(.*?):(.*?):(.*)"
matches = re.findall(pattern, lines)

输出可以使用以下命令打印:

print(matches[1]) # prints one whole matching line (in this case, the first line)
print(matches[1][3]) # prints the fourth character group (established with the parentheses in the regex statement) of the first line.

2015-10-23 20:43:37

如何在正则表达式中使用变量?

推荐文章

最新文章

标签