如何在Python中从字符串中提取数字?

我想提取一个字符串中包含的所有数字。正则表达式和isdigit()方法哪个更适合这个目的?

例子:

line = "hello 12 hi 89"

结果:

[12, 89]

当前回答

使用下面的正则表达式是一种方法

lines = "hello 12 hi 89"
import re
output = []
#repl_str = re.compile('\d+.?\d*')
repl_str = re.compile('^\d+$')
#t = r'\d+.?\d*'
line = lines.split()
for word in line:
        match = re.search(repl_str, word)
        if match:
            output.append(float(match.group()))
print (output)

和findall Re.findall (r'\d+'， "hello 12 hi 89")

['12', '89']

re.findall(r'\b\d+\b'， "hello 12 hi 89 33F AC 777")

['12', '89', '777']

2018-08-16 05:39:54

其他回答

如果你知道字符串中只有一个数字，比如'hello 12 hi'，你可以尝试filter。

例如:

In [1]: int(''.join(filter(str.isdigit, '200 grams')))
Out[1]: 200
In [2]: int(''.join(filter(str.isdigit, 'Counters: 55')))
Out[2]: 55
In [3]: int(''.join(filter(str.isdigit, 'more than 23 times')))
Out[3]: 23

但是要小心!!：

In [4]: int(''.join(filter(str.isdigit, '200 grams 5')))
Out[4]: 2005

2016-04-05 18:20:43

我假设你想要浮点数，而不仅仅是整数，所以我会这样做:

l = []
for t in s.split():
    try:
        l.append(float(t))
    except ValueError:
        pass

请注意，这里发布的其他一些解决方案不适用于负数:

>>> re.findall(r'\b\d+\b', 'he33llo 42 I\'m a 32 string -30')
['42', '32', '30']

>>> '-3'.isdigit()
False

2010-11-27 00:28:48

我找到的最佳选择如下。它将提取一个数字，并可以消除任何类型的字符。

def extract_nbr(input_str):
    if input_str is None or input_str == '':
        return 0

    out_number = ''
    for ele in input_str:
        if ele.isdigit():
            out_number += ele
    return float(out_number)

2015-08-11 16:28:55

对于电话号码，您可以在regex中排除所有带\D的非数字字符:

import re

phone_number = "(619) 459-3635"
phone_number = re.sub(r"\D", "", phone_number)
print(phone_number)

r"\D"中的r代表原始字符串。这是必要的。如果没有它，Python将把\D视为转义字符。

2020-05-29 22:06:12

@jmnas，我喜欢你的答案，但它没有找到浮动。我正在编写一个脚本来解析前往CNC铣床的代码，需要找到可以是整数或浮点数的X和Y维度，所以我将您的代码改编为以下内容。这就找到了int, float值为正和负。仍然没有找到十六进制格式的值，但你可以添加“x”和“A”通过“F”到num_char元组，我认为它会解析像“0x23AC”这样的东西。

s = 'hello X42 I\'m a Y-32.35 string Z30'
xy = ("X", "Y")
num_char = (".", "+", "-")

l = []

tokens = s.split()
for token in tokens:

    if token.startswith(xy):
        num = ""
        for char in token:
            # print(char)
            if char.isdigit() or (char in num_char):
                num = num + char

        try:
            l.append(float(num))
        except ValueError:
            pass

print(l)

2014-11-15 21:52:13

如何在Python中从字符串中提取数字?

推荐文章

最新文章

标签