是否有一个简单的方法来删除字符串中的多个空格?

假设这个字符串:

The   fox jumped   over    the log.

变成:

The fox jumped over the log.

在不分割和进入列表的情况下，最简单的实现方法(1-2行)是什么?

当前回答

要去除空白，考虑开头、结尾和单词之间的额外空白，可以使用:

(?<=\s) +|^ +(?=\s)| (?= +[\n\0])

第一个或处理前导空白，第二个或处理字符串开头的前导空白，最后一个处理尾随空白。

为了证明使用，这个链接将为您提供一个测试。

https://regex101.com/r/meBYli/4

这将与re.split函数一起使用。

2016-11-11 04:50:48

其他回答

类似于前面的解决方案，但更具体:用一个空格替换两个或多个空格:

>>> import re
>>> s = "The   fox jumped   over    the log."
>>> re.sub('\s{2,}', ' ', s)
'The fox jumped over the log.'

2009-10-09 21:58:27

我不得不同意Paul McGuire的评论。对我来说,

' '.join(the_string.split())

比快速生成正则表达式要好得多。

我的测量结果(Linux和Python 2.5)显示，先分离后连接的速度几乎比“re.sub(…)”快5倍，如果你一次预编译正则表达式并多次执行该操作，速度仍然快3倍。而且无论从哪方面看，它都更容易理解——更python化。

2009-10-10 02:39:51

一个简单的灵魂

>>> import re
>>> s="The   fox jumped   over    the log."
>>> print re.sub('\s+',' ', s)
The fox jumped over the log.

2015-11-04 06:11:39

我没有深入研究其他示例，但是我刚刚创建了这个方法来合并多个连续的空格字符。

它不使用任何库，虽然它的脚本长度相对较长，但它不是一个复杂的实现:

def spaceMatcher(command):
    """
    Function defined to consolidate multiple whitespace characters in
    strings to a single space
    """
    # Initiate index to flag if more than one consecutive character
    iteration
    space_match = 0
    space_char = ""
    for char in command:
      if char == " ":
          space_match += 1
          space_char += " "
      elif (char != " ") & (space_match > 1):
          new_command = command.replace(space_char, " ")
          space_match = 0
          space_char = ""
      elif char != " ":
          space_match = 0
          space_char = ""
   return new_command

command = None
command = str(input("Please enter a command ->"))
print(spaceMatcher(command))
print(list(spaceMatcher(command)))

2017-12-18 17:29:40

另一个选择:

>>> import re
>>> str = 'this is a            string with    multiple spaces and    tabs'
>>> str = re.sub('[ \t]+' , ' ', str)
>>> print str
this is a string with multiple spaces and tabs

2012-07-25 10:19:34

是否有一个简单的方法来删除字符串中的多个空格?

推荐文章

最新文章

标签