将字节转换为字符串

我将外部程序的标准输出捕获到一个字节对象中：

>>> from subprocess import *
>>> command_stdout = Popen(['ls', '-l'], stdout=PIPE).communicate()[0]
>>>
>>> command_stdout
b'total 0\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2\n'

我想把它转换成一个普通的Python字符串，这样我就可以这样打印了：

>>> print(command_stdout)
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2

如何使用Python 3将字节对象转换为str？

当前回答

对于Python 3，这是一种从字节转换为字符串的更安全和Python方法：

def byte_to_str(bytes_or_str):
    if isinstance(bytes_or_str, bytes): # Check if it's in bytes
        print(bytes_or_str.decode('utf-8'))
    else:
        print("Object not of byte type")

byte_to_str(b'total 0\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2\n')

输出：

total 0
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2

2017-01-18 07:21:09

其他回答

从系统-系统特定参数和功能：

要从标准流中写入或读取二进制数据，请使用底层二进制缓冲区。例如，要将字节写入stdout，请使用sys.stdout.buffer.write（b'abc'）。

2014-01-11 07:15:18

对于Python 3，这是一种从字节转换为字符串的更安全和Python方法：

def byte_to_str(bytes_or_str):
    if isinstance(bytes_or_str, bytes): # Check if it's in bytes
        print(bytes_or_str.decode('utf-8'))
    else:
        print("Object not of byte type")

byte_to_str(b'total 0\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1\n-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2\n')

输出：

total 0
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file1
-rw-rw-r-- 1 thomas thomas 0 Mar  3 07:03 file2

2017-01-18 07:21:09

对于“运行shell命令并将其输出作为文本而不是字节”的特定情况，在Python 3.7上，应该使用subprocess.run并传入text=True（以及capture_output=True来捕获输出）

command_result = subprocess.run(["ls", "-l"], capture_output=True, text=True)
command_result.stdout  # is a `str` containing your program's stdout

文本过去被称为universal_newlines，在Python 3.7中被更改（嗯，别名）。如果希望支持3.7之前的Python版本，请传入universal_newlines=True而不是text=True

2019-08-07 14:15:31

这将字节列表合并为字符串：

>>> bytes_data = [112, 52, 52]
>>> "".join(map(chr, bytes_data))
'p44'

2012-08-22 12:57:08

尝试使用这个；此函数将忽略所有非字符集（如UTF-8）二进制文件，并返回一个干净的字符串。它针对Python 3.6及更高版本进行了测试。

def bin2str(text, encoding = 'utf-8'):
    """Converts a binary to Unicode string by removing all non Unicode char
    text: binary string to work on
    encoding: output encoding *utf-8"""

    return text.decode(encoding, 'ignore')

在这里，函数将获取二进制并对其进行解码（使用Python预定义的字符集将二进制数据转换为字符，忽略参数忽略二进制中的所有非字符集数据，并最终返回所需的字符串值）。

如果您不确定编码，请使用sys.getdefaultencoding（）获取设备的默认编码。

2021-05-18 19:07:58

将字节转换为字符串

推荐文章

最新文章

标签