如何克隆列表，使其在分配后不会意外更改？

使用new_list=my_list时，对new_list的任何修改都会每次更改my_list。为什么会出现这种情况，以及如何克隆或复制列表以防止出现这种情况？

当前回答

通过id和gc查看内存的一个稍微实用的视角。

>>> b = a = ['hell', 'word']
>>> c = ['hell', 'word']

>>> id(a), id(b), id(c)
(4424020872, 4424020872, 4423979272) 
     |           |
      -----------

>>> id(a[0]), id(b[0]), id(c[0])
(4424018328, 4424018328, 4424018328) # all referring to same 'hell'
     |           |           |
      -----------------------

>>> id(a[0][0]), id(b[0][0]), id(c[0][0])
(4422785208, 4422785208, 4422785208) # all referring to same 'h'
     |           |           |
      -----------------------

>>> a[0] += 'o'
>>> a,b,c
(['hello', 'word'], ['hello', 'word'], ['hell', 'word'])  # b changed too
>>> id(a[0]), id(b[0]), id(c[0])
(4424018384, 4424018384, 4424018328) # augmented assignment changed a[0],b[0]
     |           |
      -----------

>>> b = a = ['hell', 'word']
>>> id(a[0]), id(b[0]), id(c[0])
(4424018328, 4424018328, 4424018328) # the same hell
     |           |           |
      -----------------------

>>> import gc
>>> gc.get_referrers(a[0]) 
[['hell', 'word'], ['hell', 'word']]  # one copy belong to a,b, the another for c
>>> gc.get_referrers(('hell'))
[['hell', 'word'], ['hell', 'word'], ('hell', None)] # ('hello', None)

2019-11-23 19:01:46

其他回答

在Python中，请记住：

    list1 = ['apples','bananas','pineapples']
    list2 = list1

List2没有存储实际的列表，而是对list1的引用。因此，当您对list1执行任何操作时，list2也会发生变化。使用copy模块（非默认，在pip上下载）制作列表的原始副本（对于简单列表，copy.copy（）；对于嵌套列表，copy。deepcopy（））。这将生成一个不会随第一个列表而更改的副本。

2020-02-22 12:44:40

菲利克斯已经给出了一个很好的答案，但我想我应该对各种方法进行速度比较：

10.59秒（105.9µs/itn）-copy.depcopy（旧列表）10.16秒（101.6µs/itn）-纯Python Copy（）方法使用deepcopy复制类1.488秒（14.88µs/itn）-纯Python Copy（）方法不复制类（仅dicts/lists/tuples）0.325秒（3.25µs/itn）-对于old_list:new_list.append（项目）中的项目0.217秒（2.17µs/itn）-[i代表old_list]（列表理解）0.186秒（1.86µs/itn）-复制副本（old_list）0.075秒（0.75µs/itn）-列表（旧列表）0.053秒（0.53µs/itn）-新列表=[]；新列表扩展（旧列表）0.039秒（0.39µs/itn）-old_list[：]（列表切片）

所以最快的是列表切片。但请注意，与copy.deepcopy（）和python版本不同，copy.copy（）、list[：]和list（list）不会复制列表中的任何列表、字典和类实例，因此如果原始列表发生变化，它们也会在复制的列表中发生变化，反之亦然。

（如果有人感兴趣或想提出任何问题，以下是脚本：）

from copy import deepcopy

class old_class:
    def __init__(self):
        self.blah = 'blah'

class new_class(object):
    def __init__(self):
        self.blah = 'blah'

dignore = {str: None, unicode: None, int: None, type(None): None}

def Copy(obj, use_deepcopy=True):
    t = type(obj)

    if t in (list, tuple):
        if t == tuple:
            # Convert to a list if a tuple to
            # allow assigning to when copying
            is_tuple = True
            obj = list(obj)
        else:
            # Otherwise just do a quick slice copy
            obj = obj[:]
            is_tuple = False

        # Copy each item recursively
        for x in xrange(len(obj)):
            if type(obj[x]) in dignore:
                continue
            obj[x] = Copy(obj[x], use_deepcopy)

        if is_tuple:
            # Convert back into a tuple again
            obj = tuple(obj)

    elif t == dict:
        # Use the fast shallow dict copy() method and copy any
        # values which aren't immutable (like lists, dicts etc)
        obj = obj.copy()
        for k in obj:
            if type(obj[k]) in dignore:
                continue
            obj[k] = Copy(obj[k], use_deepcopy)

    elif t in dignore:
        # Numeric or string/unicode?
        # It's immutable, so ignore it!
        pass

    elif use_deepcopy:
        obj = deepcopy(obj)
    return obj

if __name__ == '__main__':
    import copy
    from time import time

    num_times = 100000
    L = [None, 'blah', 1, 543.4532,
         ['foo'], ('bar',), {'blah': 'blah'},
         old_class(), new_class()]

    t = time()
    for i in xrange(num_times):
        Copy(L)
    print 'Custom Copy:', time()-t

    t = time()
    for i in xrange(num_times):
        Copy(L, use_deepcopy=False)
    print 'Custom Copy Only Copying Lists/Tuples/Dicts (no classes):', time()-t

    t = time()
    for i in xrange(num_times):
        copy.copy(L)
    print 'copy.copy:', time()-t

    t = time()
    for i in xrange(num_times):
        copy.deepcopy(L)
    print 'copy.deepcopy:', time()-t

    t = time()
    for i in xrange(num_times):
        L[:]
    print 'list slicing [:]:', time()-t

    t = time()
    for i in xrange(num_times):
        list(L)
    print 'list(L):', time()-t

    t = time()
    for i in xrange(num_times):
        [i for i in L]
    print 'list expression(L):', time()-t

    t = time()
    for i in xrange(num_times):
        a = []
        a.extend(L)
    print 'list extend:', time()-t

    t = time()
    for i in xrange(num_times):
        a = []
        for y in L:
            a.append(y)
    print 'list append:', time()-t

    t = time()
    for i in xrange(num_times):
        a = []
        a.extend(i for i in L)
    print 'generator expression extend:', time()-t

2010-04-10 10:16:24

让我们从头开始，探讨这个问题。

假设您有两个列表：

list_1 = ['01', '98']
list_2 = [['01', '98']]

我们必须复制两个列表，现在从第一个列表开始：

因此，首先让我们将变量副本设置为原始列表list_1：

copy = list_1

现在，如果你认为copy复制了list_1，那么你错了。id函数可以告诉我们两个变量是否可以指向同一个对象。让我们试试看：

print(id(copy))
print(id(list_1))

输出为：

4329485320
4329485320

这两个变量是完全相同的参数。你惊讶吗？

所以我们知道，Python不会在变量中存储任何内容，变量只是引用对象，对象存储值。这里的对象是一个列表，但我们通过两个不同的变量名创建了对同一对象的两个引用。这意味着两个变量都指向同一个对象，只是名称不同。

当您执行copy=list_1时，它实际上正在执行以下操作：

在这里，图像list_1和copy是两个变量名，但两个变量的对象是相同的，即列表。

因此，如果您尝试修改复制的列表，那么它也会修改原始列表，因为那里只有一个列表，无论您是从复制的列表还是从原始列表进行修改，都会修改该列表：

copy[0] = "modify"

print(copy)
print(list_1)

输出：

['modify', '98']
['modify', '98']

所以它修改了原始列表：

现在，让我们来看看复制列表的Pythonic方法。

copy_1 = list_1[:]

该方法解决了我们遇到的第一个问题：

print(id(copy_1))
print(id(list_1))

4338792136
4338791432

因此，我们可以看到两个列表都有不同的id，这意味着两个变量都指向不同的对象。所以这里的实际情况是：

现在，让我们尝试修改列表，看看我们是否仍然面临前面的问题：

copy_1[0] = "modify"

print(list_1)
print(copy_1)

输出为：

['01', '98']
['modify', '98']

如您所见，它只修改了复制的列表。这意味着它奏效了。

你认为我们结束了吗？不，让我们尝试复制嵌套列表。

copy_2 = list_2[:]

list2应该引用另一个对象，该对象是list2的副本。让我们检查一下：

print(id((list_2)), id(copy_2))

我们得到输出：

4330403592 4330403528

现在我们可以假设两个列表都指向不同的对象，所以现在让我们尝试修改它，看看它给出了我们想要的：

copy_2[0][1] = "modify"

print(list_2, copy_2)

这为我们提供了输出：

[['01', 'modify']] [['01', 'modify']]

这可能看起来有点令人困惑，因为我们以前使用的相同方法奏效了。让我们试着理解这一点。

当您这样做时：

copy_2 = list_2[:]

你只是在复制外部列表，而不是内部列表。我们可以再次使用id函数来检查这一点。

print(id(copy_2[0]))
print(id(list_2[0]))

输出为：

4329485832
4329485832

当我们执行copy_2=list_2[：]时，会发生以下情况：

它创建列表副本，但仅创建外部列表副本，而不是嵌套列表副本。两个变量的嵌套列表都相同，因此如果您尝试修改嵌套列表，那么它也会修改原始列表，因为嵌套列表对象对于两个列表都相同。

解决方案是什么？解决方案是deepcopy函数。

from copy import deepcopy
deep = deepcopy(list_2)

让我们检查一下：

print(id((list_2)), id(deep))

4322146056 4322148040

两个外部列表都有不同的ID。让我们在内部嵌套列表上尝试一下。

print(id(deep[0]))
print(id(list_2[0]))

输出为：

4322145992
4322145800

正如您所看到的，两个ID都不同，这意味着我们可以假设两个嵌套列表现在都指向不同的对象。

这意味着当您执行deep=deepcopy（list_2）时，实际发生了什么：

两个嵌套列表都指向不同的对象，现在它们有嵌套列表的单独副本。

现在，让我们尝试修改嵌套列表，看看它是否解决了前面的问题：

deep[0][1] = "modify"
print(list_2, deep)

它输出：

[['01', '98']] [['01', 'modify']]

如您所见，它没有修改原始嵌套列表，只修改了复制的列表。

2017-11-13 07:04:17

deepcopy选项是唯一适用于我的方法：

from copy import deepcopy

a = [   [ list(range(1, 3)) for i in range(3) ]   ]
b = deepcopy(a)
b[0][1]=[3]
print('Deep:')
print(a)
print(b)
print('-----------------------------')
a = [   [ list(range(1, 3)) for i in range(3) ]   ]
b = a*1
b[0][1]=[3]
print('*1:')
print(a)
print(b)
print('-----------------------------')
a = [   [ list(range(1, 3)) for i in range(3) ] ]
b = a[:]
b[0][1]=[3]
print('Vector copy:')
print(a)
print(b)
print('-----------------------------')
a = [   [ list(range(1, 3)) for i in range(3) ]  ]
b = list(a)
b[0][1]=[3]
print('List copy:')
print(a)
print(b)
print('-----------------------------')
a = [   [ list(range(1, 3)) for i in range(3) ]  ]
b = a.copy()
b[0][1]=[3]
print('.copy():')
print(a)
print(b)
print('-----------------------------')
a = [   [ list(range(1, 3)) for i in range(3) ]  ]
b = a
b[0][1]=[3]
print('Shallow:')
print(a)
print(b)
print('-----------------------------')

导致输出：

Deep:
[[[1, 2], [1, 2], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------
*1:
[[[1, 2], [3], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------
Vector copy:
[[[1, 2], [3], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------
List copy:
[[[1, 2], [3], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------
.copy():
[[[1, 2], [3], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------
Shallow:
[[[1, 2], [3], [1, 2]]]
[[[1, 2], [3], [1, 2]]]
-----------------------------

2020-04-11 11:19:40

Python 3.6计时

下面是使用Python 3.6.8的计时结果。请记住，这些时间是相对的，而不是绝对的。

我坚持只做浅层复制，还添加了一些在Python 2中不可能的新方法，例如list.copy（）（Python 3切片的等价物）和两种形式的列表解包（*new_list，=list和new_list=[*list]）：

METHOD                TIME TAKEN
b = [*a]               2.75180600000021
b = a * 1              3.50215399999990
b = a[:]               3.78278899999986  # Python 2 winner (see above)
b = a.copy()           4.20556500000020  # Python 3 "slice equivalent" (see above)
b = []; b.extend(a)    4.68069800000012
b = a[0:len(a)]        6.84498999999959
*b, = a                7.54031799999984
b = list(a)            7.75815899999997
b = [i for i in a]    18.4886440000000
b = copy.copy(a)      18.8254879999999
b = []
for item in a:
  b.append(item)      35.4729199999997

我们可以看到，Python 2的获胜者仍然表现出色，但并没有远远超过Python 3 list.copy（），特别是考虑到后者的出色可读性。

黑马是拆包和重新包装方法（b=[*a]），它比原始切片快约25%，比其他拆包方法（*b，=a）快两倍多。

b=a*1的表现也出奇地好。

请注意，这些方法不会为列表以外的任何输入输出等效结果。它们都适用于可切片对象，少数适用于任何可迭代对象，但只有copy.copy（）适用于更一般的Python对象。

以下是相关方的测试代码（此处的模板）：

import timeit

COUNT = 50000000
print("Array duplicating. Tests run", COUNT, "times")
setup = 'a = [0,1,2,3,4,5,6,7,8,9]; import copy'

print("b = list(a)\t\t", timeit.timeit(stmt='b = list(a)', setup=setup, number=COUNT))
print("b = copy.copy(a)\t", timeit.timeit(stmt='b = copy.copy(a)', setup=setup, number=COUNT))
print("b = a.copy()\t\t", timeit.timeit(stmt='b = a.copy()', setup=setup, number=COUNT))
print("b = a[:]\t\t", timeit.timeit(stmt='b = a[:]', setup=setup, number=COUNT))
print("b = a[0:len(a)]\t\t", timeit.timeit(stmt='b = a[0:len(a)]', setup=setup, number=COUNT))
print("*b, = a\t\t\t", timeit.timeit(stmt='*b, = a', setup=setup, number=COUNT))
print("b = []; b.extend(a)\t", timeit.timeit(stmt='b = []; b.extend(a)', setup=setup, number=COUNT))
print("b = []; for item in a: b.append(item)\t", timeit.timeit(stmt='b = []\nfor item in a:  b.append(item)', setup=setup, number=COUNT))
print("b = [i for i in a]\t", timeit.timeit(stmt='b = [i for i in a]', setup=setup, number=COUNT))
print("b = [*a]\t\t", timeit.timeit(stmt='b = [*a]', setup=setup, number=COUNT))
print("b = a * 1\t\t", timeit.timeit(stmt='b = a * 1', setup=setup, number=COUNT))

2017-04-05 01:01:10

如何克隆列表，使其在分配后不会意外更改？

推荐文章

最新文章

标签