Python有有序集吗?

Python有一个有序字典。那么有序集呢?

当前回答

更新:这个答案在Python 3.7已经过时了。请参阅上面jrc的回答以获得更好的解决方案。出于历史原因，我将保留这个答案。

有序集在功能上是有序字典的一种特殊情况。

字典的键是唯一的。因此，如果忽略有序字典中的值(例如，将它们赋值为None)，那么本质上是有序集。

从Python 3.1和2.7开始，就有了collections.OrderedDict。下面是OrderedSet的一个示例实现。(注意，只有少数方法需要定义或重写:集合。有序字典和集合。让我们来做繁重的工作。

import collections

class OrderedSet(collections.OrderedDict, collections.MutableSet):

    def update(self, *args, **kwargs):
        if kwargs:
            raise TypeError("update() takes no keyword arguments")

        for s in args:
            for e in s:
                 self.add(e)

    def add(self, elem):
        self[elem] = None

    def discard(self, elem):
        self.pop(elem, None)

    def __le__(self, other):
        return all(e in other for e in self)

    def __lt__(self, other):
        return self <= other and self != other

    def __ge__(self, other):
        return all(e in self for e in other)

    def __gt__(self, other):
        return self >= other and self != other

    def __repr__(self):
        return 'OrderedSet([%s])' % (', '.join(map(repr, self.keys())))

    def __str__(self):
        return '{%s}' % (', '.join(map(repr, self.keys())))
    
    difference = property(lambda self: self.__sub__)
    difference_update = property(lambda self: self.__isub__)
    intersection = property(lambda self: self.__and__)
    intersection_update = property(lambda self: self.__iand__)
    issubset = property(lambda self: self.__le__)
    issuperset = property(lambda self: self.__ge__)
    symmetric_difference = property(lambda self: self.__xor__)
    symmetric_difference_update = property(lambda self: self.__ixor__)
    union = property(lambda self: self.__or__)

2009-10-31 10:17:39

其他回答

如果您已经在代码中使用了pandas，那么它的Index对象的行为就非常像一个有序集，如本文所示。

文章中的例子:

indA = pd.Index([1, 3, 5, 7, 9])
indB = pd.Index([2, 3, 5, 7, 11])

indA & indB  # intersection
indA | indB  # union
indA - indB  # difference
indA ^ indB  # symmetric difference

2015-09-25 14:13:08

在官方库中没有OrderedSet。我对所有数据结构做了详尽的备忘单，供您参考。

DataStructure = {
    'Collections': {
        'Map': [
            ('dict', 'OrderDict', 'defaultdict'),
            ('chainmap', 'types.MappingProxyType')
        ],
        'Set': [('set', 'frozenset'), {'multiset': 'collection.Counter'}]
    },
    'Sequence': {
        'Basic': ['list', 'tuple', 'iterator']
    },
    'Algorithm': {
        'Priority': ['heapq', 'queue.PriorityQueue'],
        'Queue': ['queue.Queue', 'multiprocessing.Queue'],
        'Stack': ['collection.deque', 'queue.LifeQueue']
        },
    'text_sequence': ['str', 'byte', 'bytearray']
}

2017-12-06 10:50:34

对于许多目的来说，简单地调用sorted就足够了。例如

>>> s = set([0, 1, 2, 99, 4, 40, 3, 20, 24, 100, 60])
>>> sorted(s)
[0, 1, 2, 3, 4, 20, 24, 40, 60, 99, 100]

如果你要重复使用它，调用排序函数会产生开销，所以你可能想要保存结果列表，只要你完成了对集合的更改。如果您需要维护唯一的元素并进行排序，我同意从具有任意值(如None)的集合中使用OrderedDict的建议。

2013-02-20 22:52:44

正如其他人所说，OrderedDict在功能方面是有序集的超集，但如果你需要一个与API交互的集，并且不需要它是可变的，OrderedDict.keys()实际上是一个实现abc.collections.Set:

import random
from collections import OrderedDict, abc

a = list(range(0, 100))
random.shuffle(a)

# True
a == list(OrderedDict((i, 0) for i in a).keys())

# True
isinstance(OrderedDict().keys(), abc.Set)

注意事项是不可变性，必须像字典一样构建集合，但它很简单，只使用内置。

2020-09-02 02:33:54

我可以为您提供一个比OrderedSet更好的方法:boltons有一个纯python、2/3兼容的IndexedSet类型，它不仅是一个有序集，而且还支持索引(与列表一样)。

简单的pip install boltons(或复制setutils.py到你的代码库中)，导入IndexedSet和:

>>> from boltons.setutils import IndexedSet
>>> x = IndexedSet(list(range(4)) + list(range(8)))
>>> x
IndexedSet([0, 1, 2, 3, 4, 5, 6, 7])
>>> x - set(range(2))
IndexedSet([2, 3, 4, 5, 6, 7])
>>> x[-1]
7
>>> fcr = IndexedSet('freecreditreport.com')
>>> ''.join(fcr[:fcr.index('.')])
'frecditpo'

一切都是唯一的，并保持有序。完全披露:IndexedSet是我写的，但这也意味着如果有任何问题，您可以找我麻烦。：）

2016-02-07 20:41:45

Python有有序集吗?

推荐文章

最新文章

标签