如何验证一个列表是否是另一个列表的子集?

我需要验证一个列表是否是另一个列表的子集-布尔返回是我所寻求的。

在交叉路口后的小列表上测试相等性是最快的方法吗?考虑到需要比较的数据集的数量，性能是极其重要的。

在讨论的基础上补充进一步的事实:

对于许多测试，这两个列表是否都是相同的?其中一个是静态查找表。需要是一个列表吗?它不是——静态查找表可以是任何性能最好的表。动态的是一个字典，我们从中提取键来执行静态查找。

在这种情况下，最佳解决方案是什么?

当前回答

集合理论不适用于列表，因为重复使用集合理论会导致错误的答案。

例如:

a = [1, 3, 3, 3, 5]
b = [1, 3, 3, 4, 5]
set(b) > set(a)

没有任何意义。是的，它给出了一个错误的答案，但这是不正确的，因为集合理论只是比较:1,3,5与1,3,4,5。你必须包括所有副本。

相反，你必须计算每一项的出现次数，并使用大于等于来检查。这不是很昂贵，因为它不使用O(N²)操作，也不需要快速排序。

#!/usr/bin/env python

from collections import Counter

def containedInFirst(a, b):
  a_count = Counter(a)
  b_count = Counter(b)
  for key in b_count:
    if a_count.has_key(key) == False:
      return False
    if b_count[key] > a_count[key]:
      return False
  return True


a = [1, 3, 3, 3, 5]
b = [1, 3, 3, 4, 5]
print "b in a: ", containedInFirst(a, b)

a = [1, 3, 3, 3, 4, 4, 5]
b = [1, 3, 3, 4, 5]
print "b in a: ", containedInFirst(a, b)

然后运行这个，你会得到:

$ python contained.py 
b in a:  False
b in a:  True

2017-11-14 10:11:25

其他回答

如果你在问一个列表是否“包含”在另一个列表中，那么:

>>>if listA in listB: return True

如果你想知道listA中的每个元素在listB中是否有相等数量的匹配元素，试试:

all(True if listA.count(item) <= listB.count(item) else False for item in listA)

2015-06-25 09:23:21

在python 3.5中，您可以执行[*set()][index]来获取元素。它比其他方法要慢得多。

one = [1, 2, 3]
two = [9, 8, 5, 3, 2, 1]

result = set(x in two for x in one)

[*result][0] == True

或者只用len和set

len(set(a+b)) == len(set(a))

2017-12-11 03:59:36

下面是我如何知道如果一个列表是另一个列表的子集，在我的情况下，序列对我很重要。

def is_subset(list_long,list_short):
    short_length = len(list_short)
    subset_list = []
    for i in range(len(list_long)-short_length+1):
        subset_list.append(list_long[i:i+short_length])
    if list_short in subset_list:
        return True
    else: return False

2019-04-24 09:08:29

另一种解决方案是使用十字路口。

one = [1, 2, 3]
two = [9, 8, 5, 3, 2, 1]

set(one).intersection(set(two)) == set(one)

集合的交点包含集合1

(OR)

one = [1, 2, 3]
two = [9, 8, 5, 3, 2, 1]

set(one) & (set(two)) == set(one)

2017-12-11 04:01:24

大多数解决方案都认为列表没有副本。如果你的列表确实有重复，你可以试试这个:

def isSubList(subList,mlist):
    uniqueElements=set(subList)
    for e in uniqueElements:
        if subList.count(e) > mlist.count(e):
            return False     
    # It is sublist
    return True

它确保子列表的元素永远不会与list不同，或者公共元素的数量更多。

lst=[1,2,2,3,4]
sl1=[2,2,3]
sl2=[2,2,2]
sl3=[2,5]

print(isSubList(sl1,lst)) # True
print(isSubList(sl2,lst)) # False
print(isSubList(sl3,lst)) # False

2020-04-02 15:18:20

如何验证一个列表是否是另一个列表的子集?

推荐文章

最新文章

标签