是否有一种内置/快速的方法来使用字典的键列表来获得对应项的列表?
例如,我有:
>>> mydict = {'one': 1, 'two': 2, 'three': 3}
>>> mykeys = ['three', 'one']
我如何使用mykeys在字典中作为列表获得相应的值?
>>> mydict.WHAT_GOES_HERE(mykeys)
[3, 1]
是否有一种内置/快速的方法来使用字典的键列表来获得对应项的列表?
例如,我有:
>>> mydict = {'one': 1, 'two': 2, 'three': 3}
>>> mykeys = ['three', 'one']
我如何使用mykeys在字典中作为列表获得相应的值?
>>> mydict.WHAT_GOES_HERE(mykeys)
[3, 1]
当前回答
如果您发现自己经常这样做,您可能希望继承dict的子类,以获取键的列表并返回值的列表。
>>> d = MyDict(mydict)
>>> d[mykeys]
[3, 1]
下面是一个演示实现。
class MyDict(dict):
def __getitem__(self, key):
getitem = super().__getitem__
if isinstance(key, list):
return [getitem(x) for x in key]
else:
return getitem(key)
继承dict的子类需要更多的工作,另外你可能想要实现.get(), .__setitem__(), 以及.__delitem__()等。
其他回答
reduce(lambda x,y: mydict.get(y) and x.append(mydict[y]) or x, mykeys,[])
以防字典里没有钥匙。
如果您发现自己经常这样做,您可能希望继承dict的子类,以获取键的列表并返回值的列表。
>>> d = MyDict(mydict)
>>> d[mykeys]
[3, 1]
下面是一个演示实现。
class MyDict(dict):
def __getitem__(self, key):
getitem = super().__getitem__
if isinstance(key, list):
return [getitem(x) for x in key]
else:
return getitem(key)
继承dict的子类需要更多的工作,另外你可能想要实现.get(), .__setitem__(), 以及.__delitem__()等。
比较一下速度:
Python 2.7.11 |Anaconda 2.4.1 (64-bit)| (default, Dec 7 2015, 14:10:42) [MSC v.1500 64 bit (AMD64)] on win32
In[1]: l = [0,1,2,3,2,3,1,2,0]
In[2]: m = {0:10, 1:11, 2:12, 3:13}
In[3]: %timeit [m[_] for _ in l] # list comprehension
1000000 loops, best of 3: 762 ns per loop
In[4]: %timeit map(lambda _: m[_], l) # using 'map'
1000000 loops, best of 3: 1.66 µs per loop
In[5]: %timeit list(m[_] for _ in l) # a generator expression passed to a list constructor.
1000000 loops, best of 3: 1.65 µs per loop
In[6]: %timeit map(m.__getitem__, l)
The slowest run took 4.01 times longer than the fastest. This could mean that an intermediate result is being cached
1000000 loops, best of 3: 853 ns per loop
In[7]: %timeit map(m.get, l)
1000000 loops, best of 3: 908 ns per loop
In[33]: from operator import itemgetter
In[34]: %timeit list(itemgetter(*l)(m))
The slowest run took 9.26 times longer than the fastest. This could mean that an intermediate result is being cached
1000000 loops, best of 3: 739 ns per loop
列表理解和itemgetter是最快的方法。
更新
对于大型随机列表和地图,我得到了一些不同的结果:
Python 2.7.11 |Anaconda 2.4.1 (64-bit)| (default, Dec 7 2015, 14:10:42) [MSC v.1500 64 bit (AMD64)] on win32
In[2]: import numpy.random as nprnd
l = nprnd.randint(1000, size=10000)
m = dict([(_, nprnd.rand()) for _ in range(1000)])
from operator import itemgetter
import operator
f = operator.itemgetter(*l)
%timeit f(m)
1000 loops, best of 3: 1.14 ms per loop
%timeit list(itemgetter(*l)(m))
1000 loops, best of 3: 1.68 ms per loop
%timeit [m[_] for _ in l] # list comprehension
100 loops, best of 3: 2 ms per loop
%timeit map(m.__getitem__, l)
100 loops, best of 3: 2.05 ms per loop
%timeit list(m[_] for _ in l) # a generator expression passed to a list constructor.
100 loops, best of 3: 2.19 ms per loop
%timeit map(m.get, l)
100 loops, best of 3: 2.53 ms per loop
%timeit map(lambda _: m[_], l)
100 loops, best of 3: 2.9 ms per loop
所以在这种情况下,明显的赢家是f = operator。itemgetter(*l);F (m)和clear outsider: map(lambda _: m[_], l)。
Python 3.6.4更新
import numpy.random as nprnd
l = nprnd.randint(1000, size=10000)
m = dict([(_, nprnd.rand()) for _ in range(1000)])
from operator import itemgetter
import operator
f = operator.itemgetter(*l)
%timeit f(m)
1.66 ms ± 74.2 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
%timeit list(itemgetter(*l)(m))
2.1 ms ± 93.2 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
%timeit [m[_] for _ in l] # list comprehension
2.58 ms ± 88.8 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
%timeit list(map(m.__getitem__, l))
2.36 ms ± 60.7 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
%timeit list(m[_] for _ in l) # a generator expression passed to a list constructor.
2.98 ms ± 142 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
%timeit list(map(m.get, l))
2.7 ms ± 284 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
%timeit list(map(lambda _: m[_], l)
3.14 ms ± 62.6 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
因此,Python 3.6.4的结果几乎相同。
除了list-comp,还有其他几种方法:
如果key未找到,则生成列表并抛出异常:__getitem__ mykey) 如果没有找到键,则使用None构建列表:得到,mykey)
或者,使用operator。Itemgetter可以返回一个元组:
from operator import itemgetter
myvalues = itemgetter(*mykeys)(mydict)
# use `list(...)` if list is required
注意:在Python3中,map返回一个迭代器而不是一个列表。使用list(map(…))作为列表。
试试这个:
mydict = {'one': 1, 'two': 2, 'three': 3}
mykeys = ['three', 'one','ten']
newList=[mydict[k] for k in mykeys if k in mydict]
print newList
[3, 1]