在Java中增加Map值的最有效方法

我希望这个问题对这个论坛来说不是太基本的，但我们会看到的。我想知道如何重构一些代码以获得更好的性能，这是运行了很多次。

假设我正在使用Map(可能是HashMap)创建一个词频列表，其中每个键都是一个包含要统计的单词的String，值是一个Integer，该Integer在每次找到单词的标记时递增。

在Perl中，增加这样一个值非常简单:

$map{$word}++;

但在Java中，这要复杂得多。下面是我目前的做法:

int count = map.containsKey(word) ? map.get(word) : 0;
map.put(word, count + 1);

这当然依赖于新Java版本中的自动装箱特性。我想知道您是否可以建议一种更有效的方法来增加这个值。是否有更好的性能理由避开Collections框架而使用其他框架呢?

更新:我已经对几个答案做了测试。见下文。

当前回答

你应该意识到你最初的尝试

int count = map.containsKey(word) ? map.get(word) : 0;

包含映射上两个可能代价高昂的操作，即containsKey和get。前者执行的操作可能与后者非常相似，因此您要做两次相同的工作!

如果查看Map的API，当Map不包含所请求的元素时，get操作通常返回null。

注意，这将得到一个像

map.put( key, map.get(key) + 1 );

dangerous, since it might yield NullPointerExceptions. You should check for a null first. Also note, and this is very important, that HashMaps can contain nulls by definition. So not every returned null says "there is no such element". In this respect, containsKey behaves differently from get in actually telling you whether there is such an element. Refer to the API for details. For your case, however, you might not want to distinguish between a stored null and "noSuchElement". If you don't want to permit nulls you might prefer a Hashtable. Using a wrapper library as was already proposed in other answers might be a better solution to manual treatment, depending on the complexity of your application. To complete the answer (and I forgot to put that in at first, thanks to the edit function!), the best way of doing it natively, is to get into a final variable, check for null and put it back in with a 1. The variable should be final because it's immutable anyway. The compiler might not need this hint, but its clearer that way. final HashMap map = generateRandomHashMap(); final Object key = fetchSomeKey(); final Integer i = map.get(key); if (i != null) { map.put(i + 1); } else { // do something } If you do not want to rely on autoboxing, you should say something like map.put(new Integer(1 + i.getValue())); instead.

2008-09-17 10:20:32

其他回答

我将使用Apache Collections Lazy Map(将值初始化为0)，并使用Apache Lang中的MutableIntegers作为该映射中的值。

在您的方法中，最大的代价是必须搜索两次地图。在我这里，你只需要做一次。只需要获取值(如果没有，它将被初始化)并增加它。

2008-09-17 10:21:19

很简单，只需使用Map.java中的内置函数，如下所示

map.put(key, map.getOrDefault(key, 0) + 1);

2019-03-25 15:33:42

另一种方法是创建一个可变整数:

class MutableInt {
  int value = 0;
  public void inc () { ++value; }
  public int get () { return value; }
}
...
Map<String,MutableInt> map = new HashMap<String,MutableInt> ();
MutableInt value = map.get (key);
if (value == null) {
  value = new MutableInt ();
  map.put (key, value);
} else {
  value.inc ();
}

当然，这意味着创建一个额外的对象，但与创建一个Integer(即使是Integer. valueof)相比，开销不应该那么多。

2008-09-17 09:47:03

可以使用Java 8提供的Map接口中的computeIfAbsent方法。

final Map<String,AtomicLong> map = new ConcurrentHashMap<>();
map.computeIfAbsent("A", k->new AtomicLong(0)).incrementAndGet();
map.computeIfAbsent("B", k->new AtomicLong(0)).incrementAndGet();
map.computeIfAbsent("A", k->new AtomicLong(0)).incrementAndGet(); //[A=2, B=1]

方法computeIfAbsent检查指定的键是否已经与某个值关联?如果没有关联值，则尝试使用给定的映射函数计算其值。在任何情况下，它都会返回与指定键关联的当前值(现有值或计算值)，如果计算值为空则返回null。

另一方面，如果你遇到多个线程更新一个公共和的情况，你可以看看LongAdder类。在高争用情况下，该类的预期吞吐量显著高于AtomicLong，但代价是更高的空间消耗。

2016-05-25 14:21:13

你确定这是瓶颈吗?你做过性能分析吗?

尝试使用NetBeans分析器(它是免费的，内置在NB 6.1中)来查看热点。

最后，JVM升级(比如从1.5升级到>1.6)通常是一种廉价的性能增强。即使是版本号的升级也能提供良好的性能提升。如果您在Windows上运行，并且这是一个服务器类应用程序，请在命令行上使用-server来使用server Hotspot JVM。在Linux和Solaris机器上，这是自动检测到的。

2008-09-17 12:12:33

在Java中增加Map值的最有效方法

推荐文章

最新文章

标签