重写isEqual:和散列的最佳实践

在Objective-C中如何正确地覆盖isEqual: ?“陷阱”似乎是，如果两个对象相等(由isEqual:方法决定)，它们必须具有相同的散列值。

Cocoa Fundamentals Guide的Introspection部分确实有一个关于如何重写isEqual:的例子，复制如下，用于一个名为MyWidget的类:

- (BOOL)isEqual:(id)other {
    if (other == self)
        return YES;
    if (!other || ![other isKindOfClass:[self class]])
        return NO;
    return [self isEqualToWidget:other];
}

- (BOOL)isEqualToWidget:(MyWidget *)aWidget {
    if (self == aWidget)
        return YES;
    if (![(id)[self name] isEqual:[aWidget name]])
        return NO;
    if (![[self data] isEqualToData:[aWidget data]])
        return NO;
    return YES;
}

它检查指针是否相等，然后是类是否相等，最后使用isEqualToWidget:比较对象，后者只检查名称和数据属性。这个例子没有说明如何重写哈希。

让我们假设有其他属性不影响平等，比如年龄。难道不应该重写哈希方法，以便只有名称和数据影响哈希吗?如果是，你会怎么做?只是添加名称和数据的散列吗?例如:

- (NSUInteger)hash {
    NSUInteger hash = 0;
    hash += [[self name] hash];
    hash += [[self data] hash];
    return hash;
}

这足够了吗?有更好的技术吗?如果你有基本类型，比如int呢?将它们转换为NSNumber以获得它们的散列?或者像NSRect这样的结构?

(脑屁:最初把“位或”和|=写在一起。意味着添加。)

当前回答

开始

 NSUInteger prime = 31;
 NSUInteger result = 1;

然后对于每一个原始元素

 result = prime * result + var

对于对象，你用0表示nil，否则它们的hashcode。

 result = prime * result + [var hash];

对于布尔值，使用两个不同的值

 result = prime * result + ((var)?1231:1237);

解释与归因

这不是tcurdt的作品，评论要求更多的解释，所以我相信编辑归因是公平的。

This algorithm was popularized in the book "Effective Java", and the relevant chapter can currently be found online here. That book popularized the algorithm, which is now a default in a number of Java applications (including Eclipse). It derived, however, from an even older implementation which is variously attributed to Dan Bernstein or Chris Torek. That older algorithm originally floated around on Usenet, and certain attribution is difficult. For example, there is some interesting commentary in this Apache code (search for their names) that references the original source.

最重要的是，这是一个非常古老，简单的哈希算法。它不是性能最好的，甚至在数学上也没有被证明是一个“好”算法。但它很简单，而且很多人长期使用它，效果很好，所以它有很大的历史支持。

2008-10-31 17:58:53

其他回答

我发现本页是重写等号和哈希类型方法的有用指南。它包括一个计算哈希码的不错的算法。这个页面是面向Java的，但是很容易适应Objective-C/Cocoa。

2008-10-31 17:37:16

Sorry if I risk sounding a complete boffin here but... ...nobody bothered mentioning that to follow 'best practices' you should definitely not specify an equals method that would NOT take into account all data owned by your target object, e.g whatever data is aggregated to your object, versus an associate of it, should be taken into account when implementing equals. If you don't want to take, say 'age' into account in a comparison, then you should write a comparator and use that to perform your comparisons instead of isEqual:.

如果您定义了一个isEqual:方法来任意执行相等比较，那么一旦您忘记了equals解释中的“扭曲”，您就会冒这个方法被其他开发人员甚至您自己误用的风险。

因此，虽然这是一个关于哈希的很好的问答，你通常不需要重新定义哈希方法，你可能应该定义一个特别的比较器。

2009-11-05 16:40:55

对关键属性的哈希值进行简单的XOR就足够了 99%的情况下。

例如:

- (NSUInteger)hash
{
    return [self.name hash] ^ [self.data hash];
}

Mattt Thompson在http://nshipster.com/equality/找到了解决方案(他也在他的帖子中提到了这个问题:~)

2013-11-16 02:57:09

坚持，当然一个更简单的方法来做到这一点是首先覆盖- (NSString)描述，并提供一个字符串表示你的对象状态(你必须在这个字符串中表示你的对象的整个状态)。

然后，只需提供以下哈希的实现:

- (NSUInteger)hash {
    return [[self description] hash];
}

这是基于这样的原则:“如果两个字符串对象相等(由isEqualToString:方法决定)，它们必须具有相同的散列值。”

来源:NSString类参考

2012-01-17 21:48:58

等号契约和散列契约在Java世界中有很好的指定和深入的研究(参见@mipardi的回答)，但所有相同的考虑都应该适用于Objective-C。

Eclipse在Java中生成这些方法的工作非常可靠，所以这里有一个手工移植到Objective-C的Eclipse示例:

- (BOOL)isEqual:(id)object {
    if (self == object)
        return true;
    if ([self class] != [object class])
        return false;
    MyWidget *other = (MyWidget *)object;
    if (_name == nil) {
        if (other->_name != nil)
            return false;
    }
    else if (![_name isEqual:other->_name])
        return false;
    if (_data == nil) {
        if (other->_data != nil)
            return false;
    }
    else if (![_data isEqual:other->_data])
        return false;
    return true;
}

- (NSUInteger)hash {
    const NSUInteger prime = 31;
    NSUInteger result = 1;
    result = prime * result + [_name hash];
    result = prime * result + [_data hash];
    return result;
}

对于一个子类YourWidget，它添加了一个属性serialNo:

- (BOOL)isEqual:(id)object {
    if (self == object)
        return true;
    if (![super isEqual:object])
        return false;
    if ([self class] != [object class])
        return false;
    YourWidget *other = (YourWidget *)object;
    if (_serialNo == nil) {
        if (other->_serialNo != nil)
            return false;
    }
    else if (![_serialNo isEqual:other->_serialNo])
        return false;
    return true;
}

- (NSUInteger)hash {
    const NSUInteger prime = 31;
    NSUInteger result = [super hash];
    result = prime * result + [_serialNo hash];
    return result;
}

这个实现避免了isEqual: from Apple示例中的一些子类化陷阱:

Apple's class test other isKindOfClass:[self class] is asymmetric for two different subclasses of MyWidget. Equality needs to be symmetric: a=b if and only if b=a. This could easily be fixed by changing the test to other isKindOfClass:[MyWidget class], then all MyWidget subclasses would be mutually comparable. Using an isKindOfClass: subclass test prevents subclasses from overriding isEqual: with a refined equality test. This is because equality needs to be transitive: if a=b and a=c then b=c. If a MyWidget instance compares equal to two YourWidget instances, then those YourWidget instances must compare equal to each other, even if their serialNo differs.

第二个问题可以通过只考虑属于完全相同类的对象相等来解决，因此这里使用[self class] != [object class]测试。对于典型的应用程序类，这似乎是最好的方法。

然而，在某些情况下，isKindOfClass: test更可取。这在框架类中比在应用程序类中更典型。例如，不管NSString/NSMutableString的区别如何，也不管NSString类集群中涉及哪些私有类，任何NSString都应该与其他具有相同底层字符序列的NSString进行比较。

在这种情况下，isEqual:应该具有定义良好的、记录良好的行为，并且应该清楚地说明子类不能重写此行为。在Java中，“不重写”限制可以通过将equals和hashcode方法标记为final来强制执行，但Objective-C没有等效的方法。

2013-05-19 10:32:42

重写isEqual:和散列的最佳实践

推荐文章

最新文章

标签