Strings as Primary Keys in SQL Database

Strings are slower in joins, and in real life they are very rarely truly unique (even when they are supposed to be). The only advantage is that they can reduce the number of joins if you are joining to the primary table only to get the name. However, strings are also often subject to change, which creates the problem of having to fix all related records when the company name changes or the person gets married. This can be a huge performance hit, and if the tables that should be related are not all actually related (this happens more often than you think), you might end up with data mismatches as well. An integer that will never change through the life of the record is a far safer choice from a data integrity standpoint as well as from a performance standpoint. Natural keys are usually not so good for maintenance of the data.

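I would also point out that the best of both approaches is often to use an auto-incrementing key (or, in some specialized cases, a GUID) as the PK and then put a unique index on the natural key. You get the faster joins, you don't get duplicate records, and you don't have to update a million child records because a company name changed.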
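
A minimal sketch of that combined approach, using Python's built-in sqlite3 module. The `company` and `invoice` tables and their columns are hypothetical names chosen for illustration, and other database engines spell the auto-increment column differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves FK enforcement off by default
conn.executescript("""
CREATE TABLE company (
    id   INTEGER PRIMARY KEY AUTOINCREMENT,  -- surrogate key, never changes
    name TEXT NOT NULL UNIQUE                -- natural key, protected by a unique index
);
CREATE TABLE invoice (
    id         INTEGER PRIMARY KEY AUTOINCREMENT,
    company_id INTEGER NOT NULL REFERENCES company(id),
    amount     REAL NOT NULL
);
""")

cur = conn.execute("INSERT INTO company (name) VALUES (?)", ("Acme Ltd",))
company_id = cur.lastrowid
conn.executemany(
    "INSERT INTO invoice (company_id, amount) VALUES (?, ?)",
    [(company_id, 10.0), (company_id, 20.0), (company_id, 30.0)],
)

# Renaming the company touches one parent row; the child rows are untouched.
renamed = conn.execute(
    "UPDATE company SET name = ? WHERE id = ?", ("Acme Holdings", company_id)
).rowcount

# The unique index still rejects a second company with the same name.
try:
    conn.execute("INSERT INTO company (name) VALUES (?)", ("Acme Holdings",))
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# Every invoice sees the new name through the join on the integer key.
invoice_names = [
    row[0]
    for row in conn.execute(
        "SELECT c.name FROM invoice i JOIN company c ON c.id = i.company_id"
    )
]
```

Had `invoice` referenced the company by name instead, the rename would have needed to rewrite every invoice row as well.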

2009-02-05 19:54:57

Technically yes, but if a string makes sense as the primary key then you should probably use it. It all depends on the size of the table and the length of the string that is going to be the primary key (longer strings are more expensive to compare). I wouldn't necessarily use a string for a table that has millions of rows, but on smaller tables the slowdown from using a string will be minuscule compared to the headaches you can get from an integer that doesn't mean anything in relation to the data.

2009-02-05 19:44:16

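Don't worry about performance until you have a simple and sound design that agrees with the subject matter the data describes and fits well with the intended use of the data. Then, if performance problems emerge, you can deal with them by tweaking the system.

In this case, it's almost always better to go with a string as a natural primary key, provided that you can trust it. Don't worry that it's a string, as long as the string is reasonably short, say about 25 characters at most. You won't pay a big price in terms of performance.

Do the data entry people or automatic data sources always provide a value for the supposed natural key, or is it sometimes omitted? Is it occasionally wrong in the input data? If so, how are errors detected and corrected?

Are the programmers and interactive users who specify queries able to use the natural key to get what they want?

If you can't trust the natural key, invent a surrogate. If you invent a surrogate, you might as well make it an integer. Then you have to decide whether to conceal the surrogate from the user community. Some developers who didn't conceal the surrogate key came to regret it.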

2009-02-06 19:33:40

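Indexes imply lots of comparisons.

Typically, strings are longer than integers, and collation rules may be applied when comparing them, so comparing strings is usually more computationally intensive than comparing integers.

Sometimes, though, it's faster to use a string as the primary key than to make an extra join against a table that maps strings to numeric ids.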
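
To make the extra join concrete, here is a small sketch using Python's built-in sqlite3 module. The country and city tables are hypothetical, and the example only shows that the join disappears when the short string is itself the key; it does not measure speed.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Variant A: surrogate integer key; the code lives only in the parent table.
CREATE TABLE country_a (id INTEGER PRIMARY KEY, code TEXT NOT NULL UNIQUE);
CREATE TABLE city_a (
    name       TEXT NOT NULL,
    country_id INTEGER NOT NULL REFERENCES country_a(id)
);

-- Variant B: the short string itself is the primary key.
CREATE TABLE country_b (code TEXT PRIMARY KEY);
CREATE TABLE city_b (
    name         TEXT NOT NULL,
    country_code TEXT NOT NULL REFERENCES country_b(code)
);

INSERT INTO country_a (id, code) VALUES (1, 'FR'), (2, 'JP');
INSERT INTO city_a VALUES ('Paris', 1), ('Osaka', 2);
INSERT INTO country_b VALUES ('FR'), ('JP');
INSERT INTO city_b VALUES ('Paris', 'FR'), ('Osaka', 'JP');
""")

# Variant A needs a join to turn the integer back into the readable code.
with_join = conn.execute(
    "SELECT ci.name, co.code FROM city_a ci "
    "JOIN country_a co ON co.id = ci.country_id ORDER BY ci.name"
).fetchall()

# Variant B reads the code straight from the child table.
without_join = conn.execute(
    "SELECT name, country_code FROM city_b ORDER BY name"
).fetchall()
```

Both queries return the same rows; whether skipping the join outweighs the cost of comparing strings depends on the engine, the indexes, and the data.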

2009-02-05 19:44:32

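There may be a very big misunderstanding about strings in databases. Almost everyone assumes that the database representation of a number is more compact than that of a string. They think that numbers in a database are represented the same way as numbers in memory. But that is not true. In most cases the number representation is closer to a string-like representation.

The speed of using a number or a string depends more on the indexing than on the type itself.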

2009-02-05 20:13:16
