CAP定理-可用性和分区容忍

当我试图理解CAP中的“Availability”(A)和“Partition tolerance”(P)时，我发现很难理解各种文章的解释。

我有一种感觉，a和P可以同时出现(我知道事实并非如此，这就是我不能理解的原因!)

简单地解释一下，什么是A和P以及它们之间的区别?

当前回答

Brewer's keynote, the Gilbert paper, and many other treatments, places C, A and P on an equal footing as desirable properties of an implementation and effectively say 'choose two!'. However, this is often considered to be a misleading presentation, since you cannot build - or choose! - 'partition tolerance': your system either might experience partitions or it won't. CAP is better understood as describing the tradeoffs you have to make when you are building a system that may suffer partitions. In practice, this is every distributed system: there is no 100% reliable network. So (at least in the distributed context) there is no realistic CA system. You will potentially suffer partitions, therefore you must at some point compromise C or A.

https://github.com/henryr/cap-faq#10-why-do-some-people-get-annoyed-when-i-characterise-my-system-as-ca

2021-05-07 04:04:26

其他回答

根据上图C是断开的，但A,B, D可以继续工作。现在我们可以调用系统部分工作(分区容忍)。

假设一个特定的事务只需要a、B和d，系统可以执行它而不会产生任何不一致。

但是当C必须参与一个特定的事务时，系统可以以两种方式执行。

1.由于C不可用，A可以拒绝用户请求。

So the system has Partition-Tolerance and consistency (P,C).
But no availability, because of the rejection.

2.A可以将C接收到的消息保存在A的本地内存中，并在C连接回来时传输。

So the system has Partition-Tolerance and availability (P,A).
But no consistency.because C has not updated.

2022-09-20 09:19:03

一致性意味着整个集群中的数据是相同的，因此您可以从/写入任何节点并获得相同的数据。

可用性意味着即使集群中的某个节点宕机，也能够访问集群。

分区容忍意味着即使两个节点之间存在“分区”(通信中断)(两个节点都在工作，但不能通信)，集群也能继续工作。

为了同时获得可用性和分区容忍，您必须放弃一致性。考虑一下在master-master设置中是否有两个节点X和Y。现在，X和Y之间的网络通信中断了，所以它们不能同步更新。此时你可以:

A)允许节点不同步(放弃一致性)，或者

B)认为集群“关闭”(放弃可用性)

所有可用的组合是:

CA - data is consistent between all nodes - as long as all nodes are online - and you can read/write from any node and be sure that the data is the same, but if you ever develop a partition between nodes, the data will be out of sync (and won't re-sync once the partition is resolved). CP - data is consistent between all nodes, and maintains partition tolerance (preventing data desync) by becoming unavailable when a node goes down. AP - nodes remain online even if they can't communicate with each other and will resync data once the partition is resolved, but you aren't guaranteed that all nodes will have the same data (either during or after the partition)

您应该注意，CA系统实际上并不存在(即使有些系统声称存在)。

2012-09-10 08:14:45

一致性——当我们发送读请求时，如果它正在返回结果，它应该返回客户端请求给出的最近的写。可用性—您的读/写请求应该总是成功的。分区容忍度——当网络分区(某些机器相互通信的问题)发生时，系统仍然可以工作。

在分布式环境中，存在网络分区发生的可能性，我们无法避免CAP的“P”。因此，我们在“一致性”和“可用性”之间进行选择。

http://bigdatadose.com/understanding-cap-theorem/

2015-03-05 08:46:06

以下是我讨论CAP的方式，特别是关于P。

CA只有在单机数据库(可能有复制，但所有数据都在一个“故障块”上-服务器不被认为是部分故障)的情况下才可能使用。

如果您的问题需要向外扩展、分布式和多服务器，则可能发生网络分区。您已经需要p了，我所处理的问题中很少有适用于总是单服务器的范例(或者，如Stonebraker所说，“分布式是桌面赌注”)。如果您能找到CA问题，那么像传统的非向外扩展RDBMS这样的解决方案将提供很多好处。

对我来说，罕见:所以我们继续讨论AP和CP。

当您有分区时，只能在AP操作和CP操作之间进行选择。如果网络和硬件运行正常，你就能得到你的蛋糕并吃掉它。

让我们讨论AP / CP的区别。

AP -当有网络分区时，让独立的部分自由运行。

CP——当存在网络分区时，关闭节点或禁止读写，这样就会出现确定性故障。

我喜欢能两者兼顾的架构，因为有些问题是AP问题，有些是CP问题，而有些数据库可以两者兼顾。在CP和AP解决方案中，也有一些微妙之处。

例如，在AP数据集中，您可能同时存在不一致的读取和生成写入冲突-这是两种不同的AP模式。您的系统是否可以配置为具有高读可用性但不允许写冲突的AP ?或者您的AP系统可以接受写入冲突，具有强大而灵活的解决系统?你最终需要两者吗，或者你可以选择一个只做其中一个的系统?

在CP系统中，小分区(单个服务器)的不可用性有多少?更大的复制会增加CP系统中的不可用性，系统如何处理这些权衡?

这些都是CP和AP要问的问题。

现在在这个领域有一个很好的阅读是Brewer的“12年后”的帖子。我相信这将清晰地推进CAP辩论，并强烈推荐它。

http://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed

2014-07-09 17:31:47

一致性:

对于给定的客户端，读操作保证返回最近的写操作(如ACID)。如果在此期间有任何请求，则必须等待节点之间/节点内的数据同步完成。

可用性:

每个节点(如果没有失败)总是执行查询，并且应该总是响应请求。它是否返回最新的副本并不重要。

Partition-tolerance:

当发生网络分区时，系统将继续工作。

关于AP，可用性(始终可访问)可以与(Cassendra)或没有(RDBMS)分区容忍

图片来源

2017-04-18 09:19:45

CAP定理-可用性和分区容忍

推荐文章

最新文章

标签