Tactics for using PHP in a high-load site

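Before you answer this: I have never developed anything popular enough to reach high server loads. Treat me as (sigh) an alien that has just landed on the planet, albeit one that knows PHP and a few optimisation techniques.

I'm developing a tool in PHP that could attract quite a lot of users if it works out right. However, while I'm fully capable of developing the program, I'm pretty much clueless when it comes to building something that can deal with huge traffic. So here are a few questions about it (feel free to turn this question into a resource thread as well).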

Databases

At the moment I plan to use the MySQLi features in PHP5. However, how should I set up the databases in relation to users and content? Do I actually need multiple databases? At the moment everything's jumbled into one database, although I've been considering moving user data into one, actual content into another, and core site content (template masters etc.) into a third. My reasoning is that sending queries to different databases will ease the load on each of them, since right now one database = 3 load sources. Would this still be effective if they were all on the same server?

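Caching

I have a template system that is used to build the pages and swap out variables. Master templates are stored in the database, and each time a template is called its cached copy (an HTML document) is loaded. At the moment I have two types of variable in these templates: static vars and dynamic vars. Static vars are usually things like page names or the name of the site, things that don't change often; dynamic vars are things that change on each page load.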

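My question on this:

Say I have comments on different articles. Which is the better solution: store a simple comment template and render the comments (from a DB call) each time the page is loaded, or store a cached copy of the comments page as an HTML page, which is re-cached each time a comment is added, edited or deleted?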

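Finally

Does anyone have any tips or pointers for running a high-load site on PHP? I'm pretty sure it's a workable language for this - Facebook and Yahoo! both give it a prominent place - but are there any experiences I should watch out for?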

Current answer

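I run a website with 7 to 8 million visits a month. Not terribly much, but enough that our server was feeling the load. The solution we chose was simple: Memcache at the database level. This solution works well if database load is your main problem.

We started out using Memcache to cache entire objects and the most frequently used database results. It did work, but it also introduced bugs (we might have avoided some of those if we had been more careful).

So we changed our approach. We built a database wrapper (with exactly the same methods as our old database layer, so it was easy to switch), and then we subclassed it to provide memcached database access methods.

Now all you have to do is decide whether a query can use cached (and possibly out-of-date) results or not. Most of the queries run by users are now fetched directly from Memcache. The exceptions are updates and inserts, which on the main website only happen for logging. This rather simple measure reduced our server load by about 80%.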
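The wrapper-plus-subclass arrangement described above might look roughly like the sketch below. It illustrates the pattern only and is not the answerer's code: it is written in Python rather than PHP so that it runs on its own, an in-process dict stands in for Memcache, SQLite stands in for MySQL, and `Database`, `CachedDatabase`, `use_cache` and `ttl` are hypothetical names.

```python
import sqlite3
import time


class Database:
    """Plain wrapper: every query goes straight to the database."""

    def __init__(self, conn):
        self.conn = conn

    def query(self, sql, params=()):
        return self.conn.execute(sql, params).fetchall()

    def execute(self, sql, params=()):
        self.conn.execute(sql, params)
        self.conn.commit()


class CachedDatabase(Database):
    """Same interface, but reads may be served from a cache.

    A dict stands in for Memcache here (assumption for this sketch).
    """

    def __init__(self, conn, ttl=60):
        super().__init__(conn)
        self.ttl = ttl
        self._cache = {}  # (sql, params) -> (expiry timestamp, rows)

    def query(self, sql, params=(), use_cache=True):
        if not use_cache:
            # Caller needs exact results: bypass the cache entirely.
            return super().query(sql, params)
        key = (sql, tuple(params))
        hit = self._cache.get(key)
        if hit is not None and hit[0] > time.time():
            return hit[1]  # possibly out of date, by design
        rows = super().query(sql, params)
        self._cache[key] = (time.time() + self.ttl, rows)
        return rows


conn = sqlite3.connect(":memory:")
db = CachedDatabase(conn, ttl=60)
db.execute("CREATE TABLE articles (id INTEGER PRIMARY KEY, title TEXT)")
db.execute("INSERT INTO articles (title) VALUES (?)", ("first",))

select = "SELECT title FROM articles ORDER BY id"
fresh = db.query(select)                   # cache miss: reads the database
db.execute("INSERT INTO articles (title) VALUES (?)", ("second",))
stale = db.query(select)                   # cache hit: misses the new row
exact = db.query(select, use_cache=False)  # bypass: sees both rows
```

Because `CachedDatabase` keeps the same method names as `Database`, existing call sites keep working, and the only per-query decision is whether to pass `use_cache=False`.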

2008-08-26 09:38:41

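Other answers

APC is an absolute must. Not only does it make a great caching system, but the gain from automatically cached PHP files is a godsend. As for the multiple-database idea, I don't think you would get much out of having different databases on the same server. It may give you a bit of a speed gain at query time, but I doubt it would be worth the effort of deploying and maintaining the code for all three while keeping them in sync.

I also highly recommend running Xdebug to find the bottlenecks in your program. It made optimization a breeze for me.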

2008-08-23 22:45:58

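It looks like I was wrong. MySQLi is still being developed. But according to this article, PDO_MySQL is now being contributed to by the MySQL team. From the article: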

The MySQL Improved Extension - mysqli - is the flagship. It supports all features of the MySQL Server including Charsets, Prepared Statements and Stored Procedures. The driver offers a hybrid API: you can use a procedural or object-oriented programming style based on your preference. mysqli comes with PHP 5 and up. Note that the End of life for PHP 4 is 2008-08-08. The PHP Data Objects (PDO) are a database access abstraction layer. PDO allows you to use the same API calls for various databases. PDO does not offer any degree of SQL abstraction. PDO_MYSQL is a MySQL driver for PDO. PDO_MYSQL comes with PHP 5. As of PHP 5.3 MySQL developers actively contribute to it. The PDO benefit of a unified API comes at the price that MySQL specific features, for example multiple statements, are not fully supported through the unified API. Please stop using the first MySQL driver for PHP ever published: ext/mysql. Since the introduction of the MySQL Improved Extension - mysqli - in 2004 with PHP 5 there is no reason to still use the oldest driver around. ext/mysql does not support Charsets, Prepared Statements and Stored Procedures. It is limited to the feature set of MySQL 4.0. Note that the Extended Support for MySQL 4.0 ends at 2008-12-31. Don't limit yourself to the feature set of such old software! Upgrade to mysqli, see also Converting_to_MySQLi. mysql is in maintenance only mode from our point of view.

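To me, the article appears to be biased towards MySQLi. I suppose I am biased towards PDO. I really prefer PDO over MySQLi. It is straightforward to me. The API is a lot closer to other languages I have programmed in. The object-oriented database interface seems to work better.

I haven't come across any MySQL feature that wasn't available through PDO. I would be surprised if I ever did.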

2008-08-24 14:14:48

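If you're working with large amounts of data and caching isn't cutting it, look into Sphinx. We've had great results using SphinxSearch, not only for better text searching but also as a data retrieval replacement for MySQL when dealing with larger tables. With SphinxSE (the MySQL plugin), the performance gains were several times greater than what we got from caching, and implementing it in the application is a cinch.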

2009-04-15 16:49:55

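The first question is: how big do you really expect it to get? And how much do you plan to invest in your infrastructure? Since you feel the need to ask the question here, I'm guessing that you expect to start small, on a limited budget.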

Performance is irrelevant if the site is not available. And for availability you need horizontal scaling. The minimum you can sensibly get away with is 2 servers, both running Apache, PHP and MySQL. Set up one DBMS as a slave to the other. Do all the writes on the master, and all the reads on the local database (whichever that is), unless for some reason you need to read back the data you've just written (in that case use the master). Make sure you've got the machinery in place to automatically promote the slave and fence the master. Use round-robin DNS for the webserver addresses to give more affinity for the slave node.
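The read/write rule in the paragraph above can be sketched as a small router. This is only an illustration under stated assumptions, written in Python rather than PHP: `ConnectionRouter` and `pick` are hypothetical names, plain strings stand in for real connection handles, and the "read back from the master after a write" rule is tracked per router instance (for example, one instance per request).

```python
class ConnectionRouter:
    """Pick a database per statement: writes go to the master, reads go
    to the local replica unless this router has already written."""

    WRITE_VERBS = ("INSERT", "UPDATE", "DELETE", "REPLACE")

    def __init__(self, master, local):
        self.master = master
        self.local = local
        self._wrote = False  # becomes True after the first write

    def pick(self, sql):
        verb = sql.lstrip().split(None, 1)[0].upper()
        if verb in self.WRITE_VERBS:
            self._wrote = True
            return self.master
        # The replica may lag behind, so reads that follow a write
        # are sent to the master to see the data just written.
        if self._wrote:
            return self.master
        return self.local


# Strings stand in for connection objects (assumption for this sketch).
router = ConnectionRouter(master="db-master", local="db-local")
first_read = router.pick("SELECT * FROM users WHERE id = 1")
write = router.pick("UPDATE users SET name = 'x' WHERE id = 1")
read_back = router.pick("SELECT name FROM users WHERE id = 1")
```

Failover (promoting the slave and fencing the master) is deliberately left out; it belongs to the infrastructure rather than to application code like this.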

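Partitioning your data across different database nodes at this stage is a very bad idea. However, you may want to consider splitting it across different databases on the same server (which will make it easier to partition across nodes when you overtake Facebook).

Make sure you've got monitoring and data analysis tools in place to measure your site's performance and identify bottlenecks. Most performance problems can be fixed by writing better SQL or fixing the database schema.

Keeping your template cache in the database is a dumb idea: the database should be the central common repository for structured data. Keep your template cache on the local filesystem of your webservers. It will be available faster and won't slow down your database access.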
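A filesystem-backed template cache of the kind suggested above could be sketched like this. It is a minimal illustration in Python rather than PHP; `FileTemplateCache`, `get`, `invalidate` and the `render` callback are hypothetical names, and a temporary directory stands in for a cache directory on the webserver's local disk.

```python
import hashlib
import os
import tempfile


class FileTemplateCache:
    """Store rendered templates as files on the local filesystem."""

    def __init__(self, directory):
        self.directory = directory
        os.makedirs(directory, exist_ok=True)

    def _path(self, name):
        # Hash the name so any template key maps to a safe file name.
        digest = hashlib.sha256(name.encode("utf-8")).hexdigest()
        return os.path.join(self.directory, digest + ".html")

    def get(self, name, render):
        """Return the cached HTML, calling render() only on a miss."""
        path = self._path(name)
        try:
            with open(path, encoding="utf-8") as fh:
                return fh.read()
        except FileNotFoundError:
            html = render()
            # Write to a temporary file, then rename it into place so
            # readers never see a half-written cache entry.
            tmp = path + ".tmp"
            with open(tmp, "w", encoding="utf-8") as fh:
                fh.write(html)
            os.replace(tmp, path)
            return html

    def invalidate(self, name):
        """Drop a cached entry, e.g. after the master template changes."""
        try:
            os.remove(self._path(name))
        except FileNotFoundError:
            pass


render_calls = []


def render_home():
    render_calls.append("home")  # record each real render
    return "<h1>Home</h1>"


cache = FileTemplateCache(tempfile.mkdtemp())
first = cache.get("home", render_home)   # miss: renders and writes the file
second = cache.get("home", render_home)  # hit: read from disk, no render
cache.invalidate("home")
third = cache.get("home", render_home)   # miss again after invalidation
```

Each webserver keeps its own copy in this arrangement, so invalidation has to reach every server; the sketch does not address that.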

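Use an opcode cache.

Spend lots of time studying your site and its logs to understand why it's running so slowly.

Push as much caching as possible onto the client.

Use mod_gzip to compress everything you can.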
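As one possible reading of "push caching onto the client", the sketch below shows a response that carries `Cache-Control` and `ETag` headers and answers a matching `If-None-Match` with `304 Not Modified`. It is an illustration in Python rather than PHP; `respond` and its parameters are hypothetical, and a real site would normally get this from the web server or framework.

```python
import hashlib


def respond(body, if_none_match=None, max_age=3600):
    """Return (status, headers, body) for a cacheable resource."""
    # A content hash works as a validator: it changes when the body does.
    etag = '"' + hashlib.sha256(body).hexdigest()[:16] + '"'
    headers = {
        "ETag": etag,
        "Cache-Control": "public, max-age=%d" % max_age,
    }
    if if_none_match == etag:
        # The client's copy is still current: send no body at all.
        return 304, headers, b""
    return 200, headers, body


css = b"body { color: black; }"
status1, headers1, body1 = respond(css)
# A later request from the same client echoes the ETag it was given.
status2, headers2, body2 = respond(css, if_none_match=headers1["ETag"])
```

While `max-age` has not expired the browser does not contact the server at all; after that, the `ETag` check keeps the revalidation response small.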

2010-03-26 16:19:52

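I don't see myself switching away from MySQL anytime soon, so I guess I don't need PDO's abstraction capabilities. Thanks for those articles, DavidM; they've helped me a lot.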

2008-08-24 14:25:56
