Linux中的线程与进程

最近我听到一些人说，在Linux中，使用进程几乎总是比使用线程更好，因为Linux在处理进程方面非常高效，而且与线程相关的问题太多了(比如锁)。然而，我对此持怀疑态度，因为在某些情况下，线程似乎可以带来相当大的性能提升。

因此，我的问题是，当遇到线程和进程都可以很好地处理的情况时，我应该使用进程还是线程?例如，如果我正在编写一个web服务器，我应该使用进程还是线程(或组合)?

当前回答

多线程是为受虐狂准备的。：）

If you are concerned about an environment where you are constantly creating threads/forks, perhaps like a web server handling requests, you can pre-fork processes, hundreds if necessary. Since they are Copy on Write and use the same memory until a write occurs, it's very fast. They can all block, listening on the same socket and the first one to accept an incoming TCP connection gets to run with it. With g++ you can also assign functions and variables to be closely placed in memory (hot segments) to ensure when you do write to memory, and cause an entire page to be copied at least subsequent write activity will occur on the same page. You really have to use a profiler to verify that kind of stuff but if you are concerned about performance, you should be doing that anyway.

Development time of threaded apps is 3x to 10x times longer due to the subtle interaction on shared objects, threading "gotchas" you didn't think of, and very hard to debug because you cannot reproduce thread interaction problems at will. You may have to do all sort of performance killing checks like having invariants in all your classes that are checked before and after every function and you halt the process and load the debugger if something isn't right. Most often it's embarrassing crashes that occur during production and you have to pore through a core dump trying to figure out which threads did what. Frankly, it's not worth the headache when forking processes is just as fast and implicitly thread safe unless you explicitly share something. At least with explicit sharing you know exactly where to look if a threading style problem occurs.

如果性能如此重要，那就增加另一台计算机和负载平衡。对于开发人员调试一个多线程应用程序的成本，即使是由一个有经验的多线程程序编写的应用程序，你可能会买4块40核的英特尔主板，每块都有64g内存。

That being said, there are asymmetric cases where parallel processing isn't appropriate, like, you want a foreground thread to accept user input and show button presses immediately, without waiting for some clunky back end GUI to keep up. Sexy use of threads where multiprocessing isn't geometrically appropriate. Many things like that just variables or pointers. They aren't "handles" that can be shared in a fork. You have to use threads. Even if you did fork, you'd be sharing the same resource and subject to threading style issues.

2021-05-18 18:12:49

其他回答

你的任务有多紧密耦合?

如果它们可以彼此独立，那么就使用流程。如果它们相互依赖，则使用线程。这样，您就可以终止并重新启动坏进程，而不会影响其他任务的操作。

2009-04-30 16:17:01

在大多数情况下，我更喜欢进程而不是线程。当您有一个相对较小的任务(每个划分的任务单元占用的进程开销>>时间)，并且需要在它们之间共享内存时，线程可能会很有用。想象一个大数组。另外(离题)，请注意，如果您的CPU利用率是100%或接近100%，那么多线程或处理将没有任何好处。(事实上情况会更糟)

2010-06-17 21:22:21

线程/进程之间的决定取决于您将使用它来做什么。进程的好处之一是它有一个PID，可以在不终止父进程的情况下被杀死。

对于一个真实世界的web服务器的例子，apache 1.3过去只支持多进程，但是在2.0中他们增加了一个抽象，这样你就可以在两者之间切换。评论似乎同意进程更健壮，但线程可以提供更好的性能(除了那些进程性能很差且您只想使用线程的窗口)。

2009-04-30 16:04:52

我不得不同意你所听到的。当我们对我们的集群(xhpl等)进行基准测试时，我们总是通过进程而不是线程获得明显更好的性能。< /轶事>

2009-04-30 15:37:35