是传递const std::string &作为参数的日子?

我最近听了Herb Sutter的一个演讲，他认为通过const &传递std::vector和std::string的原因已经基本消失了。他建议现在最好编写如下这样的函数:

std::string do_something ( std::string inval )
{
   std::string return_val;
   // ... do stuff ...
   return return_val;
}

我知道return_val在函数返回时将是一个右值，因此可以使用move语义返回，这非常便宜。然而，inval仍然比引用(通常实现为指针)的大小大得多。这是因为std::string有各种组件，包括指向堆的指针和用于短字符串优化的成员char[]。所以在我看来，通过引用传递仍然是一个好主意。

谁能解释一下赫伯为什么会这么说?

当前回答

Herb Sutter和Bjarne Stroustroup一起推荐使用const std::string&作为形参类型;见https://github.com/isocpp/CppCoreGuidelines/blob/master/CppCoreGuidelines.md#Rf-in。

这里有一个在其他答案中没有提到的陷阱:如果你将一个字符串字面值传递给一个const std::string& parameter，它将传递一个临时字符串的引用，该字符串是动态创建的，用于保存字面值的字符。如果然后保存该引用，那么一旦释放临时字符串，它将无效。为了安全起见，您必须保存副本，而不是参考资料。这个问题源于字符串字面值是const char[N]类型，需要升级为std::string。

下面的代码说明了陷阱和解决方法，以及一个较小的效率选项——使用const char*方法重载，如在c++中是否有一种方法将字符串文字作为引用传递。

(注意:Sutter & Stroustroup建议，如果你保留了字符串的副本，也要提供一个带有&&形参和std::move()的重载函数。)

#include <string>
#include <iostream>
class WidgetBadRef {
public:
    WidgetBadRef(const std::string& s) : myStrRef(s)  // copy the reference...
    {}

    const std::string& myStrRef;    // might be a reference to a temporary (oops!)
};

class WidgetSafeCopy {
public:
    WidgetSafeCopy(const std::string& s) : myStrCopy(s)
            // constructor for string references; copy the string
    {std::cout << "const std::string& constructor\n";}

    WidgetSafeCopy(const char* cs) : myStrCopy(cs)
            // constructor for string literals (and char arrays);
            // for minor efficiency only;
            // create the std::string directly from the chars
    {std::cout << "const char * constructor\n";}

    const std::string myStrCopy;    // save a copy, not a reference!
};

int main() {
    WidgetBadRef w1("First string");
    WidgetSafeCopy w2("Second string"); // uses the const char* constructor, no temp string
    WidgetSafeCopy w3(w2.myStrCopy);    // uses the String reference constructor
    std::cout << w1.myStrRef << "\n";   // garbage out
    std::cout << w2.myStrCopy << "\n";  // OK
    std::cout << w3.myStrCopy << "\n";  // OK
}

输出:

Const char *构造函数常量std::string&构造函数第二个字符串第二个字符串

2016-01-27 15:50:57

其他回答

正如@ jduzgosz在评论中指出的那样，Herb在另一个(稍后?)谈话中给出了其他建议，大致可以从这里看到:https://youtu.be/xnqTKD8uD64?t=54m50s。

他的建议可以归结为，对于一个接受所谓汇聚参数的函数f，只使用值形参，假设您将从这些汇聚参数中移动construct。

与分别为左值和右值参数定制的f的最佳实现相比，这种通用方法只是同时为左值和右值参数增加了move构造函数的开销。要了解为什么会出现这种情况，假设f接受一个值形参，其中T是某个复制和移动构造类型:

void f(T x) {
  T y{std::move(x)};
}

使用左值参数调用f将导致调用一个复制构造函数来构造x，调用一个移动构造函数来构造y。另一方面，使用右值参数调用f将导致调用一个移动构造函数来构造x，并调用另一个移动构造函数来构造y。

一般来说，f对左值参数的最佳实现如下:

void f(const T& x) {
  T y{x};
}

在这种情况下，只调用一个复制构造函数来构造y。对于右值参数，f的最佳实现通常如下所示:

void f(T&& x) {
  T y{std::move(x)};
}

在这种情况下，只调用一个move构造函数来构造y。

因此，一个明智的妥协是，取一个value形参，并有一个额外的move构造函数调用，用于左值或右值参数，这也是Herb在演讲中给出的建议。

正如@ jdlugosz在评论中指出的那样，仅对将从sink参数构造某个对象的函数才有意义。当函数f复制其实参时，按值传递的方法比一般的按常量引用传递的方法开销更大。保留形参副本的函数f的值传递方法将具有如下形式:

void f(T x) {
  T y{...};
  ...
  y = std::move(x);
}

在这种情况下，左值实参有一个复制构造和一个move赋值，右值实参有一个move构造和move赋值。左值参数的最佳情况是:

void f(const T& x) {
  T y{...};
  ...
  y = x;
}

这可以归结为仅进行赋值操作，这可能比值传递方法所需的复制构造函数加移动赋值要便宜得多。这样做的原因是赋值可能会重用y中现有的已分配内存，因此防止(取消)分配，而复制构造函数通常会分配内存。

对于右值实参，保留副本的f的最优实现形式为:

void f(T&& x) {
  T y{...};
  ...
  y = std::move(x);
}

这里只有一个move赋值。将右值传递给接受const引用的f版本只需要赋值，而不是move赋值。所以相对而言，在这种情况下，f的版本采用const引用作为通用实现更可取。

So in general, for the most optimal implementation, you will need to overload or do some kind of perfect forwarding as shown in the talk. The drawback is a combinatorial explosion in the number of overloads required, depending on the number of parameters for f in case you opt to overload on the value category of the argument. Perfect forwarding has the drawback that f becomes a template function, which prevents making it virtual, and results in significantly more complex code if you want to get it 100% right (see the talk for the gory details).