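How can I know which parts of the code are never used?

I have legacy C++ code that I'm supposed to remove unused code from. The problem is that the code base is large.

How can I find out which code is never called/never used?

Current answer

The real answer is: you can never really know for sure.

At least, for nontrivial cases, you can't be sure you've gotten all of it. Consider the following from Wikipedia's article on unreachable code: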

double x = sqrt(2);
if (x > 5)
{
  doStuff();
}

As Wikipedia correctly notes, a clever compiler may be able to catch something like this. But consider a modification:

int y;
cin >> y;
double x = sqrt((double)y);

if (x != 0 && x < 1)
{
  doStuff();
}

Will the compiler catch this? Maybe. But to do that, it will need to do more than run sqrt against a constant scalar value. It will have to figure out that (double)y will always be an integer (easy), and then understand the mathematical range of sqrt for the set of integers (hard). A very sophisticated compiler might be able to do this for the sqrt function, or for every function in math.h, or for any fixed-input function whose domain it can figure out. This gets very, very complex, and the complexity is basically limitless. You can keep adding layers of sophistication to your compiler, but there will always be a way to sneak in some code that will be unreachable for any given set of inputs.
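To make that concrete, here is a small sketch that brute-forces the fact the compiler would have to prove: for an integer input, the guarded branch never runs. The helper name countInputsReachingBranch and the tested range are my own assumptions for illustration, and a bounded check like this is evidence rather than a proof.

```cpp
#include <cmath>

// Hypothetical helper (not part of the original example): counts how
// many integers in [lo, hi] would make the guarded branch execute.
int countInputsReachingBranch(int lo, int hi)
{
    int hits = 0;
    for (int y = lo; y <= hi; ++y)
    {
        double x = std::sqrt(static_cast<double>(y));
        // Same condition as in the example. For negative y, sqrt yields
        // NaN, and an ordered comparison with NaN is false, so `x < 1`
        // fails. For y == 0 the first test fails; for y >= 1, x >= 1.
        if (x != 0 && x < 1)
        {
            ++hits;
        }
    }
    return hits;
}
```

Calling it over a range such as -1000 to 1000 should report zero hits, which is exactly the property a compiler would need to derive symbolically before it could drop the call to doStuff().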

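Then there are input sets that simply never get entered: input that makes no sense in real life, or that gets blocked by validation logic elsewhere. There's no way for the compiler to know about those.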
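A sketch of that situation, using made-up names (readPercentage and describe are assumptions, not from any real code base): validation in one function makes a branch in another unreachable in practice, yet nothing in describe alone reveals that.

```cpp
#include <algorithm>
#include <string>

// Hypothetical validation layer: every value entering the system is
// forced into the range [0, 100] here.
int readPercentage(int raw)
{
    return std::max(0, std::min(raw, 100));
}

// Hypothetical consumer. Looked at in isolation, the first branch is
// reachable. It is dead only if every caller goes through
// readPercentage(), which is a whole-program property.
std::string describe(int percentage)
{
    if (percentage > 100)
    {
        return "overflow";
    }
    return percentage >= 50 ? "high" : "low";
}
```

Whether the "overflow" branch is dead depends on a convention about callers, not on anything a tool can read off the function itself.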

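The end result of this is that while the software tools others have mentioned are extremely useful, you're never going to know for sure that you caught everything unless you go through the code manually afterward. Even then, you'll never be certain that you didn't miss anything.

The only real solution, IMHO, is to be as vigilant as possible, use the automation at your disposal, refactor where you can, and constantly look for ways to improve your code. Of course, it's a good idea to do that anyway.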

2011-01-31 23:13:19

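Other answers

CppDepend is a commercial tool which can detect unused types, methods and fields, and do much more. It is available for Windows and Linux (but currently has no 64-bit support), and comes with a 2-week trial.

Disclaimer: I don't work there, but I own a license for this tool (as well as NDepend, which is a more powerful alternative for .NET code).

For those who are curious, here is an example built-in (customizable) rule for detecting dead methods, written in CQLinq: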

// <Name>Potentially dead Methods</Name>
warnif count > 0
// Filter procedure for methods that shouldn't be considered as dead
let canMethodBeConsideredAsDeadProc = new Func<IMethod, bool>(
    m => !m.IsPublic &&       // Public methods might be used by client applications of your Projects.
         !m.IsEntryPoint &&            // Main() method is not used by-design.
         !m.IsClassConstructor &&      
         !m.IsVirtual &&               // Only check for non virtual method that are not seen as used in IL.
         !(m.IsConstructor &&          // Don't take account of protected ctors that might be called by derived ctors.
           m.IsProtected) &&
         !m.IsGeneratedByCompiler
)

// Get methods unused
let methodsUnused = 
   from m in JustMyCode.Methods where 
   m.NbMethodsCallingMe == 0 && 
   canMethodBeConsideredAsDeadProc(m)
   select m

// Dead methods = methods used only by unused methods (recursive)
let deadMethodsMetric = methodsUnused.FillIterative(
   methods => // Unique loop, just to let a chance to build the hashset.
              from o in new[] { new object() }
              // Use a hashset to make Intersect calls much faster!
              let hashset = methods.ToHashSet()
              from m in codeBase.Application.Methods.UsedByAny(methods).Except(methods)
              where canMethodBeConsideredAsDeadProc(m) &&
                    // Select methods called only by methods already considered as dead
                    hashset.Intersect(m.MethodsCallingMe).Count() == m.NbMethodsCallingMe
              select m)

from m in JustMyCode.Methods.Intersect(deadMethodsMetric.DefinitionDomain)
select new { m, m.MethodsCallingMe, depth = deadMethodsMetric[m] }

2012-11-05 00:00:44

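For the case of unused whole functions (and unused global variables), GCC can actually do most of the work for you, provided that you're using GCC and GNU ld.

When compiling the source, use -ffunction-sections and -fdata-sections, then when linking use -Wl,--gc-sections,--print-gc-sections. The linker will now list all the functions that could be removed because they were never called, and all the globals that were never referenced.

(Of course, you can also skip the --print-gc-sections part and let the linker remove the functions silently, but keep them in the source.)

Note: this will only find unused complete functions; it won't do anything about dead code within functions. Functions called from dead code in live functions will also be kept around.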
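As an illustration, assume a single file (call it demo.cpp, a made-up name) linked into a program whose main() calls entry(). The functions below then fall into the cases just described. The commands in the comment only combine the flags named in this answer; the output format of --print-gc-sections differs between linker versions, so none is shown.

```cpp
// Build sketch (GCC + GNU ld; file names are assumptions):
//   g++ -ffunction-sections -fdata-sections -c demo.cpp
//   g++ -Wl,--gc-sections,--print-gc-sections demo.o main.o -o demo

// Never referenced anywhere: its section is a candidate for removal.
int neverCalled()
{
    return 1;
}

// Referenced only from a branch that never executes. The linker sees
// the reference and keeps the function anyway.
int calledFromDeadCode()
{
    return 2;
}

// Assumed to be called from main(), so it is live.
int entry(int argc)
{
    // volatile stops the compiler from folding the branch away, so the
    // reference to calledFromDeadCode() survives into the object file.
    volatile bool impossible = false;
    if (impossible)
    {
        return calledFromDeadCode();
    }
    return argc > 0 ? 0 : -1;
}
```

The point of the third function is that section garbage collection works on references between sections, not on whether those references can ever be exercised at run time.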

Some C++ specific features will also cause problems, in particular:

Virtual functions. Without knowing which subclasses exist and which are actually instantiated at run time, you can't know which virtual functions you need to exist in the final program. The linker doesn't have enough information about that so it will have to keep all of them around.

Globals with constructors, and their constructors. In general, the linker can't know that the constructor for a global doesn't have side effects, so it must run it. Obviously this means the global itself also needs to be kept.

In either case, anything used by a virtual function or a global variable's constructor must also be kept around.
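A minimal sketch of both cases, with invented names (Registrar, registry, Shape and Triangle are assumptions for illustration only):

```cpp
#include <vector>

// Records construction events so the side effect is observable.
std::vector<int>& registry()
{
    static std::vector<int> events;
    return events;
}

struct Registrar
{
    explicit Registrar(int id) { registry().push_back(id); }
};

// No code refers to this object by name, but its constructor has a
// side effect, so the object and everything the constructor calls
// have to stay.
Registrar globalRegistrar(42);

struct Shape
{
    virtual ~Shape() = default;
    virtual int sides() const { return 0; }
};

// Triangle::sides is never called by name. It is referenced from
// Triangle's vtable, so the linker generally has to keep it.
struct Triangle : Shape
{
    int sides() const override { return 3; }
};

int sidesOf(const Shape& s)
{
    return s.sides(); // dispatched through the vtable at run time
}
```

Neither globalRegistrar nor Triangle::sides shows up as "called" in a simple by-name search, which is why both look removable and are not.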

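An additional caveat is that if you're building a shared library, the default settings in GCC will export every function in the shared library, causing it to be "used" as far as the linker is concerned. To fix that you need to set the default to hiding symbols instead of exporting them (using e.g. -fvisibility=hidden), and then explicitly select the functions that you need to export.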
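One common way to do the explicit selection, sketched here with a made-up macro name (API_EXPORT) and made-up functions, is GCC's visibility attribute:

```cpp
// Build sketch (GCC or Clang; file name is an assumption):
//   g++ -shared -fPIC -fvisibility=hidden api.cpp -o libapi.so

#if defined(__GNUC__)
#define API_EXPORT __attribute__((visibility("default")))
#else
#define API_EXPORT // other toolchains export symbols differently
#endif

// Hidden under -fvisibility=hidden: not part of the library's
// interface, so it no longer counts as "used" just by existing.
int internalHelper(int x)
{
    return x * 2;
}

// Explicitly exported: stays visible to users of the library.
API_EXPORT int publicEntry(int x)
{
    return internalHelper(x) + 1;
}
```

With this in place, only functions reachable from the exported ones should survive section garbage collection, assuming the library is also built and linked with the flags mentioned earlier in this answer.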

2011-02-01 01:07:00

Mark as many public functions and variables as private or protected as you can without causing compilation errors, and refactor the code as you go. Making functions private (and, to some extent, protected) reduces your search area, since private functions can only be called from the same class (unless there are stupid macros or other tricks to circumvent access restrictions, and if that's the case I'd recommend you find a new job). It is much easier to determine that you don't need a private function, since only the class you're currently working on can call it. This method is easier if your code base has small classes and is loosely coupled. If it does not have small classes or is very tightly coupled, I suggest cleaning those up first.
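A small sketch of what that buys you, using an invented class (Invoice and its members are assumptions for illustration):

```cpp
class Invoice
{
public:
    explicit Invoice(int cents) : cents_(cents) {}

    int total() const { return cents_ + tax(); }

private:
    // Callers can only be inside Invoice, so the search for uses ends
    // at this class definition.
    int tax() const { return cents_ / 10; }

    // No member of Invoice calls this. Once you have confirmed there
    // are no macro tricks around it, it is a candidate for deletion.
    int legacyDiscount() const { return cents_ / 20; }

    int cents_;
};
```

If legacyDiscount had stayed public, ruling out callers would have meant searching the whole code base instead of one class.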

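Next, mark all the remaining public functions and make a call graph to figure out the relationships between the classes. From this tree, try to figure out which parts of the branches look like they can be trimmed.

The advantage of this method is that you can do it on a per-module basis, so it is easy to keep passing your unit tests without long periods of time when the code base is broken.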

2011-01-27 15:26:30

I don't think it can be done automatically.

Even with code coverage tools, you need to provide sufficient input data to run them against.
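To show why the input data matters, here is a sketch with invented names (shippingCost and runLightParcelTests are assumptions). Built with something like g++ --coverage, run, and inspected with gcov, the heavy-parcel branch would show up as never executed even though it is perfectly reachable.

```cpp
// Hypothetical function under test.
int shippingCost(int weightKg)
{
    if (weightKg > 1000)
    {
        // Reachable, but only for inputs a small test set may never try.
        return 500;
    }
    return 10 + weightKg;
}

// Hypothetical test driver: exercises only light parcels, so a
// coverage report based on it says nothing reliable about the branch
// above. Returns the number of failed checks.
int runLightParcelTests()
{
    int failures = 0;
    if (shippingCost(1) != 11) ++failures;
    if (shippingCost(20) != 30) ++failures;
    return failures;
}
```

"Not covered" therefore means "not exercised by these inputs", which is weaker than "never used".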

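Perhaps a very sophisticated and expensive static analysis tool, such as those from Coverity or the LLVM compiler, could help.

But I'm not sure, and I would prefer manual code review.

UPDATED

Well... removing only unused variables and unused functions is not hard, though.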
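For that easy part, ordinary compiler warnings already go a long way. A minimal sketch, assuming GCC or Clang and made-up names (orphan, compute, scratch):

```cpp
// Compile sketch (GCC or Clang): g++ -Wall -Wextra -c demo.cpp

// Internal linkage and no callers in this file: with -Wall this is
// expected to be reported via -Wunused-function.
static int orphan()
{
    return 7;
}

int compute(int a)
{
    int scratch = 5; // never read: expected to be reported via -Wunused-variable
    return a + 1;
}
```

Note that this only works for things the compiler can see in full within one translation unit; a non-static function with no callers gets no such warning, because another file might use it.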

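UPDATED

After reading the other answers and comments, I'm more strongly convinced that it can't be done.

You have to know the code to get a meaningful code coverage measure, and if you know it that well, a lot of manual editing will be faster than preparing, running, and reviewing coverage results.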

2011-01-27 08:44:11
