How do I use grep to find a pattern across multiple lines?

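I want to find files that contain both "abc" and "efg", where the two strings are on different lines of the file. For example, a file with the following content: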

blah blah..
blah blah..
blah abc blah
blah blah..
blah blah..
blah blah..
blah efg blah blah
blah blah..
blah blah..

should match.

Current answer

This can be done easily by first using tr to replace the newlines with some other character:

tr '\n' '\a' | grep -o 'abc.*efg' | tr '\a' '\n'

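Here I use the alert character \a (ASCII 7) in place of the newline. It almost never occurs in ordinary text, and grep can match it with a ., or, under grep -P, specifically with \a.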
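
Since the question asks for file names rather than the matched text, the same trick can be wrapped in a loop. The following is a minimal sketch, not a definitive implementation: the function name files_with_abc_then_efg is made up for illustration, and it assumes the files contain no BEL characters of their own. Requiring a literal BEL between the two strings forces them onto different lines.

```shell
# Hypothetical helper: print the names of files in which "abc" is
# followed by "efg" on a later line.
files_with_abc_then_efg() {
    # A literal BEL character; command substitution keeps it intact.
    bel=$(printf '\a')
    for f in "$@"; do
        # Join all lines with BEL so that grep sees one long line, then
        # require at least one BEL (a former newline) between the strings.
        if tr '\n' '\a' < "$f" | grep -q "abc.*${bel}.*efg"; then
            printf '%s\n' "$f"
        fi
    done
}
```

It could be called as files_with_abc_then_efg *.txt. Dropping the BEL from the pattern would also accept files where both strings share a line.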

Other answers

Grep is an awkward tool for this kind of operation.

pcregrep, which is available on most modern Linux systems, can be used as

pcregrep -M  'abc.*(\n|.)*efg' test.txt

where -M, --multiline allows patterns to match more than one line.

There is also a newer pcre2grep. Both are provided by the PCRE project.

pcre2grep is available for Mac OS X via MacPorts as part of the pcre2 port:

% sudo port install pcre2

and via Homebrew as:

% brew install pcre

or, for pcre2:

% brew install pcre2

pcre2grep is also available on Linux (Ubuntu 18.04+):

$ sudo apt install pcre2-utils # PCRE2
$ sudo apt install pcregrep    # Older PCRE

I'm not sure whether it is possible with grep, but sed makes it very easy:

sed -e '/abc/,/efg/!d' [file-with-content]

This should work:

cat FILE | egrep 'abc|efg'

If there is more than one match, you can filter out the unwanted ones with grep -v.

If you can use Perl, you can do this very easily.

perl -ne 'if (/abc/) { $abc = 1; next }; print "Found in $ARGV\n" if ($abc && /efg/);' yourfilename.txt
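
The same line-by-line idea can be written so that the flag is reset for each file and each file is reported at most once. This is only a sketch, swapping in awk for Perl: the function name files_abc_before_efg and the variable names are made up for illustration. It keeps just two flags in memory, so file size does not matter.

```shell
# Hypothetical helper: list files where a line containing "abc" is
# followed by a later line containing "efg".
files_abc_before_efg() {
    awk '
        FNR == 1 { seen_abc = 0; reported = 0 }   # first line of a new file: reset state
        reported { next }                         # this file was already printed
        seen_abc && /efg/ { print FILENAME; reported = 1; next }
        /abc/ { seen_abc = 1 }                    # takes effect from the next line on
    ' "$@"
}
```

Because the efg test runs before the abc flag is set, a line that contains both strings does not count as a match by itself.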

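You can also do this with a single regular expression, but that means reading the entire content of the file into one string, which may take up too much memory for large files. For completeness, here is that approach: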

perl -e '@lines = <>; $content = join("", @lines); print "Found in $ARGV\n" if ($content =~ /abc.*efg/s);' yourfilename.txt

Sadly, you can't. From the grep documentation:

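grep searches the named input FILEs (or standard input if no files are named, or if a single hyphen-minus (-) is given as file name) for lines containing a match to the given PATTERN.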
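
That description covers grep's default line-by-line mode. As a hedged aside, GNU grep can be told to treat NUL rather than newline as the record separator, which makes an ordinary text file a single record. The sketch below assumes GNU grep built with PCRE support (-P), so it may not work with other grep implementations, and the function name grep_multiline_files is made up for illustration.

```shell
# Hypothetical helper; assumes GNU grep with -P (PCRE) support.
#   -z    treat input as NUL-separated, so a text file is one record
#   -l    print only the names of files that match
#   (?s)  lets . match newlines; \n forces a line break between the strings
grep_multiline_files() {
    grep -Pzl '(?s)abc.*\n.*efg' -- "$@"
}
```

Like the Perl single-regex approach, this holds the whole file in memory at once.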
