如何在Bash中分割分隔符上的字符串？

我将此字符串存储在变量中：

IN="bla@some.com;john@home.com"

现在，我想将字符串拆分为：；分隔符，以便我可以：

ADDR1="bla@some.com"
ADDR2="john@home.com"

我不一定需要ADDR1和ADDR2变量。如果它们是数组的元素，那就更好了。

根据以下答案的建议，我最终得出了以下结论，这就是我想要的：

#!/usr/bin/env bash

IN="bla@some.com;john@home.com"

mails=$(echo $IN | tr ";" "\n")

for addr in $mails
do
    echo "> [$addr]"
done

输出：

> [bla@some.com]
> [john@home.com]

有一个解决方案涉及将Internal_field_separator（IFS）设置为；。我不确定这个答案发生了什么，如何将IFS重置为默认值？

回复：IFS解决方案，我尝试了这个，它奏效了，我保留了旧的IFS，然后恢复它：

IN="bla@some.com;john@home.com"

OIFS=$IFS
IFS=';'
mails2=$IN
for x in $mails2
do
    echo "> [$x]"
done

IFS=$OIFS

顺便说一下，当我尝试

mails2=($IN)

在循环中打印时，我只得到了第一个字符串，$in周围没有括号，它可以工作。

当前回答

编辑：对不起，我在SO上读到POSIX需要perl，所以我认为使用它是合法的。但在unix.stackexchange.com上，一些用户表示perl不是POSIX规范的一部分。

我的解决方案：一个使用perl的拆分来完成工作的函数。

详细评论如下：

#!/bin/bash

# This function is a wrapper for Perl's split.\
# \
# Since we cannot return an array like in Perl,
# it takes the name of the resulting array as last
# argument.\
# \
# See https://perldoc.perl.org/functions/split for usage info
# and examples.\
# \
# If you provide a Perl regexp that contains e. g. an escaped token like \b,
# space(s) and/or capture group(s), it must be quoted, and e. g. /\b/ must
# be single-quoted.\
# Thus, it's best to generally single-quote a Perl regexp.
function split # Args: <Element separator regexp> <string> <array name>
{
    (($# != 3)) && echo "${FUNCNAME[0]}: Wrong number of arguments, returning." && return 1

    local elementSepRE=$1
    local string=$2
    local -n array=$3

    local element i=0

    # Attention! read does Word Splitting on each line!
    # I must admit I didn't know that so far.
    # This removes leading and trailing spaces, exactly
    # what we don't want.
    # Thus, we set IFS locally to newline only.
    local IFS=$'\n'

    while read element; do
        # As opposed to array+=($element),
        # this preserves leading and trailing spaces.
        array[i++]=$element
    done <<<$(_perl_split)
}

# This function calls Perl's split function and prints the elements of the
# resulting array on separate lines.\
# It uses the caller's $elementSepRE and $string.
function _perl_split
{
    # A heredoc is a great way of embedding a Perl script.
    # N.B.: - Shell variables get expanded.
    #         - Thus:
    #           - They must be quoted.
    #           - Perl scalar variables must be escaped.
    #       - The backslash of \n must be escaped to protect it.
    #       - Instead of redirecting a single heredoc to perl, we may
    #         use multiple heredocs with cat within a command group and
    #         pipe the result to perl.
    #         This enables us to conditionally add certain lines of code.

    {
        cat <<-END
            my \$elementSepRE=q($elementSepRE);
        END

        # If $elementSepRE is a literal Perl regexp, qr must be applied
        # to it in order to use it.
        # N.B.: We cannot write this condition in Perl because when perl
        # compiles the script, all statements are checked for validity,
        # no matter if they will actually be executed or not.
        # And if $elementSepRE was e. g. == ', the line below – although
        # not to be executed – would give an error because of an unterminated
        # single-quoted string.
        [[ $elementSepRE =~ ^m?/ && $elementSepRE =~ /[msixpodualn]*$ ]] && cat <<-END
            \$elementSepRE=qr$elementSepRE;
        END

        cat <<-END
            my @array=split(\$elementSepRE, q($string));

            print(\$_ . "\\n") for (@array);
        END
    } | perl
}

对于那些一眼就知道发生了什么的人来说，这一点没有任何评论；）

#!/bin/bash

# This function is a wrapper for Perl's split.\
# \
# Since we cannot return an array like in Perl,
# it takes the name of the resulting array as last
# argument.\
# \
# See https://perldoc.perl.org/functions/split for usage info
# and examples.\
# \
# If you provide a Perl regexp that contains e. g. an escaped token like \b,
# space(s) and/or capture group(s), it must be quoted, and e. g. /\b/ must
# be single-quoted.\
# Thus, it's best to generally single-quote a Perl regexp.
function split # Args: <Element separator regexp> <string> <array name>
{
    (($# != 3)) && echo "${FUNCNAME[0]}: Wrong number of arguments, returning." && return 1

    local elementSepRE=$1
    local string=$2
    local -n array=$3

    local element i=0

    local IFS=$'\n'

    while read element; do
        array[i++]=$element
    done <<<$(_perl_split)
}

function _perl_split
{
    {
        cat <<-END
            my \$elementSepRE=q($elementSepRE);
        END

        [[ $elementSepRE =~ ^m?/ && $elementSepRE =~ /[msixpodualn]*$ ]] && cat <<-END
            \$elementSepRE=qr$elementSepRE;
        END

        cat <<-END
            my @array=split(\$elementSepRE, q($string));

            print(\$_ . "\\n") for (@array);
        END
    } | perl
}

2021-08-22 01:15:32

其他回答

有两种简单的方法：

cat "text1;text2;text3" | tr " " "\n"

and

cat "text1;text2;text3" | sed -e 's/ /\n/g'

2011-10-12 11:09:41

有一种简单而聪明的方法：

echo "add:sfff" | xargs -d: -i  echo {}

但是您必须使用gnu-xargs，BSD-xargs不能支持-d delim。如果你像我一样使用苹果mac，你可以安装gnu-xargs：

brew install findutils

then

echo "add:sfff" | gxargs -d: -i  echo {}

2015-09-16 03:34:51

IN='bla@some.com;john@home.com;Charlie Brown <cbrown@acme.com;!"#$%&/()[]{}*? are no problem;simple is beautiful :-)'
set -f
oldifs="$IFS"
IFS=';'; arrayIN=($IN)
IFS="$oldifs"
for i in "${arrayIN[@]}"; do
echo "$i"
done
set +f

输出：

bla@some.com
john@home.com
Charlie Brown <cbrown@acme.com
!"#$%&/()[]{}*? are no problem
simple is beautiful :-)

说明：使用括号（）的简单赋值将分号分隔的列表转换为数组，前提是在执行此操作时使用正确的IFS。标准FOR循环照常处理该数组中的各个项。请注意，为IN变量提供的列表必须是“硬”引号，即带有单引号。

必须保存和恢复IFS，因为Bash不会像对待命令一样对待赋值。另一种解决方法是将赋值包装在函数内，并使用修改后的IFS调用该函数。在这种情况下，不需要单独保存/恢复IFS。感谢“比兹”指出这一点。

2014-10-10 11:33:54

这对我有用：

string="1;2"
echo $string | cut -d';' -f1 # output is 1
echo $string | cut -d';' -f2 # output is 2

2016-08-11 20:45:25

我看到了一些引用cut命令的答案，但它们都被删除了。有点奇怪的是，没有人详细阐述过这一点，因为我认为这是执行这类任务更有用的命令之一，尤其是用于解析分隔的日志文件。

在将这个特定示例拆分为bash脚本数组的情况下，tr可能效率更高，但可以使用cut，如果您想从中间拉取特定字段，则更有效。

例子：

$ echo "bla@some.com;john@home.com" | cut -d ";" -f 1
bla@some.com
$ echo "bla@some.com;john@home.com" | cut -d ";" -f 2
john@home.com

显然，您可以将其放入一个循环中，并迭代-f参数以独立拉动每个字段。

当您有一个分隔的日志文件，其中包含如下行时，这会更有用：

2015-04-27|12345|some action|an attribute|meta data

cut非常方便，能够抓取这个文件并选择一个特定的字段进行进一步处理。

2015-04-27 18:20:34

如何在Bash中分割分隔符上的字符串？

推荐文章

最新文章

标签