如何使用正则表达式验证电子邮件地址？

多年来，我慢慢地开发了一个正则表达式，它可以正确验证大多数电子邮件地址，假设它们不使用IP地址作为服务器部分。

我在几个PHP程序中使用它，而且大多数时候都可以使用。然而，不时有人与我联系，他们对使用它的网站有问题，我最终不得不做出一些调整（最近我意识到我不允许四个字符的TLD）。

你有或见过验证电子邮件的最佳正则表达式是什么？

我见过几种使用函数的解决方案，这些函数使用了几个较短的表达式，但我宁愿在简单函数中使用一个长的复杂表达式，而不是在更复杂的函数中使用几个短表达式。

当前回答

世界上最流行的博客平台WordPress使用此功能验证电子邮件地址。。。

但他们是通过多个步骤来实现的。

使用此函数中提到的正则表达式时，您不必再担心。。。

这是函数。。。

/**
 * Verifies that an email is valid.
 *
 * Does not grok i18n domains. Not RFC compliant.
 *
 * @since 0.71
 *
 * @param string $email Email address to verify.
 * @param boolean $deprecated Deprecated.
 * @return string|bool Either false or the valid email address.
 */
function is_email( $email, $deprecated = false ) {
    if ( ! empty( $deprecated ) )
        _deprecated_argument( __FUNCTION__, '3.0' );

    // Test for the minimum length the email can be
    if ( strlen( $email ) < 3 ) {
        return apply_filters( 'is_email', false, $email, 'email_too_short' );
    }

    // Test for an @ character after the first position
    if ( strpos( $email, '@', 1 ) === false ) {
        return apply_filters( 'is_email', false, $email, 'email_no_at' );
    }

    // Split out the local and domain parts
    list( $local, $domain ) = explode( '@', $email, 2 );

    // LOCAL PART
    // Test for invalid characters
    if ( !preg_match( '/^[a-zA-Z0-9!#$%&\'*+\/=?^_`{|}~\.-]+$/', $local ) ) {
        return apply_filters( 'is_email', false, $email, 'local_invalid_chars' );
    }

    // DOMAIN PART
    // Test for sequences of periods
    if ( preg_match( '/\.{2,}/', $domain ) ) {
        return apply_filters( 'is_email', false, $email, 'domain_period_sequence' );
    }

    // Test for leading and trailing periods and whitespace
    if ( trim( $domain, " \t\n\r\0\x0B." ) !== $domain ) {
        return apply_filters( 'is_email', false, $email, 'domain_period_limits' );
    }

    // Split the domain into subs
    $subs = explode( '.', $domain );

    // Assume the domain will have at least two subs
    if ( 2 > count( $subs ) ) {
        return apply_filters( 'is_email', false, $email, 'domain_no_periods' );
    }

    // Loop through each sub
    foreach ( $subs as $sub ) {
        // Test for leading and trailing hyphens and whitespace
        if ( trim( $sub, " \t\n\r\0\x0B-" ) !== $sub ) {
            return apply_filters( 'is_email', false, $email, 'sub_hyphen_limits' );
        }

        // Test for invalid characters
        if ( !preg_match('/^[a-z0-9-]+$/i', $sub ) ) {
            return apply_filters( 'is_email', false, $email, 'sub_invalid_chars' );
        }
    }

    // Congratulations your email made it!
    return apply_filters( 'is_email', $email, $email, null );
}

2013-05-11 12:20:38

其他回答

我不建议使用正则表达式，电子邮件地址太复杂了。这是一个常见的问题，所以我猜有很多库都包含验证器-如果您使用Java，apachecommons验证器的EmailValidator是一个很好的验证器。

2011-11-04 18:32:24

这是我使用的PHP代码。我选择这个解决方案是出于“误报比误报好”的精神，正如这里的另一位评论者所说的，并考虑到保持您的响应时间并降低服务器负载。。。当正则表达式可以消除大多数简单的用户错误时，真的不需要浪费服务器资源。如果你愿意，你可以随时通过发送测试邮件来跟进。

function validateEmail($email) {
  return (bool) stripos($email,'@');
}

2011-07-20 03:37:42

我从不麻烦用自己的正则表达式创建，因为很可能其他人已经提出了更好的版本。我总是使用regexlib来找到一个我喜欢的。

2008-10-14 14:23:11

出于我的目的，我需要一种方法来提取显示名称（如果提供）。感谢上提供的其他答案和正则表达式https://emailregex.com/我提出了以下解决方案：

/^(?:([^<]*?)\s*<)?((?:[a-z0-9!#$%&'*+\/=?^_`{|}~-]+(?:\.[a-z0-9!#$%&'*+\/=?^_`{|}~-]+)*|"(?:[\x01-\x08\x0b\x0c\x0e-\x1f\x21\x23-\x5b\x5d-\x7f]|\\[\x01-\x09\x0b\x0c\x0e-\x7f])*")@(?:(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\.)+[a-z0-9](?:[a-z0-9-]*[a-z0-9])?|\[(?:(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)\.){3}(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?|[a-z0-9-]*[a-z0-9]:(?:[\x01-\x08\x0b\x0c\x0e-\x1f\x21-\x5a\x53-\x7f]|\\[\x01-\x09\x0b\x0c\x0e-\x7f])+)\]))>?$/gi

这与显示名称（=组1）+电子邮件地址（=组2）相匹配。

匹配示例：

john.doe@example.com
john.o'doe@example.com
John <john@doe.com>
<john@doe.com>
This is <john@127.0.0.1>

使用测试https://regex101.com/

当然，正如其他答案中提到的，还需要对显示名称和电子邮件地址的长度进行额外验证（不应超过320个UTF-8字节）。

2021-07-09 00:02:45

根据我所看到的，一个完全符合标准的正则表达式是允许的：

/^(?!(^[.-].*|.*[.-]@|.*\.{2,}.*)|^.{254}.+@)([a-z\xC0-\xFF0-9!#$%&'*+\/=?^_`{|}~.-]+@)(?!.{253}.+$)((?!-.*|.*-\.)([a-z0-9-]{1,63}\.)+[a-z]{2,63}|(([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9])\.){3}([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9]))$/gim

演示/调试分析（交互式）

拆分：

^(?!(^[.-].*|.*[.-]@|.*\.{2,}.*)|^.{254}.+@)
([a-z\xC0-\xFF0-9!#$%&'*+\/=?^_`{|}~.-]+@)
(?!.{253}.+$)
(
    (?!-.*|.*-\.)
    ([a-z0-9-]{1,63}\.)+
    [a-z]{2,63}
    |
    (([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9])\.){3}
    ([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9])
)$

分析：

(?!(^[.-].*|.*[.-]@|.*\.{2,}.*)|^.{254}.+@)

对以。，以一结尾，有。。或超过254个字符的最大长度

([a-z\xC0-\xFF0-9!#$%&'*+\/=?^_`{|}~.-]+@)

匹配一个或多个允许的字符，并应用负面外观

(?!.{253}.+$)

域名部分的负前瞻性，总共限制为253个字符

(?!-.*|.*-\.)

每个域名的负前瞻性，不允许以开头或结尾。

([a-z0-9-]{1,63}\.)+

域名中允许的字符的简单组匹配，每个字符限制为63个字符

[a-zA-Z]{2,63}

允许的顶级域的简单组匹配，该域目前仍仅限于字母，但确实包含4个字母以上的TLD。

(([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9])\.){3}
([01]?[0-9]{2}|2([0-4][0-9]|5[0-5])|[0-9])

域名的替代方案：这将IP地址中的前3个数字与匹配。然后是IP地址中没有的第四个数字。在它背后。

2014-06-07 00:25:23

如何使用正则表达式验证电子邮件地址？

推荐文章

最新文章

标签