我如何创建一个URL缩短?

我想创建一个URL缩短服务，你可以写一个长URL到输入字段和服务缩短URL为“http://www.example.org/abcdef”。

可以用包含a-z, a-z和0-9的6个字符的字符串代替"abcdef"。这样就有560 ~ 570亿个可能的字符串。

我的方法:

我有一个有三列的数据库表:

Id，整数，自动递增 long, string，用户输入的长URL 短，字符串，缩短的URL(或只有六个字符)

然后将长URL插入到表中。然后，我将为“id”选择自动递增的值，并构建它的散列。这个散列应该作为“short”插入。但是我应该构建什么样的哈希呢?像MD5这样的哈希算法会创建太长的字符串。我认为我不用这些算法。一个自建的算法也可以。

我的想法:

对于“http://www.google.de/”，我得到了自动增量id 239472。然后我执行以下步骤:

short = '';
if divisible by 2, add "a"+the result to short
if divisible by 3, add "b"+the result to short
... until I have divisors for a-z and A-Z.

可以不断重复，直到这个数不能被整除为止。你认为这是一个好方法吗?你有更好的主意吗?

由于对这个主题的持续兴趣，我已经发布了一个高效的GitHub解决方案，包括JavaScript、PHP、Python和Java的实现。如果你喜欢，添加你的解决方案:)

当前回答

我不断增加数据库中每个域的整数序列，并使用Hashids将整数编码为URL路径。

static hashids = Hashids(salt = "my app rocks", minSize = 6)

我运行了一个脚本，看看它需要多长时间才能耗尽字符长度。对于6个字符，它可以做164,916,224个链接，然后增加到7个字符。Bitly使用7个字符。在我看来，不到五个字很奇怪。

哈希id可以将URL路径解码为整数，但更简单的解决方案是使用整个短链接sho。Rt /ka8ds3作为主键。

以下是完整的概念:

function addDomain(domain) {
    table("domains").insert("domain", domain, "seq", 0)
}

function addURL(domain, longURL) {
    seq = table("domains").where("domain = ?", domain).increment("seq")
    shortURL = domain + "/" + hashids.encode(seq)
    table("links").insert("short", shortURL, "long", longURL)
    return shortURL
}

// GET /:hashcode
function handleRequest(req, res) {
    shortURL = req.host + "/" + req.param("hashcode")
    longURL = table("links").where("short = ?", shortURL).get("long")
    res.redirect(301, longURL)
}

2018-01-25 02:24:22

其他回答

下面是一个Node.js实现，可能是bit.ly。生成一个高度随机的七字符字符串。

它使用Node.js加密生成高度随机的25个字符集，而不是随机选择7个字符。

var crypto = require("crypto");
exports.shortURL = new function () {
    this.getShortURL = function () {
        var sURL = '',
            _rand = crypto.randomBytes(25).toString('hex'),
            _base = _rand.length;
        for (var i = 0; i < 7; i++)
            sURL += _rand.charAt(Math.floor(Math.random() * _rand.length));
        return sURL;
    };
}

2017-01-05 07:22:06

这是我所使用的:

# Generate a [0-9a-zA-Z] string
ALPHABET = map(str,range(0, 10)) + map(chr, range(97, 123) + range(65, 91))

def encode_id(id_number, alphabet=ALPHABET):
    """Convert an integer to a string."""
    if id_number == 0:
        return alphabet[0]

    alphabet_len = len(alphabet) # Cache

    result = ''
    while id_number > 0:
        id_number, mod = divmod(id_number, alphabet_len)
        result = alphabet[mod] + result

    return result

def decode_id(id_string, alphabet=ALPHABET):
    """Convert a string to an integer."""
    alphabet_len = len(alphabet) # Cache
    return sum([alphabet.index(char) * pow(alphabet_len, power) for power, char in enumerate(reversed(id_string))])

它非常快，可以取很长的整数。

2011-03-10 18:34:32

// simple approach

$original_id = 56789;

$shortened_id = base_convert($original_id, 10, 36);

$un_shortened_id = base_convert($shortened_id, 36, 10);

2011-12-20 10:59:29

看看https://hashids.org/，它是开源的，有多种语言版本。

他们的页面概述了其他方法的一些陷阱。

2020-03-17 11:32:46

我有一个问题的变体，因为我存储了来自许多不同作者的网页，需要防止通过猜测发现页面。所以我的短url在Base-62字符串中添加了几个额外的数字作为页码。这些额外的数字是由页面记录本身的信息生成的，它们确保3844个url中只有一个是有效的(假设2位Base-62)。您可以在http://mgscan.com/MBWL上看到概要描述。

2015-03-15 09:42:53

我如何创建一个URL缩短?

推荐文章

最新文章

标签