How do I percent-encode URL parameters in Python?

If I do

url = "http://example.com?p=" + urllib.quote(query)

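it does not encode / to %2F (which breaks OAuth normalization), and it does not handle Unicode (it throws an exception).

Is there a better library?

Current answer

I think the requests module is much better. It is based on urllib3.

You can try this: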

>>> from requests.utils import quote
>>> quote('/test')
'/test'
>>> quote('/test', safe='')
'%2Ftest'

My answer is similar to Paolo's answer.

Other answers

In Python 3, urllib.quote has been moved to urllib.parse.quote. It handles Unicode by default.

>>> from urllib.parse import quote
>>> quote('/test')
'/test'
>>> quote('/test', safe='')
'%2Ftest'
>>> quote('/El Niño/')
'/El%20Ni%C3%B1o/'
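
Applied to the URL from the question, a minimal sketch might look like the following. The helper name build_url and the sample query string are made up for illustration; only quote and its safe parameter come from the standard library.

```python
from urllib.parse import quote


def build_url(base, query):
    """Append `query` as the fully percent-encoded value of the `p` parameter."""
    # safe="" leaves no reserved character unescaped, so "/" becomes %2F.
    return base + "?p=" + quote(query, safe="")


url = build_url("http://example.com", "a/b c&d=é")
print(url)  # http://example.com?p=a%2Fb%20c%26d%3D%C3%A9
```

Passing safe="" matters here because the default safe="/" is meant for path segments, not for parameter values.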

Python 2

From the documentation:

urllib.quote(string[, safe])

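Replace special characters in string using the %xx escape. Letters, digits, and the characters '_.-' are never quoted. By default, this function is intended for quoting the path section of the URL. The optional safe parameter specifies additional characters that should not be quoted; its default value is '/'.

That means passing safe='' will solve your first issue: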

>>> import urllib
>>> urllib.quote('/test')
'/test'
>>> urllib.quote('/test', safe='')
'%2Ftest'

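About the second issue, there is a bug report. Apparently it was fixed in Python 3. In Python 2 you can work around it by encoding the string as UTF-8 first, like this: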

>>> query = urllib.quote(u"Müller".encode('utf8'))
>>> print urllib.unquote(query).decode('utf8')
Müller

By the way, have a look at urlencode.
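
As a rough illustration of what urlencode does, here is a sketch written for Python 3, where the function lives at urllib.parse.urlencode (in Python 2 it is urllib.urlencode). The parameter names and values are invented for the example.

```python
from urllib.parse import quote, urlencode

# Hypothetical parameters, chosen only to show the escaping behaviour.
params = {"q": "El Niño/2024", "page": 2}

# By default urlencode uses quote_plus: spaces become "+" and "/" becomes %2F.
plus_style = urlencode(params)
print(plus_style)     # q=El+Ni%C3%B1o%2F2024&page=2

# quote_via=quote encodes spaces as %20 instead; safe is forwarded to quote.
percent_style = urlencode(params, quote_via=quote, safe="")
print(percent_style)  # q=El%20Ni%C3%B1o%2F2024&page=2
```

The second form is probably the one to reach for when building an OAuth 1.0 signature base string, since that scheme expects spaces as %20 rather than "+".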

Python 3

In Python 3, the quote function has been moved to urllib.parse:

>>> import urllib.parse
>>> print(urllib.parse.quote("Müller".encode('utf8')))
M%C3%BCller
>>> print(urllib.parse.unquote("M%C3%BCller"))
Müller
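
The explicit .encode('utf8') above is optional in Python 3, because quote accepts str and encodes it as UTF-8 by default. A small sketch of that, plus a round trip with a non-default encoding (the latin-1 case is only an illustration, not something the question asks for):

```python
from urllib.parse import quote, unquote

name = "Müller"

# str and UTF-8 bytes produce the same result.
from_str = quote(name)
from_bytes = quote(name.encode("utf8"))
print(from_str, from_bytes)  # M%C3%BCller M%C3%BCller

# A different encoding can be requested explicitly on both sides.
latin1 = quote(name, encoding="latin-1")
print(latin1)                               # M%FCller
print(unquote(latin1, encoding="latin-1"))  # Müller
```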

If you are using Django, you can use urlquote:

>>> from django.utils.http import urlquote
>>> urlquote(u"Müller")
u'M%C3%BCller'

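Note that changes to Python mean that this is now a legacy wrapper; it was deprecated in Django 3.0 and removed in Django 4.0. From the Django 2.1 source code for django.utils.http: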

A legacy compatibility wrapper to Python's urllib.parse.quote() function.
(was used for unicode handling on Python 2)

Another approach, using furl:

import furl

url = "https://httpbin.org/get?hello,world"
print(url)
url = furl.furl(url).url
print(url)

Output:

https://httpbin.org/get?hello,world
https://httpbin.org/get?hello%2Cworld
