What is the quickest way to HTTP GET in Python?

What is the quickest way to HTTP GET in Python if I know the content will be a string? I am searching the documentation for a quick one-liner like:

contents = url.get("http://example.com/foo/bar")

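But all I can find using Google are httplib and urllib, and I am unable to find a shortcut in those libraries.

Does standard Python 2.5 have a shortcut in some form like the one above, or should I write a url_get function?

I would prefer not to capture the output of shelling out to wget or curl.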

Current answer

Have a look at httplib2, which, next to a lot of very useful features, provides exactly what you want.

import httplib2

resp, content = httplib2.Http().request("http://example.com/foo/bar")

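Here content is the response body (as a string), and resp contains the status and response headers.

It isn't included in a standard Python install (though it only requires standard Python), but it's definitely worth checking out.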
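
For comparison, a standard-library-only helper along the lines of the url_get function the question mentions might look like the following. This is a minimal Python 3 sketch and is not part of httplib2: the name url_get, the timeout value, and the UTF-8 fallback are assumptions made for illustration.

```python
import urllib.request


def url_get(url, timeout=10):
    """Fetch `url` with a GET request and return the body as a str.

    Hypothetical helper: the name, the 10 second timeout, and the UTF-8
    fallback are illustrative choices, not something the answer specifies.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        # Use the charset declared in the Content-Type header, if any.
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset)
```

With this in place, contents = url_get("http://example.com/foo/bar") has the one-liner shape asked for. On Python 2 the equivalent call lives in urllib2 rather than urllib.request.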

2009-03-14 16:13:13

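Other answers

Theller's wget solution is really useful; however, I found that it does not print the progress throughout the download. Adding one line after the print statement in reporthook (the sys.stdout.flush() call) fixes that.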

import sys, urllib

def reporthook(a, b, c):
    print "% 3.1f%% of %d bytes\r" % (min(100, float(a * b) / c * 100), c),
    sys.stdout.flush()
for url in sys.argv[1:]:
    i = url.rfind("/")
    file = url[i+1:]
    print url, "->", file
    urllib.urlretrieve(url, file, reporthook)
print
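
The script above is Python 2 (print statements, urllib.urlretrieve). A rough Python 3 port is sketched below. The helper names percent_done and download are made up for this example, and the guard for an unknown total size is an addition that the original script does not have.

```python
import sys
import urllib.request


def percent_done(block_count, block_size, total_size):
    """Return download progress as a percentage, capped at 100."""
    if total_size <= 0:
        # urlretrieve passes -1 when the server sent no Content-Length.
        return 0.0
    return min(100.0, block_count * block_size * 100.0 / total_size)


def reporthook(block_count, block_size, total_size):
    # "\r" moves back to the start of the line so the figure updates in place.
    sys.stdout.write("\r%5.1f%% of %d bytes"
                     % (percent_done(block_count, block_size, total_size),
                        total_size))
    sys.stdout.flush()  # without this, the progress may only show at the end


def download(url):
    """Save `url` under the name that follows its last slash."""
    filename = url[url.rfind("/") + 1:]
    print(url, "->", filename)
    urllib.request.urlretrieve(url, filename, reporthook)
    print()


if __name__ == "__main__":
    for url in sys.argv[1:]:
        download(url)
```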

2010-01-05 01:21:33

For Python >= 3.6, you can use dload:

import dload
t = dload.text(url)

json:

j = dload.json(url)

To install: pip install dload
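
If adding a dependency is not an option, the JSON case can be approximated with the standard library alone. This is a sketch under assumptions: fetch_json is a hypothetical name, and nothing here reflects how dload works internally.

```python
import json
import urllib.request


def fetch_json(url, timeout=10):
    """GET `url` and parse the response body as JSON (hypothetical helper)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        # json.loads accepts bytes on Python 3.6+ and detects UTF-8/16/32.
        return json.loads(resp.read())
```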

2020-02-29 23:02:00

Here is a wget script in Python:

# From python cookbook, 2nd edition, page 487
import sys, urllib

def reporthook(a, b, c):
    print "% 3.1f%% of %d bytes\r" % (min(100, float(a * b) / c * 100), c),
for url in sys.argv[1:]:
    i = url.rfind("/")
    file = url[i+1:]
    print url, "->", file
    urllib.urlretrieve(url, file, reporthook)
print

2009-03-14 16:47:32

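If you are working with HTTP APIs specifically, there are also more convenient choices such as Nap.

For example, here's how to get gists from GitHub since May 1st, 2014: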

from nap.url import Url
api = Url('https://api.github.com')

gists = api.join('gists')
response = gists.get(params={'since': '2014-05-01T00:00:00Z'})
print(response.json())

More examples: https://github.com/kimmobrunfeldt/nap#examples
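
A request like the one above boils down to a GET on a URL whose query string carries the parameters. The sketch below shows only that encoding step, using the standard library; build_url is a hypothetical helper and is not Nap's implementation.

```python
from urllib.parse import urlencode


def build_url(base, path, params=None):
    """Join base and path, then append params as a percent-encoded query."""
    url = base.rstrip("/") + "/" + path.lstrip("/")
    if params:
        url += "?" + urlencode(params)
    return url


print(build_url("https://api.github.com", "gists",
                {"since": "2014-05-01T00:00:00Z"}))
# https://api.github.com/gists?since=2014-05-01T00%3A00%3A00Z
```

The resulting string can be handed to any HTTP client, for example urllib.request.urlopen.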

2014-05-22 17:08:22

How to also send headers

Python 3:

import urllib.request
contents = urllib.request.urlopen(urllib.request.Request(
    "https://api.github.com/repos/cirosantilli/linux-kernel-module-cheat/releases/latest",
    headers={"Accept": "application/vnd.github.full+json, text/html"}
)).read()
print(contents)

Python 2:

import urllib2
contents = urllib2.urlopen(urllib2.Request(
    "https://api.github.com",
    headers={"Accept": "application/vnd.github.full+json, text/html"}
)).read()
print(contents)
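
Nesting the Request inside urlopen makes the headers hard to check. As a sketch, the request can be built in a separate step and inspected before anything is sent. build_request is a hypothetical name, and the Accept value used here is only an example.

```python
import urllib.request


def build_request(url, headers=None):
    """Create a GET request object carrying the given headers (not yet sent)."""
    return urllib.request.Request(url, headers=headers or {})


req = build_request("https://api.github.com",
                    {"Accept": "application/json"})
print(req.get_method())          # GET, because no request body was given
print(req.get_header("Accept"))  # application/json

# Sending it is a separate step that needs network access:
# contents = urllib.request.urlopen(req).read()
```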

2018-09-16 06:22:04
