bs4。FeatureNotFound:无法找到具有您所请求的功能的树构建器:lxml。是否需要安装解析器库?

...
soup = BeautifulSoup(html, "lxml")
File "/Library/Python/2.7/site-packages/bs4/__init__.py", line 152, in __init__
% ",".join(features))
bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

以上输出在我的终端上。我使用的是Mac OS 10.7.x。我有Python 2.7.1，并遵循本教程获得了Beautiful Soup和lxml，它们都成功安装了，并与位于这里的单独测试文件一起工作。在导致此错误的Python脚本中，我包含了这一行: 导入comparePages 在pageCrawler文件中，我包含了以下两行代码: 从bs4导入BeautifulSoup 从urllib2导入urlopen

任何帮助找出问题是什么以及如何解决都将不胜感激。

当前回答

BS4默认情况下需要HTML文档。因此，它将XML文档解析为HTML文档。在构造函数中传递features="xml"作为参数。它解决了我的问题。

2022-07-03 04:41:01

其他回答

在python环境中安装LXML解析器。

pip install lxml

你的问题会解决的。你也可以使用内置的python包:

soup = BeautifulSoup(s,  "html.parser")

注意:“HTMLParser”模块已被重命名为“html”。在Python3中

2020-05-28 12:00:25

尽管BeautifulSoup默认支持HTML解析器如果您想使用任何其他第三方Python解析器，则需要安装该外部解析器，如(lxml)。

soup_object= BeautifulSoup(markup, "html.parser") #Python HTML parser

但是如果你没有指定任何解析器作为参数，你会得到一个没有指定解析器的警告。

soup_object= BeautifulSoup(markup) #Warnning

要使用任何其他外部解析器，您需要安装它，然后需要指定它。就像

pip install lxml

soup_object= BeautifulSoup(markup, 'lxml') # C dependent parser

外部解析器依赖于c和python，这可能有一些优点和缺点。

2018-03-24 11:06:12

我修复了以下变化

之前更改

soup = BeautifulSoup(r.content, 'html5lib' )
print (soup.prettify())

后改变

soup = BeautifulSoup(r.content, features='html')
print(soup.prettify())

我的代码正常工作

2022-03-06 14:00:24

实际上是其他作品中提到的三个选项。

# 1. 
soup_object= BeautifulSoup(markup,"html.parser") #Python HTML parser

# 2. 
pip install lxml
soup_object= BeautifulSoup(markup,'lxml') # C dependent parser 

# 3.
pip install html5lib
soup_object= BeautifulSoup(markup,'html5lib') # C dependent parser

2020-09-01 20:14:37

运行这三个命令来确保你已经安装了所有相关的软件包:

pip install bs4
pip install html5lib
pip install lxml

然后，如果需要，重新启动您的Python IDE。

这样就可以解决所有与这个问题有关的问题了。

2020-02-12 08:22:29

bs4。FeatureNotFound:无法找到具有您所请求的功能的树构建器:lxml。是否需要安装解析器库?

推荐文章

最新文章

标签