
Running a Douban crawler script: it just keeps downloading, and I don't know what that means

    grey5659 · 2016-07-02 18:54:49 +08:00 · 1632 views

    It's this one: http://blog.csdn.net/lanbing510/article/details/45887075
    After running $ python doubanSpider.py it just keeps downloading. What does that mean? Output:

    /usr/local/lib/python2.7/dist-packages/bs4/__init__.py:166: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

    To get rid of this warning, change this:

    BeautifulSoup([your markup])

    to this:

    BeautifulSoup([your markup], "html.parser")

    markup_type=markup_type))
    Downloading Information From Page 1
    Downloading Information From Page 2
    Downloading Information From Page 3
    Downloading Information From Page 4
    Downloading Information From Page 5
    Downloading Information From Page 6
    WARNING:root:Some characters could not be decoded, and were replaced with REPLACEMENT CHARACTER.
    Downloading Information From Page 7
    Downloading Information From Page 8
    Downloading Information From Page 9
    Downloading Information From Page 10
    Downloading Information From Page 11
    Downloading Information From Page 12
    Downloading Information From Page 13
    Downloading Information From Page 14
    Downloading Information From Page 15
    Downloading Information From Page 16
    Downloading Information From Page 17
    Downloading Information From Page 18
    Downloading Information From Page 19
    Downloading Information From Page 20
    Downloading Information From Page 21
    Downloading Information From Page 22
    Downloading Information From Page 23
    Downloading Information From Page 24
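The "Some characters could not be decoded" warning after page 6 appears to come from the HTML decoding step: bytes that do not fit the guessed encoding are swapped for U+FFFD (REPLACEMENT CHARACTER) and the crawl continues. A minimal sketch of that behaviour, using only the standard library. `decode_page` and the candidate encoding list are hypothetical and not taken from doubanSpider.py; which encoding the affected page actually used is not known from the log.

```python
# -*- coding: utf-8 -*-
REPLACEMENT = u"\ufffd"  # U+FFFD, the character the warning refers to


def decode_page(raw, encodings=("utf-8", "gb18030")):
    """Hypothetical helper: try each candidate encoding strictly, in order.

    Returns (text, encoding_used). If no candidate decodes cleanly, fall
    back to the first one with errors="replace", which inserts U+FFFD for
    every byte sequence that cannot be decoded.
    """
    for enc in encodings:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue
    return raw.decode(encodings[0], errors="replace"), encodings[0]


# Bytes that are valid Latin-1 but not valid UTF-8:
raw = b"caf\xe9 ok"
lossy = raw.decode("utf-8", errors="replace")
print(repr(lossy))          # the undecodable byte became U+FFFD
print(REPLACEMENT in lossy)
```

If the lost characters matter, decoding the response bytes with the page's real encoding before handing the text to the parser avoids the replacement; if they do not, the warning can be ignored.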

    1 reply · 2016-07-02 19:28:09 +08:00

    #1 woniu127 · 2016-07-02 19:28:09 +08:00
    BeautifulSoup([your markup], "lxml")
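Naming the parser explicitly, as the reply suggests, is what silences the UserWarning. "lxml" is a separate third-party package, so it only works where lxml is installed; "html.parser" ships with Python. A minimal sketch of choosing between the two, assuming Python 3.4+ (the thread itself uses Python 2.7). `choose_parser` and `make_soup` are hypothetical names, not part of doubanSpider.py or of BeautifulSoup.

```python
import importlib.util


def choose_parser(preferred="lxml", fallback="html.parser"):
    """Return `preferred` if a module of that name is installed, else `fallback`."""
    # find_spec returns None when a top-level module cannot be found.
    if importlib.util.find_spec(preferred) is not None:
        return preferred
    return fallback


def make_soup(markup):
    """Parse `markup` with an explicitly named parser (no UserWarning)."""
    # Imported here so choose_parser stays usable without bs4 installed.
    from bs4 import BeautifulSoup  # third-party: beautifulsoup4
    return BeautifulSoup(markup, choose_parser())
```

Calling make_soup("<p>hello</p>") then behaves like BeautifulSoup("<p>hello</p>", "lxml") when lxml is available and like BeautifulSoup("<p>hello</p>", "html.parser") otherwise. Note that the two parsers can build slightly different trees from malformed HTML, which is the difference the warning is about.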