In [1]: from bs4 import BeautifulSoupIn [2]: text = ”’ …: <div> …: <ul> …: <li class=“item-0” id=“first”><a href=”https://www.cnblogs.com/Elite-Wang/p/link1.html”>first item</a></li> …: <li class=“item-1”><a href=”https://www.cnblogs.com/Elite-Wang/p/link2.html”>second item</a></li> …: <li class=“item-inactive”><a href=”https://www.cnblogs.com/Elite-Wang/p/link3.html”><span class=“bold”>third item</span></a></li> …: <li class=“item-1”><a href=”https://www.cnblogs.com/Elite-Wang/p/link4.html”>fourth item</a></li> …: <li class=“item-0”><a href=”https://www.cnblogs.com/Elite-Wang/p/link5.html”>fifth item</a></li> …: </ul> …: </div> …: ”’
讯享网In [3]: bs = BeautifulSoup(text)#创建BeautifulSoup对象,可以直接传入字符串
In [4]: bs1 = BeautifulSoup(open(‘https://www.cnblogs.com/Elite-Wang/p/test.html’))#也可以传入文件对象
In [5]: bs Out[5]: <html><body><div> <ul> <li class=“item-0” id=“first”><a href=https://www.cnblogs.com/Elite-Wang/p/“link1.html”>first item</a></li> <li class=“item-1”><a href=https://www.cnblogs.com/Elite-Wang/p/“link2.html”>second item</a></li> <li class=“item-inactive”><a href=https://www.cnblogs.com/Elite-Wang/p/“link3.html”><span class=“bold”>third item</span></a></li> <li class=“item-1”><a href=https://www.cnblogs.com/Elite-Wang/p/“link4.html”>fourth item</a></li> <li class=“item-0”><a href=https://www.cnblogs.com/Elite-Wang/p/“link5.html”>fifth item</a></li> </ul> </div> </body></html>
讯享网

版权声明:本文内容由互联网用户自发贡献,该文观点仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容,请联系我们,一经查实,本站将立刻删除。
如需转载请保留出处:https://51itzy.com/kjqy/201418.html