Life is very easy with Python: Beautiful Soup CSS selector

Saturday, November 3, 2012

Beautiful Soup CSS selector

Beautiful Soup supports a subset of the CSS selector standard. Just construct the selector as a string and pass it into the .select() method of a Tag or the BeautifulSoup object itself.
I used this html file for practice. All source code available on github

 
from pprint import pprint
from bs4 import BeautifulSoup

html_content = open('bs_sample3.html') 
# http://dl.dropbox.com/u/49962071/blog/python/resource/bs_sample3.html
soup = BeautifulSoup(html_content) # making soap

pprint(soup.select("title")) # get title tag
pprint(soup.select("body a")) # all a tag inside body
pprint(soup.select("html head title")) # html->head->title
pprint(soup.select("head > title")) # head->title
pprint(soup.select("p > a")) # all a tag that inside p
pprint(soup.select("body > a")) # all a tag inside body
pprint(soup.select(".sister")) # select by class
pprint(soup.select("#link1")) # select by id
pprint(soup.select('a[href="http://example.com/elsie"]')) 
# find tags by attribute value
pprint(soup.select('a[href^="http://example.com/"]'))
# find tags by attribute value, all contains 'http://example.com/'
pprint(soup.select('p[lang|=en]')) # Match language codes

4 comments:

www.muslu.org said...: thank you very much. i will use it; April 25, 2014 at 4:49 PM
Abu Zahed Jony said...: you are most welcome; April 25, 2014 at 5:00 PM
Unknown said...: Very nice examples. Thanks.; May 29, 2015 at 8:04 AM
Unknown said...: nice one, thanks.; February 23, 2016 at 1:18 PM

Python Tutorial

Life is very easy with Python

Saturday, November 3, 2012

Beautiful Soup CSS selector

4 comments:

Post a Comment

Search This Blog

Followers

About Me

Subjects

Archive