how to extract an attribute value of div using BeautifulSoup

Question

I have a div whose id is "img-cont"

<div class="img-cont-box" id="img-cont" style='background-image: url("http://example.com/example.jpg");'>

I want to extract the url in background-image using beautiful soup.How can I do it?

oshribr · Accepted Answer · 2017-04-03 08:35:11Z

5

You can you find_all or find for the first match.

import re soup = BeautifulSoup(html_str) result = soup.find('div',attrs={'id':'img-cont','style':True}) if result is not None: url = re.findall('\("(http.*)"\)',result['style']) # return a list.

edited Apr 3, 2017 at 8:35

answered Apr 2, 2017 at 22:04

oshribr

6667 silver badges17 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

yondu_udanta Over a year ago

I have done that part.How to extract url from the result variable?

yondu_udanta Over a year ago

Thanks, It worked!!Could you please explain me this part "url = re.findall('("(http.*)")',result['style'])".

oshribr Over a year ago

the result['style'] return the string 'background-image: url("http://example.com/example.jpg");' and the re.findall() is a regex search, to read more about regex check this link docs.python.org/2/library/re.html

hallazzang · Accepted Answer · 2017-04-03 07:57:29Z

Try this:

import re from bs4 import BeautifulSoup html = '''\ <div class="img-cont-box" \ id="img-cont" \ style='background-image: url("http://example.com/example.jpg");'>\ ''' soup = BeautifulSoup(html, 'html.parser') div = soup.find('div', id='img-cont') print(re.search(r'url\("(.+)"\)', div['style']).group(1))

Collectives™ on Stack Overflow

how to extract an attribute value of div using BeautifulSoup

2 Answers 2

3 Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

3 Comments

Comments

Related