lxml xpath not working

Question

I am trying to parse a webpage code for which is below. I am able to get the users using the xpath but i am unable to get their scores using xpath any ideas what i am doing wrong here ?

import requests from lxml import html internsHack = 'https://doselect.com/hackathon/inmobi-internshack/leaderboard' page = requests.get(internsHack) tree = html.fromstring(page.content) users = tree.xpath('//div[@class="md-list-item-text"]/h2/a/text()') score = tree.xpath('//div[@class="points-score"]/ng-pluralize/text()')

paul trmbrth · Accepted Answer · 2015-12-02 11:39:36Z

2

HTML source snippet:

<div class="points-score"> <ng-pluralize count="200" when="{'0': '{} point', 'one': '{} point', 'other': '{} points'}"> </div>

Get the count attribute values instead of text():

//div[@class="points-score"]/ng-pluralize/@count

score variable would then have the following value:

['200', '198', '198', '197', '197', '197', '196', '195', '194', '194']

edited Dec 2, 2015 at 11:39

paul trmbrth

20.8k4 gold badges56 silver badges67 bronze badges

answered Dec 2, 2015 at 2:01

alecxe

476k127 gold badges1.1k silver badges1.2k bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Nick Loach Over a year ago

Your answer is correct but curious to know how did you figure that count is a attribute as when i look for this in developer tools of chrome 200 points is a text item

alecxe Over a year ago

@NickLoach what you see in the "Source" in browser developer tools is a rendered page by the browser which can seriously differ from the initial page. What you get with requests is the initial unrendered page - this is what you should inspect. Hope that helps.

Nick Loach Over a year ago

Thanks for the explanation i examined request content now its illegible but you are correct about the count attribute

Collectives™ on Stack Overflow

lxml xpath not working

1 Answer 1

3 Comments

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

3 Comments

Related