Why doesn't PHP XPath find table elements even though Firefox shows they exist?

Question

I am trying to pull an exact table during a "web scrape." Used cURL to pull page into $html, which succeeds fine.

Used Firebug to get exact XPATH to the table needed.

Code follows:

$dom = new DOMDocument($html); $dom->loadHTML($html); $xpath = new DOMXpath($dom); $summary = $xpath->evaluate('/html/body/table[5]/tbody/tr/td[3]/table/tbody/tr[8]/td/table'); echo "Summary Length: " . $summary->length;

When executed, $summary->length is always zero. It doesn't pull that table node.

Any ideas?

possible duplicate of Why does my XPath query (scraping HTML tables) only work in Firebug, but not the application I'm developing? — Jens Erat
– Jens Erat, Commented Aug 14, 2013 at 22:25

Rob Kennedy · Accepted Answer · 2009-05-07 20:26:15Z

4

Firefox is liable to insert "virtual" tbody elements into tables that don't have them; do those elements exist in the original file?

answered May 7, 2009 at 20:26

Rob Kennedy

164k23 gold badges288 silver badges481 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

Faasie Over a year ago

No, they don't. But I do see them in firefox. I have used XPath Checker as well and can see the data I need. But using it in my PHP xpath->evaluate never returns data.

Greg Over a year ago

<tr> is not allowed inside <table> directly - there has to be a <tbody> / <thead> / <tfoot>. It's implied if not specified directly. HTML is weird like that... the start and end tags can both be optional!

Frank Farmer Over a year ago

If the the tbody elements don't exist in the original file, then they shouldn't be in your PHP xpath query.

Faasie Over a year ago

I apologize. The TBODY tags are there. I overlooked them when first looking at the source.

Aloe · Accepted Answer · 2013-04-21 07:58:28Z

Just remove "/tbody". From xpath you got from firefox:

.//*[@id='data']/tbody/tr[1]/td[2]/span

create this:

.//*[@id='data']/tr[1]/td[2]/span

Aloe

Collectives™ on Stack Overflow

Why doesn't PHP XPath find table elements even though Firefox shows they exist?

2 Answers 2

4 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

4 Comments

Comments

Linked

Related