How to extract data from webpage?

Question

I want to get text and data from a webpage. When a page finishes loading inside a WebBrowser control, I want to extract text from the page by element id. How can I achieve this, for example with Html Agility Pack and C#? Sorry for my poor English.

Are alternative (more modern) libraries like CsQuery allowed? Also, if you just want the whole text of the page, you don't need any library.
– Benjamin Gruenbaum, Commented Jan 1, 2014 at 10:39
I just need a few text values, looked up by HTML id. For example, given <div id="getid">ID00123</div>, I want to know how to get "ID00123" from my program. I'd prefer to use a C# Windows app.
– mbdAli, Commented Jan 1, 2014 at 10:43

Darin Dimitrov · Accepted Answer · 2014-01-01 10:42:25Z

You could use the GetElementbyId method on the HtmlDocument which allows you to retrieve some specific DOM element by its identifier:

    string html = ...; // Read the HTML here

    var htmlDoc = new HtmlAgilityPack.HtmlDocument();
    htmlDoc.OptionFixNestedTags = true; // let the parser repair badly nested tags
    htmlDoc.LoadHtml(html);

    // GetElementbyId returns null when no element has the given id
    var element = htmlDoc.GetElementbyId("someId");
    if (element != null)
    {
        string data = element.InnerText;
    }

6 Comments

mbdAli Over a year ago

Thanks. That works for one element, but I need to get around 10 elements from one page URL.

Darin Dimitrov Over a year ago

How about using a loop? If there's some pattern for the element ids you could simply loop through them.
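The loop suggested here could look roughly like the following. This is a minimal sketch, not a definitive implementation: the `ExtractById` helper, the sample markup, and the id names other than `getid` are hypothetical and only serve as illustration. It assumes the Html Agility Pack NuGet package is referenced.

```csharp
using System;
using System.Collections.Generic;
using HtmlAgilityPack;

class Program
{
    // Hypothetical helper: returns the inner text for each id found in the HTML.
    // Ids that do not exist in the document are skipped.
    static Dictionary<string, string> ExtractById(string html, IEnumerable<string> ids)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var result = new Dictionary<string, string>();
        foreach (var id in ids)
        {
            var node = doc.GetElementbyId(id);
            if (node != null)
            {
                // Decode entities such as &amp; and trim surrounding whitespace.
                result[id] = HtmlEntity.DeEntitize(node.InnerText).Trim();
            }
        }
        return result;
    }

    static void Main()
    {
        // Sample markup and ids are assumptions made for this example.
        string html = "<div id=\"getid\">ID00123</div>" +
                      "<span id=\"companyName\">Example Ltd</span>";

        var values = ExtractById(html, new[] { "getid", "companyName", "missing" });
        foreach (var pair in values)
        {
            Console.WriteLine(pair.Key + " = " + pair.Value);
        }
    }
}
```

Keeping the ids in a plain list means there is no need for a naming pattern: the ten ids can simply be listed once and looked up in turn.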

mbdAli Over a year ago

I can see the element ids by viewing the page source, but there is no pattern; the ids look completely different from one another. Can you please provide an example?

Darin Dimitrov Over a year ago

In this case you cannot use the element id to retrieve the values. You should use some other information that doesn't change. For example if there are some class values or even the DOM structure itself. It's impossible to say without having more details about the DOM structure you are dealing with.
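As an illustration of selecting by class and DOM structure instead of by id, here is a hedged sketch using Html Agility Pack's XPath support. The table layout and the class names `company`, `label`, and `value` are invented for this example; the real page's structure would have to be inspected first.

```csharp
using System;
using HtmlAgilityPack;

class Program
{
    static void Main()
    {
        // Hypothetical markup: the ids are unpredictable, but the classes
        // and the row structure stay the same.
        string html =
            "<table class=\"company\">" +
            "<tr><td class=\"label\">Name</td><td class=\"value\" id=\"x91\">Example Ltd</td></tr>" +
            "<tr><td class=\"label\">Number</td><td class=\"value\" id=\"q07\">ID00123</td></tr>" +
            "</table>";

        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // SelectNodes returns null (not an empty collection) when nothing matches.
        var rows = doc.DocumentNode.SelectNodes("//table[@class='company']//tr");
        if (rows == null)
        {
            return;
        }

        foreach (var row in rows)
        {
            // Look up the cells relative to the current row by their class.
            var label = row.SelectSingleNode("./td[@class='label']");
            var value = row.SelectSingleNode("./td[@class='value']");
            if (label != null && value != null)
            {
                Console.WriteLine(label.InnerText.Trim() + ": " + value.InnerText.Trim());
            }
        }
    }
}
```

The XPath expressions never mention an id, so the lookup keeps working as long as the class names and row layout stay stable.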

mbdAli Over a year ago

For example, in a C# Windows app, when we enter a company number, it should retrieve the company information from an online website and display that information inside my C# application. Example page: link
