Using the JavaScript regular expression flavour, how to match a specific tag only when inside another specific tag?

Question

I want to match only the <br> tags that are inside the <main> tag and not all of them:

Is it possible to do it with a JS regex? I'm trying to do a find and replace (with regex) in all files in a project.

Here's the raw text:

<br> <main> <input> <br> <hr> <br> <etc> </main>

i think it will be better to first capture the main tag part and then br inside it.. — rock321987
– rock321987, Commented Jun 13, 2016 at 4:46
also, it will be better to use html parser instead of regex unless you are bound to use it — rock321987
– rock321987, Commented Jun 13, 2016 at 4:53
Why RegEx, though? You can match the tags themselves with document.querySelector("main br") and possibly the DOMParser. — Sebastian Simon
– Sebastian Simon, Commented Jun 13, 2016 at 4:54
It has to be regex because I'm trying to do a find and replace (with regex) in all files in a project. — nunoarruda
– nunoarruda, Commented Jun 13, 2016 at 5:46

anubhava · Accepted Answer · 2016-06-13 13:13:45Z

Using DOM is always better for parsing HTML text. However if for reason you cannot use DOM here is a regex solution to match all <br> tags between <main> and </main>.

/<\s*br\s*\/?>(?=.*?(?:(?!<main>)[\s\S])*?<\/main>)/gi

RegEx Breakup:

<\s*br\s*\/?> # matches <br> or <br /> (?= # start of lookahead .*? # any arbitrary text, lazy (?: # start of non-capturing group (?! # start of negative lookahead <main> # literal text <main> ) # end of negative lookahead [\s\S]*? # match 0 or more of any char including newline, lazy ) # end of non-capturing group <\/main> # match </main> ) # end of lookahead /gi # make it global ignore case match

RegEx Demo

Naga Sai A · Accepted Answer · 2016-06-13 05:32:31Z

Created codepen URL for capturing only the BR elements with MAIN using

 document.body.childNodes

codepen-http://codepen.io/nagasai/pen/MeybzK

First got all the childNodes of Body and then MAIN and filtered BR tags from that

Hope this is helpful for you

 function allTags() { var c = document.body.childNodes; //console.log(c); var txt = ""; var i; for (i = 0; i < c.length; i++) { if (c[i].nodeName == "MAIN") { // alert(c[i].childNodes.length ) for (j = 0; j < c[i].childNodes.length; j++) { //alert(c[i].childNodes[j].nodeName); if (c[i].childNodes[j].nodeName == "BR") { txt = txt + c[i].childNodes[j]; } } } } console.log(txt); document.getElementById("demo").innerHTML = txt; }

HTML:

 <p>11</p><br> <main> <br> <div>q1111</div><br> </main> <button onclick="allTags()">Tags</button> <div id="demo"></div>

sure Roko... I just left the gist of code which i have used to achieve ..I will update it now :)

Touffy · Accepted Answer · 2016-06-13 13:40:36Z

As others have commented and I always say, don't use regexes to parse HTML. And yes, you can use the DOM to replace elements. It won't be quite as compact (and unreadable) as a regex but I think this is still quite short :

for(let br of document.querySelector('main br')) br.parentNode.replaceChild(br, document.createElement('span'))

(assuming you wanted to replace <br>s with another Element. Deleting or replacing with text would be just as easy.)

Collectives™ on Stack Overflow

Using the JavaScript regular expression flavour, how to match a specific tag only when inside another specific tag?

3 Answers 3

Comments

2 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

2 Comments

Comments

Linked

Related