Questions tagged [html]
HyperText Markup Language (HTML) is the main markup language for displaying web pages and other information that can be displayed in a web browser.
332 questions
2 votes
0 answers
29 views
How to get <table border="1"> in pandoc?
Sure, to put borders on HTML tables in pandoc, I could just do $ pandoc file.docx | perl -pwle 's/<table/$& border="1"/' | w3m -dump -T text/html But how might I instead ...
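For comparison, the Perl substitution quoted in the excerpt above can be sketched in Python. This is only an illustration of the same post-processing step, not a pandoc feature; the function name `add_table_border` is hypothetical. Like the one-liner, it touches only the first `<table` on each line.

```python
import re


def add_table_border(html: str) -> str:
    """Append border="1" after the first '<table' on each line.

    Mirrors the Perl one-liner s/<table/$& border="1"/ applied line by line.
    """
    return "\n".join(
        # \g<0> re-inserts the matched text, like $& in Perl
        re.sub(r"<table", r'\g<0> border="1"', line, count=1)
        for line in html.split("\n")
    )
```

The result could then be piped to a viewer such as w3m, as in the original command.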
0 votes
1 answer
65 views
Convert HTML reference to man page (HelpPC reference)
I find HelpPC by David Jurgens very helpful when I need to check the description of an assembly instruction; it is brief and clear compared to the heavy Intel manuals or Felix Cloutier's HTML version. The ...
0 votes
1 answer
153 views
Why does Firefox 'copy as cURL' not download anything here?
I tried to do this: curl "https://imslp.org/wiki/Goldberg-Variationen%2C_BWV_988_(Bach%2C_Johann_Sebastian)" | perl -nle 'print "$1" while /<span id="num-of-ratings-[0-9]{6}...
3 votes
2 answers
223 views
Embedded special characters skewing sed output
The Issue: I've been parsing a file with sed, trying to tweeze out the desired data. This has worked fine for most lines in the file, but there appear to be some embedded special characters that are ...
-2 votes
4 answers
267 views
How to strip data from HTML using awk?
I'd like to retrieve data from here: https://www.sbs.com.au/ondemand/tv-series/la-unidad/season-1. I used wget to save the page to a file. The data I seek is in the form of (samples): https://www.sbs.com.au/ondemand/...
1 vote
2 answers
447 views
How can I select an element using the xmllint command?
I am trying to select "Bvlgari omnia crystalline'perfume' 100ml" with xmllint from the code below. But as I'm a newbie in the field of Linux, it is insanely difficult to figure out ...
0 votes
1 answer
154 views
Wget downloads wrong content
I'm trying to download a specific sitemap.xml (https://www.irna.ir/sitemap/all/sitemap.xml). The problem is that when you load that sitemap.xml, for a few seconds a white page with a header ...
3 votes
2 answers
587 views
CSS not updating on a `http.server` website
I have a website using the Python http.server module. I wanted 2 users to work on the same files (HTML, CSS, JS), so I set the permissions to 777 with chmod. The problem is that the CSS content now only updates ...
3 votes
4 answers
819 views
Convert pipe delimited column data to HTML table format for email
I am trying to convert pipe-delimited data to an HTML table for printing in an email, and I am unsure how to use the pipe delimiter as a separator for HTML tabular formatting. Below is what I could ...
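As a rough illustration of the conversion this question asks about, here is a minimal Python sketch. It assumes the first input line holds the column headers and that cells contain no literal pipe characters; neither assumption comes from the question, and the function name `pipe_rows_to_html_table` is hypothetical.

```python
from html import escape


def pipe_rows_to_html_table(text: str) -> str:
    """Turn pipe-delimited lines into a simple HTML table.

    Assumption: the first non-empty line is the header row.
    """
    rows = [line.split("|") for line in text.splitlines() if line.strip()]
    out = ['<table border="1">']
    for i, cells in enumerate(rows):
        tag = "th" if i == 0 else "td"
        # escape() keeps characters such as < and & from breaking the markup
        body = "".join(f"<{tag}>{escape(c.strip())}</{tag}>" for c in cells)
        out.append(f"<tr>{body}</tr>")
    out.append("</table>")
    return "\n".join(out)
```

The returned string could be used as the HTML body of a message, provided the mail is sent with a `text/html` content type.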
0 votes
2 answers
135 views
BSD sed/awk moving portion of line to line above (switching attribute in HTML file)
My situation is simple: I have an HTML file with several lines containing only the indented <section> block tag, each line followed by an (also indented) <h3 id="YYYY">...</...
0 votes
1 answer
66 views
Use wget to retrieve Supplemental Data from Science dot org
I'm building a pipeline in Snakemake to analyse some data. One of the data files I'm using is provided as supplemental data as part of this publication. The paper is behind a paywall, but I've ...
1 vote
3 answers
191 views
sed: To match a newline and spaces
I have the following file: <head> <title>this is a title</title> <style> here goes a style sheet </style> </head> I need to strip the <title> element ...
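The excerpt is cut off, so the exact goal is not visible. Assuming the aim is to delete the whole `<title>` element even when it spans several lines, one possible Python sketch follows. The function name `strip_title` is hypothetical, and a regular expression is a fragile stand-in for a real HTML parser.

```python
import re


def strip_title(html: str) -> str:
    """Remove the <title>...</title> element and the whitespace after it.

    re.DOTALL lets '.' match newlines, so a multi-line title is covered.
    """
    return re.sub(
        r"<title>.*?</title>\s*",
        "",
        html,
        flags=re.DOTALL | re.IGNORECASE,
    )
```

This sidesteps the line-by-line model that makes the task awkward in sed, because the whole document is matched as a single string.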
0 votes
1 answer
1k views
Curl a webpage and convert it to Markdown
I'm having a dilemma with downloading webpages and converting them to Markdown, for example: F=$(curl -O --silent https://www.guru3d.com/story/msi-teases-spatium-m560-ssd-with-innovative-nonmetallic-vc-...
1 vote
1 answer
114 views
How can I include any content in the sed replace command? [duplicate]
I want to be able to handle any type of content stored in the bash variable ${CONTENT}, to be used as sed replacement text in other content, no matter if there are quotation marks, single quotes ...
0 votes
1 answer
1k views
Is there a tool that preserves CSS formatting during HTML to PDF conversion?
I tried the options in "Is there a script or tool that converts HTML to PDF?" with the command: pandoc documentation.html -o test.pdf --pdf-engine=xelatex but unfortunately they do not preserve the CSS ...