Dump HTML content with lynx

Lynx is an incredibly useful text-only browser that you can use to read web pages interactively from the command line. I have sometimes used it, for example, to reach websites restricted to certain areas, by running it on a server I could access inside one of them.

It can also be used non-interactively to write a web page to standard output, either as the HTML source:

lynx -source "URL" > page.html

Or the plain text:

lynx -dump "URL" > page.txt
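
If you often want both versions of the same page, the two commands can be wrapped in a small shell function. This is only a sketch: dump_page is a name I made up (it is not part of lynx), and the URL in the example call is a placeholder.

```shell
#!/bin/sh
# Save both the HTML source and the rendered plain text of one page.
# dump_page is a hypothetical helper name, not something lynx provides.
# Usage: dump_page URL BASENAME  ->  writes BASENAME.html and BASENAME.txt
dump_page() {
    url=$1
    name=$2
    # Raw HTML, as sent by the server
    lynx -source "$url" > "$name.html" || return 1
    # Text rendering, as lynx would display it
    lynx -dump "$url" > "$name.txt" || return 1
}

# Example call (placeholder URL):
# dump_page "https://example.com/" page
```

The function stops at the first failing lynx call, so you do not end up with a text file for a page whose source could not be fetched.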
