Dump html content with lynx

Lynx is an incredibly useful text-only browser, that you can use to read webpages (interactively) from the command line. I used it sometimes to access websites restricted to some areas where I could access a server, for example.

It can be used to output the webpage (non interactively) to the standard output, both the HTML source:

lynx -source "URL" > page.html

Or the plain text:

lynx -dump "URL" > page.txt
Advertisement

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s