~callum/readable-rss

improvements
Use date from <time> tag if present
Fix for GNU diff and change chars used in paths

refs

main
browse  log 

clone

read-only
https://git.sr.ht/~callum/readable-rss
read/write
git@git.sr.ht:~callum/readable-rss

You can also use your local clone with git send-email.

#readable-rss

Purpose: Download articles from those annoying RSS/Atom feeds which don't include the full article.

Included: A shell script and javascript program which extracts the article URLs, downloads them and converts them using Mozilla's readability.js.

#Requirements

  • Standard UNIX utilities
  • Curl
  • nodejs
  • jsdom (npm install jsdom)
  • Mozilla's readability script (npm install @mozilla/readability)

#Usage

Put HTTP/S RSS feed URLs in a file called feed-urls, one URL per line.

$ chmod +x get-articles.sh
$ ./get-articles.sh

Articles will be in the articles/ folder. The dates are the date the article was downloaded, not necessarily when it was published.