1. agg, the news aggregator
    1. about
    2. news
    3. changelog
      1. 2011-06-26 agg-0.3.0
      2. 2011-05-11 agg-0.2.1
      3. 2011-05-10 agg-0.2.0
      4. 2011-04-16 agg-0.1.1
      5. 2011-04-08 agg-0.1.0
    4. dependencies
    5. install
    6. faq
      1. Writing file names that are are specified in the feed? What about security?
      2. But a malicious feed could use up all space/inodes.
      3. Why no download mechanism?
      4. But do I have to download the feed by hand?
      5. But it only works on a single feed!
      6. Why no user interface?
      7. How to fetch only new items from feeds that don't use publication dates?
    7. bugs to be fixed
    8. authors
    9. repo
    10. homepage
    11. download
    12. license

agg, the news aggregator

about

agg is a news aggregator (currently RSS 2.0 only) for POSIX-compliant systems (currently tested on GNU/Linux only).

It follows the UNIX philosophy and simply reads a news feed from stdin and creates or updates a filesystem representation of that feed.

No command line parameters, no user interface, not even networking.

news

changelog

2011-06-26 agg-0.3.0

2011-05-11 agg-0.2.1

2011-05-10 agg-0.2.0

2011-04-16 agg-0.1.1

2011-04-08 agg-0.1.0

Initial release.

dependencies

install

make test install

For configuration see Make.config.

Please, run the test suites, they've been written for you and take few seconds on a 500 MHz CPU anyways.

faq

Writing file names that are are specified in the feed? What about security?

agg removes all slashes from file and directory names before they are written, so everything ends up where it belongs. You should run it in a dedicated directory, though.

But a malicious feed could use up all space/inodes.

Depends on your operating system (configuration). It's not the job of a news aggregator to enforce quotas.

Why no download mechanism?

Because it's a news aggregator, not a download-and-news-aggregation-program.

But do I have to download the feed by hand?

wget $URL -O - | agg

But it only works on a single feed!

for feed in `cat feeds`; do
    (wget $feed -qO - | agg) &
done

You get the point.

Why no user interface?

Because it's a news aggregator, not a download-and-news-aggregation-and-news-reader-program. The file system hierarchy is pretty much usable using various unixoid tools.

Sky is the limit. Feel free to write your own frontend; you should be able to find mine on my blog.

How to fetch only new items from feeds that don't use publication dates?

Not supported by agg itsself, since it would require a second level storage that contains (hashes of) everything the agg directory contained -- including items you explicitly deleted. You can easily build such functionality on top using a few lines of shell code.

Again, its a news aggregator not a caching program.

bugs to be fixed

authors

repo

homepage

download

license

Copyright (C) 2011 Andreas Waidler arandes@programmers.at

Permission to use, copy, modify, and/or distribute this software for any purpose with or without fee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.

THE SOFTWARE IS PROVIDED "AS IS" AND THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.