|
1 year ago | |
---|---|---|
doxa_rss | 1 year ago | |
.gitignore | 1 year ago | |
LICENSE.txt | 1 year ago | |
README.md | 1 year ago | |
setup.py | 1 year ago |
doxajournal.ru, an internet magazine in Russian that mostly writes about modern universitites and things related to them, uses a fairly outdated CMS that generates HTML that seems to be close to being completely obfuscated.
They do also have no RSS/Atom feed, so I decided to write a scraper that would generate one for me.
This is a simple Python/Flask app, and is run just like any other Flask app.
Just make sure to set the DOXA_KEY
environment variable to something long,
and then point your feed reader at localhost/<password you just set>
. Unless
you set the variable manually, it will be set to doxa
for you.
This app depends on flask
, beautifulsoup4
, requests
, feedgen
,
python-dateutil
, and pytz
.
See LICENSE.txt.