i was playing with the google sitemap (meaning i was activating the google sitemap support in django :)
and then i had an idea… this sitemap basically describes all the urls of a site. with this, i could simply check all my urls on the w3 validator. automatically. every night, for example.
but this requires being able to get some kind of machine-readable output from the w3 validator.
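the url-collecting half of the idea can be sketched in a few lines of python. this is only a rough sketch, not what i actually run: the function name `extract_urls` and the sample sitemap are made up for illustration, and it assumes the sitemap is a plain xml document where every url sits in a `<loc>` element. talking to the validator is left out on purpose.

```python
import xml.etree.ElementTree as ET


def extract_urls(sitemap_xml):
    """return the text of every <loc> element in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    urls = []
    for element in root.iter():
        # tags look like "{namespace}loc"; compare only the local name so the
        # sketch does not depend on which sitemap schema version is in use
        local_name = element.tag.rsplit("}", 1)[-1]
        if local_name == "loc" and element.text:
            urls.append(element.text.strip())
    return urls


# hypothetical sitemap, made up for this example
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://example.com/</loc></url>
  <url><loc>http://example.com/blog/</loc></url>
</urlset>"""

for url in extract_urls(SAMPLE):
    print(url)
```

each url returned this way is one candidate to hand to the validator in the nightly run.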
as it turns out, there are 2 ways:
i use a lot of rss/atom feeds. by use i mean that i am reading many news-sites, blogs, etc. using their feeds.
for some reason those feeds always contain the last n entries, where n is a fixed number.
for all Django users…
please, please, pretty please use raw_id_admin (django model docs).
because, when you have a model that contains a ForeignKey (and people usually have ForeignKeys), then when you use an automatic manipulator for the given model, it will load in ALL THE DATA FROM ALL THE RELATED MODELS. for example, if you have 20,000 entries in the related table, then it will load in all those 20,000 entries.
except if you use raw_id_admin. it’s an attribute of the ForeignKey, and contrary to the documentation, its effect is not restricted to the admin-framework.
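to get a feeling for the difference, here is a plain-python illustration. it is not django's code, only a sketch of the two strategies: the functions `render_select` and `render_raw_id` and the fake related table are hypothetical names invented for this example. one strategy needs every related row to build a drop-down, the other only needs the id that is already stored.

```python
from html import escape


def render_select(field_name, related_rows):
    """one <option> per related row: cost grows with the related table."""
    options = "".join(
        '<option value="%d">%s</option>' % (pk, escape(label))
        for pk, label in related_rows
    )
    return '<select name="%s">%s</select>' % (field_name, options)


def render_raw_id(field_name, current_pk):
    """a single text input holding the id: no related rows needed."""
    return '<input type="text" name="%s" value="%d">' % (field_name, current_pk)


# stand-in for a related table with 20,000 entries (made-up data)
related_rows = [(pk, "entry %d" % pk) for pk in range(1, 20001)]

select_html = render_select("author", related_rows)
raw_html = render_raw_id("author", 42)

print(select_html.count("<option"))  # one option per related row
print(len(select_html), "characters vs", len(raw_html))
```

the drop-down version has to touch all the related rows before it can render anything, while the raw-id version never looks at the related table at all.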
Joseph Heck made some speed-tests regarding this issue