[TAG] Time to drop the Site Map?
mso@oz.net
mso
Fri Oct 7 00:53:40 MSD 2005
mso at oz.net wrote:
> Note that prior to issue 93 or so, we have no metadata on the articles so
> we can't regenerate the links. The sitemap program has a string constant
> containing all the HTML for the legacy links.
Correction: we do have metadata (title, author ID, sort order). What we
don't have is the article body in a "source" format. The HTML file *is*
the source, with all its <head> tags, extra information around the title
or author lines, copyright blurb, evolving or custom formatting, etc.
There are also discrepencies between the metadata (extracted from the
TOCs) and the sitemap: missing links, different title wording, author's
name spelled differently, articles in different order. Perhaps they came
from the articles being updated and the sitemap not, but who knows for
sure? That's why we don't regenerate the old sitemap links but instead
hold onto the legacy HTML. If nobody has complained about a sitemap link
missing or misworded in all these years, I guess they're good enough.
--
-- Mike Orr <mso at oz.net>
More information about the TAG
mailing list