looking for a program that will scan the webserver log files for pages
served up by search engines—obstensibly for Disturbing Search Requests.
She hasn't found any, so today I quickly wrote one up for her.
The odd thing I noticed though, as I watched her use the program on her site and my site and my blog is that my blog has way more search requests than
In fact, going over the three largest sites on this server (www.springdew.com, www.conman.org and boston.conman.org) that my blog/online journal
here averages about twice the search requests as the other sites.
I think that has something to do with the way this site works. Google (just
to pick a search engine) will have indexed the same entry about five
times—once on main page, once for day, once for the month, once for the year (don't want to bog down the
server needlessly for that example) and once for itself.
I'm not sure how much that affects the final ranking of a particular page
since they're all intrasite links but it does have to skew the results
somehow. Somehow it feels like I'm Google Bombing
my own site with my own site.
Obligatory Contact Info
You have my permission to link freely to any entry here. Go
ahead, I won't bite. I promise.
The dates are the permanent links to that day's entries (or
entry, if there is only one entry). The titles are the permanent
links to that entry only. The format for the links are
simple: Start with the base link for this site: http://boston.conman.org/, then add the date you are
interested in, say 2000/08/01,
so that would make the final URL:
You can also specify the entire month by leaving off the day
portion. You can even select an arbitrary portion of time.
You may also note subtle shading of the links and that's
intentional: the “closer” the link is (relative to the
page) the “brighter” it appears. It's an experiment in
using color shading to denote the distance a link is from here. If
you don't notice it, don't worry; it's not all that
It is assumed that every brand name, slogan, corporate name,
symbol, design element, et cetera mentioned in these pages is a
protected and/or trademarked entity, the sole property of its
owner(s), and acknowledgement of this status is implied.