[xquery-talk] MarkMail now archiving xquery-talk
Jason Hunter
jhunter at servlets.com
Thu Nov 29 18:26:28 PST 2007
For the last few months I've been working on a new project at Mark
Logic: a web site for interacting with email archives. It makes heavy
use of XML and XQuery. Each email is stored internally as an XML
document, and all searches, faceted navigation, analytic calculations,
and HTML page renderings are performed in XQuery on a single MarkLogic
Server machine.
We launched the site about three weeks ago. At launch we had loaded all
the public emails from the Apache Software Foundation. That's about 500
lists and 4,000,000 messages. We've now started adding other lists, and
we loaded xquery-talk this week:
http://x-query.markmail.org
We also loaded a few others:
http://xml-dev.markmail.org
http://xsl-list.markmail.org
http://css-d.markmail.org
You can search across all 4.5M emails from the home page:
http://markmail.org
As you'll see with the chart, one of our goals with the site has been to
focus heavily on analytics. We have lots of graphs and counts. Every
query you write gets its own histogram chart.
http://x-query.markmail.org/search/?q=subject%3Aupdate
Another goal has been interactivity. Every search result screen gives
you lots of ways to refine your search (by sender, list, attachment
type, etc). Plus we did a lot with keyboard shortcuts. You can hit "n"
and "p" to move to the next and previous result and "j" and "k" to move
up and down the thread view. There's a lot of little things like this.
Plus if your result message includes Office or PDF files they're in-line
interactive too.
http://markmail.org/search/ext:ppt+xml
The subdomain you use implicitly limits the messages you search. Thus
http://x-query.markmail.org searches only lists with "x-query" in their
name (a single list).
You can search all Apache lists at http://apache.markmail.org, all
Apache Axis lists at http://axis.markmail.org, or across all lists at
http://markmail.org. You can always limit your search view using
"list:axis" in your query, but using the domain handles that a bit more
elegantly.
I hope you all find this useful!
Notes on using the site:
* Search using keywords as well as from:, subject:, extension:, and
list: constraints
* The GUI doesn't yet expose it, but you can negate any search item,
like -subject:soap
* You can sort results by date by adding order:date-forward or
order:date-backward to your query
* Remember to use "n" and "p" keyboard shortcuts to navigate the search
results
* You're going to want JavaScript enabled
-jh-
More information about the talk
mailing list