Monday, October 29, 2012

From graphs to applications


In 2012, the quantity of material collected by the HMT project has grown rapidly.  To cope with this, we have been developing an automated system to identify the relations among all citable objects in the HMT data archive (texts, images, artifacts like manuscript pages, to name a few).   In mathematical terms, these relations form a graph.

In the HMT graph, all nodes are identified by URN values (CTS URNs for texts, or CITE URNs for other kinds of objects).  This simple, consistent reference format made it easy for us to develop a network service  for working with the HMT graph:  supply the service with a URN value, and the service finds all links to that URN.

This is an important development for the long-term development of our digital multitext, and will certainly be the subject of future blog posts.  For today, I simply want to announce a test site with end-user applications built on the graph service.

Like our other services for retrieving HMT data, the graph service replies with a simple XML format;  as in our other service implementations, we can include XSLT stylesheets to format these graph descriptions as web pages for human readers.
We have written three sets of stylesheets that turn the graph data into three quite distinct applications:


  • a facsimile browser, for reading diplomatic edition of texts alongside documentary images
  • a multitext reader, for reading multiple versions of a single text
  • a graph navigator, for exploring links in the HMT project graph


You'll find test versions of all three of these apps at our new HMT Apps test page:  http://beta.hpcc.uh.edu/tomcat/hmtapps/

If you're curious about how the graph service works, try viewing the XML source of one of the application's web pages.  If you just want to try out an app, feel free.  Expect that the test versions on this site will evolve rapidly over the next several months.  We'll post announcements on this blog when we install more static release versions elsewhere.


Sunday, October 7, 2012

Announcing the Open Paleography project

The Homer Multitext blog is an appropriate forum to announce a new project growing directly out of experience with the Homer Multitext project, and developing technology that will contribute directly to future work on the HMT project.

For almost three years, the HMT project has been collecting in structured notebooks paleographic observations about the manuscripts we are editing.  With the announcement of the Open Paleography project, we aim to expand this work to a general crowd-sourced collection of paleographic observations.

The Open Paleography project differs from other projects with similar aims in its application of the CITE architecture.  Paleographic observations identify a physical artifact, a textual passage, and a region of interest on a documentary image using technology-independent, machine-actionable URNs.  In turn, each observation itself is identified with a CITE URN.  The openly licensed data set is exposed to the software and end-users in the following ways:


  • because all the data sets are CITE Collections, they are available through the CITE Collections Service API
  • because data are stored in Google Fusion Tables,  they are available both through Google's programmatic API and through the user interfaces to Google Fusion Tables


The Open Paleography project is currently testing and helping develop two generic applications that work with any CITE Collection. The first is a collaborative CITE Collection editor allowing authorized contributors to add to a CITE Collection from a Web browser.  The second is a general querying and viewing application for end users.  Both of these applications will find immediate application in the HMT project.

Home page of the Open Paleography project:  http://shot.holycross.edu/op