The editorial work of the Homer Multitext project is ongoing, and, as good photography of more manuscripts and papyri becomes available, is open-ended. While we have provided openly licensed access to our source images and editorial work in progress since our first digital photography in 2007, we have not previously offered packaged publications of our archive.
That is changing in 2014. The project’s editors have decided on a publishing cycle of roughly three issues a year (since our work tends to be concentrated around an academic calendar of fall term, spring term, and summer work). Published issues of the project archive must satisfy four requirements.
- The issue must be clearly identified. Our releases are labelled with a year and issue number: our first issue is 2014.1.
- All content published in a given issue must pass a clearly identified review process. Teams of contributing editors work in individual workspaces. (We use github repositories to track the work history of these teams.) When a block of work passes a series of manual review and automated tests, it migrates from “draft” to “provisionally accepted” status and is added to the project’s central archival repository. This is the repository that we are publishing for the first time this week.
- All published material must be in appropriate open digital formats. Apart from our binary image data, all the data we create are structured in simple tabular text files or XML files with published schemas.
- All published material must be appropriately licensed for scholarly use. All of our work is published under a Creative Commons Attribution-ShareAlike license. (Licenses for some of our image collections additionally include a “non-commercial” clause: in those cases, a license for commercial reuse must be separately negotiated with the copyright holder.)
Access to the Published Digital Archive
The published packages are available for download from http://beta.hpcc.uh.edu/hmt/archival-publications as zip files. An accompanying README explains the contents of each zip file.
We are also distributing our published issues as nexus artifacts (previously mentioned briefly here), a system that allows software to identify and retrieve published versions automatically. Whether manually or automatically downloaded, it now becomes possible for scholars (and their software) to work with citable data sets from the constantly changing archive of the HMT project.
Tracking Work in Progress
We will continue to make our work in progress available. For easy access to the current state of “provisionally accepted” material in our archive, we also generate a nightly set of packages. These are available for manual download here, but are not distributed through our nexus server.
They should be considered unpublished: other publications should cite only published issues of the archive.
They should be considered unpublished: other publications should cite only published issues of the archive.
Like our individual editorial teams, we manage our publication repository through github: http://homermultitext.github.io/. Our data archive includes a publicly available issue tracker where you can submit questions or bug reports, and follow our progress.
More technical information
If you’re interested in technical information about how we develop the published archive and use it to build applications, Christopher Blackwell and I have recently published a discussion here.
Links
- Downloadable zip files of published data: http://beta.hpcc.uh.edu/hmt/archival-publications
- Nightly versions of “provisionally accepted” work: http://beta.hpcc.uh.edu/hmt/nightlies
- Homer Multitext project on github: http://homermultitext.github.io/
- Issue tracker for the publication archive: https://github.com/homermultitext/hmt-archive/issues
- maven coordinates for automated retrieval of packages in group org.homermultitext, with version number 2014.1:
- hmt-collections (CITE Collections)
- hmt-images (documentary information from the project’s Image Collections; binary image data for the cataloged images may be downloaded from http://amphoreus.hpcc.uh.edu/hmt/hmt-image-archive/)
- hmt-indices (CITE Indices)
- hmt-editions (editions of texts), in two formats identified with maven type
- xml (TEI-compliant XML)
- tabulated (tabular representation)
- hmt-rdf (Complete represenation of the published issue as an RDF graph in TTL format)