Selected Papers from the 2013 TEI Conference Bridging the Gap : Greater Usability for TEI encoding

This article shows how, with a focus on user-friendliness and a bottom-up approach, the digital work environment ediarum was successfully developed. With ediarum, researchers can comfortably encode and edit in TEI, as well as publish their results in an online or print edition. This solution, developed by the TELOTA initiative of Berlin-Brandenburg Academy of Sciences and Humanities, is based on three software components: eXistdb, Oxygen XML Author, and ConTeXt. These components are combined, supplemented with additional functions, and tailored to t a project’s needs. After a pilot run, ediarum has been implemented for multiple internal and external research projects. The experience that was gained and our self developed program components are available to the Digital Humanities community.

and money.Moreover, the know-how is often not available to correctly encode transcriptions into TEI.An important additional factor is that despite the many advantages of a Web publication, the printed edition-especially for veteran edition projects-remains the bar of what serious results should look like.Thus a new technology must justify itself not only economically, but academically as well (see Sahle 2013, 109-10).
Another cause of this gap is the dearth of programs to help researchers who lack the technical knowledge to transcribe and annotate their texts in TEI.While other prevalent text-editing programs oer an intuitive user interface, this is not the case for most XML editors. 1 A factor contributing to the lack of such a user interface could be the high complexity of a TEI-encoded edition.

The Bridge: Greater Usability
The solution that is repeatedly suggested-that researchers should learn TEI-XML or even learn how to program-does not seem viable, and in any case simply shifts the problem to the side of the user. 2 The lack of knowledge of how to encode a manuscript in TEI should not hinder the use of TEI.On the contrary, an easy method to adopt TEI for using should be made available.The center of editorial work should remain the evaluation and examination of manuscripts. 3 one considers the developments in graphical user interfaces over the last few years, one observes that programs are oering simpler and more intuitive interfaces-even with sometimes more complex program tasks and functions behind them.It is not without reason that there are now entire elds that concern themselves with the "ergonomics" of software.
In our opinion, the Digital Humanities community should not avoid such developments but instead should take advantage of them.A simple and intuitive user interface would help reduce the fear and frustration that often meets new technologies (Nielsen 1993, 24-25). 4User-friendly programs create a positive user experience and would increase interest in using TEI for the transcription and annotation of manuscripts.
A fundamental part of user-friendly software is providing the user with visual feedback for an executed action (Nielsen 1993, 134).For example, after the appropriate markup for a crossedout piece of text is entered, this text can then be displayed as crossed out.Although this type of feedback might seem trivial, another type certainly is not: print output.As mentioned above, the printed edition is still the main goal for most scholarly edition projects, even when it is supplemented with a Web publication (Sahle 2013, 111-112).When the formatting process begins after the encoding is completed, the researcher can only check their work in an abstract manner.
When the publication and creation of the edition is not separated, but happens simultaneously in one work environment, the researcher has the benet of being able to see his or her markup with one click displayed properly in the desired medium.This not only improves the supervision of the work in progress, but also increases trust in and satisfaction with the technology-in this case the TEI encoding of manuscripts.
2. The Pilot Project 8 TELOTA, 5 the digital humanities initiative at the Berlin-Brandenburg Academy of Sciences and Humanities (BBAW), 6 had to confront these issues as they were given the task to develop a digital work environment for the edition project Schleiermacher in Berlin 1808-1834. 7This environment was intended to assist the researchers in transcribing, encoding, and annotating the letters, lectures, and daily calendar of the Protestant theologian Friedrich Schleiermacher.In addition to creating a user interface for transcribing and editing, the environment had to be able to produce both a Web and a print edition from the TEI-XML data.Ideally this would happen automatically, so that the editors would be able to see their results immediately.

9
To develop such a solution from scratch requires signicant resources.To reduce time and eort of development it is preferable to build on existing software.After evaluating the available software, the team chose the following programs: • eXistdb 8 for the database in which the TEI les would be stored.A decisive factor was the ability to retrieve and run XQuery and XSLT scripts from within the eXistdb and not just externally.
• Oxygen XML Author 9 as the user interface for entering and editing the centrally stored TEI les.Decisive were the extensive functions of Oxygen XML Author that allowed visual editing of XML for persons without technical knowledge.ConTeXt 10 for the formatting language to generate an automatic print-ready proof from the XML les.Among its benets were its use of the long-established formatting language TeX, the possibility of processing XML directly, and the option to add one's own functions with the help of the programming language Lua.
Although eXistdb and ConTexT are open source software, this is not the case for Oxygen XML Author.A completely open-source solution would have been preferable, but there is no software currently available that oers such extensive functions for end users.
Because of the solution's modular organization, the software components can be replaced if necessary (for example when a tool is no longer supported by the community or a company).
Essentially every piece of software can and will become outdated at some point; what is important is that the data is in a format that can still be reopened and reused.

A Central Database Enables Collaborative Work 13
The digital work environment uses the open-source XML database eXistdb as its central repository for the TEI documents.The database is installed on a server and available online.This gives all the researchers on the project access to the same data collection, allowing them to work collaboratively.The contents of the database are directly available and searchable for users as a collection of les in Oxygen XML Author.Collaborative access is made possible through the WebDAV protocol, which locks les that are in use to avoid conicts and unintended overwrites.

A User-Friendly Environment with Oxygen XML Author 14
Oxygen XML Author was chosen to create the user interface with which the editors transcribe and edit in the work environment. 11While using this program, the editors usually do not see the code, but instead are in a user-friendly "author view" that is styled with CSS.More than one sub-view was made for the project within this author view, so that editors can choose the one most tting for their current phase of work.In the various views, only certain TEI markup is shown or highlighted in the transcription.For example, when an editor only wants to examine the indexed passages of a text, then he or she can choose the view which only highlights the <index> elements.Above all, however, the end user can enter markup into the manuscript or into the TEI Header with one click in the toolbar.In this way, text phenomena like struck-out text can be documented, editorial comments can be added, and the creation date of a manuscript can be noted.For every type of editorial markup there is a special button that enters the appropriate valid XML.If needed not only single elements but complex XML fragments will be inserted.Furthermore, the buttons for the metadata insert proper elements in the correct place of the <teiHeader>.If necessary, the editor is asked via an extra dialog box for the value of an attribute.The editor is thus not concerned with the name of the actual TEI element, whether it needs an attribute, or whether in this context there are structural variations to be considered.The entire text can thus be quickly and simply marked up with TEI conformant XML.The researcher can also, if desired, view the XML code at any time for more precise editing or simply to follow the process more closely.

Central Indexes 16
Besides the transcription of the text, the database can also include central indexes (for persons, places, works, etc.) in the form of XML les.These can be edited and linked through the texts.
If, for example, one wishes to mark up a person's name, one selects the word in the text to be indexed, and a dropdown list appears with all the names from the person index, from which the editor can choose the appropriate one.Selecting the name causes both the correct element <persName> and the correct ID of the person (as an attribute) to be entered.The index with the actual references to the correct text passages is then automatically created for both the print and Web publications from the marked-up TEI document.This index function is created through a specially-programmed Java operation for Oxygen XML Author and with various XQuery scripts in the database.When programming the Java operation, we used parameters and not constants in the source code.Thus, the parameter values could be comfortably entered in Oxygen XML Author and changed if necessary.

Print Edition 19
A further output option is the print edition, implemented with the help of ConTeXt, which automatically generates a PDF (at any point in the workow) from a TEI document.With the correct conguration, the format and presentation of the PDF can meet the research project's exact needs for printed editions.Each TEI element is given a specic formatting command through a conguration le.In this way the dierent apparatuses can, for example, appear as footnotes that refer to the main text with the help of line numbers and lemmata.The print edition can also provide the correct index for each transcription and resolve any cross-references between manuscripts.The print edition helps the researchers to check their TEI-encoded transcriptions; additionally, it obviates the need for a separate typesetting process.

Feedback
The researchers of Schleiermacher in Berlin 1808-1834 had a two-hour training session introducing them to the work environment.They were also given an extensive handbook, a quick start guide, and a "cheat sheet" with an overview of all the available functions. 12r working with the environment for two months, the researchers were asked to share their opinions.The feedback was overwhelmingly positive, and they conrmed that the environment eased their work on the edition and saved them time.The ability to check their work immediately on the website or in the print edition was seen as a great advantage.The researchers were also thankful that they were not expected to work directly in the XML code, but instead could mark up the texts with TEI conform XML in a simple graphical user interface.At the same time, some researchers had familiarized themselves with the TEI XML behind the interface and did sometimes edit directly in the XML, especially in the beginning when some functions were not yet available.

From the Bottom Up
In the end the pilot run was so successful that other projects within the Berlin-Brandenburg Academy of the Sciences and Humanities expressed interest.In the spring of 2013 the work environment was implemented for the Academy's project Commentaria in Aristotelem Graeca et Byzantina (CAGB) 13 and in the summer of 2013 for Regesta Imperii-Friedrich III. 14 This was done Bridging the Gap: Greater Usability for TEI encoding 10 using the same technical concept, but with customized functions for each project for the text entry and for the publication (Web and print).This was due to the fact that the TEI schemata were in both cases very dierent from the pilot project.For example, CAGB is not an edition of manuscript transcriptions but of manuscript descriptions.In contrast, the time and eort for implementation would be minimal if one were using a schema from a project for which the work environment is already implemented.The amount of work does not depend on the software or technology, but on the complexity and diversity of TEI-encoded editions.
With every new implementation, the work environment developed further and was supplemented with new functions.This applied not just to the Java operations but also to the XQuery scripts, which are responsible for various fundamental tasks such as making the index le available for the Java operations.For these frequently used XQuery and XSLT scripts and for further congurations, an eXistdb application 15 was developed in 2013 that allows for a simple and fast installation into the database. 16New functions are then, when suitable, integrated into the already existing work environment so that after the launch of the work environment the operability continually improves.
After we published a report about ediarum, 17 it caught the attention of other research institutions.
During this time TELOTA introduced and explained the environment to interested institutions in various presentations.As a result, for example, the Academy of Science and Literature Mainz adopted the concept of ediarum and built their own digital work environment based on it.Other interested persons can read about the concept in a series of blog entries that are written as tutorials. 18In addition, the custom Java functions were made available by TELOTA on GitHub for the TEI community. 19There the functions can be downloaded and used directly in Oxygen Frameworks or, if necessary, changed in accordance with theGNU LGPL (Lesser General Public License).
At the end of 2013, TELOTA began implementing ediarum for the historical-critical edition of Jeremias Gotthelf. 20In this collaboration with Bern University, new functions are being developed and existing ones improved.For instance, we plan to build a function to allow end users to insert overlapping markup (using, e.g., <anchor> elements and @spanTo) in Oxygen XML Author.

Summary and Prospects
The bottom-up development of ediarum with its strong focus on user-friendliness brings with it certain implications.
It has required a lot of work during the initial implementation and customization of the environment, especially to generate the website and print edition.TELOTA is now considering simplifying the creation of the Web and print editions, so that less time is needed for programming.
The goal, however, will be not to develop a complete plug-and-play solution.That would be impossible, considering the complexity and diversity of critical editions.The aim for a future version of ediarum will be rather to create a solution for encoding editions in TEI which involves the lowest possible amount of time and eort for programming while being customized to the specic needs of a project.
The concept of ediarum's development with its tailored solution has various advantages: • Time and eort for programming is spared through the consistent use of existing software and the elimination of extraneous functionality not required by a particular project.In this way a functional software solution is reached more quickly and practically.

•
The work environment is customized to the exact needs of the project.Due to the userfriendly interface, researchers enjoy operating it and use it in their daily work.

•
The automatic generation of the Web and print edition from the XML le allows the researcher to immediately check their TEI encoding.This eases the publication process, leaving more time for the actual editorial work.
Thus, through addressing the needs of a specic project, we found a solution that can be used by many: ediarum helps bridge the gap between hesitant users and the many possibilities and advantages of TEI encoding.

Figure 1 .
Figure 1.Workflow in the digital work environment.

Figure 2 .
Figure 2. Toolbar for the Schleiermacher edition with functions for inserting TEI markup.

Figure 3 .
Figure 3. Customized Oxygen XML Author with the index function's dialogue box opened.

Figure 4 .
Figure 4. Website for the Schleiermacher edition.

Figure 5 .
Figure 5. Print edition for the Schleiermacher edition generated via ConTeXt.