This is the first in a series of posts which constitute a “lit review” of sorts to document the range of methods scholars are using to compute the distribution of topics over time.
Graphs of topic prevalence over time are some of the most ubiquitous in digital humanities discussions of topic modeling. They are used as a mechanism for identifying spikes in discourse and for depicting the relationship between the various discourses in a corpus.
Topic prevalence over time is not, however, a measure that is returned with the standard modeling tools such as MALLET or Gensim. Instead, it is computed after the fact by combining the model data with external metadata and aggregating the model results. And, as it turns out, there are a number of ways that the data can be aggregated and displayed.
In this series of notebooks, I am looking at 4 different strategies for computing topic significance over time.
Read the full post here.
A growing number of researchers in the humanities are using computational tools and methods that are more typically associated with social and scientific research. These tools and techniques enable researchers to pursue new forms of inquiry and new questions and bring more attention to—and cultivate broader interest in—traditional humanities and humanities data. This paper from ECAR and the Coalition for Networked Information (CNI) outlines a practical framework for capacity building to develop institutional digital humanities support for IT staff, librarians, administrators, and faculty with administrative responsibilities.
Read more here.
From “Latest Success Story! Medieval Handwriting and Handwritten Text Recognition”:
Two partners in the READ project network have now successfully trained a new model to recognise Gothic handwriting! The State Archives of Zurich (READ project partner) and the University of Zurich (READ project Memorandum of Understanding partner) have collaborated on the automatic recognition of a collection of medieval charters.
In 1336 a cartulary was written in Königsfelden, close to the city of Brugg (which is now part of Switzerland). Königsfelden abbey was a well-endowed institution with close ties to the dukes of Habsburg. In a neat and regular handwriting, the charters of the institution were copied on roughly 260 parchment pages. The cartulary is available online via e-codices.
At the University of Zurich, there is an ongoing project to create a digital scholarly edition of the charters of Königsfelden abbey. The cartulary is an important source for early writing practices and has already been partially transcribed. The project team have been using our Transkribus platform to produce their transcriptions and they used these transcripts to train and test a Handwritten Text Recognition (HTR) model.
Read the full post here.