Editors’ Choice: Ways to Compute Topics over Time, Part 1

By: Jeri E. WieringaJune 22, 2017June 22, 2017

Creative Commons image by vial3tt3 via Flickr

This is the first in a series of posts which constitute a “lit review” of sorts to document the range of methods scholars are using to compute the distribution of topics over time.

Graphs of topic prevalence over time are some of the most ubiquitous in digital humanities discussions of topic modeling. They are used as a mechanism for identifying spikes in discourse and for depicting the relationship between the various discourses in a corpus.

Topic prevalence over time is not, however, a measure that is returned with the standard modeling tools such as MALLET or Gensim. Instead, it is computed after the fact by combining the model data with external metadata and aggregating the model results. And, as it turns out, there are a number of ways that the data can be aggregated and displayed.

In this series of notebooks, I am looking at 4 different strategies for computing topic significance over time.

Read the full post here.

Editors’ Choice: Mozilla AI at Internet Archive Europe: Owning Your AI Stack
by Beatrice Murch
July 15, 2026
Editors’ Choice: Pre-revolution network connections of the 1989 Polish Round Table participants
by Kelly Bodwin, California Polytechnic State University - San Luis Obispo;, Gregory F. Domber, California Polytechnic State University - San Luis Obispo;, Riley Sanders, California Polytechnic State University - San Luis Obispo
July 15, 2026
Editors’ Choice: Dataset Context(ualisation) in Documentation: Best Practices, Recommendations and Open Questions | Journal of Open Humanities Data
by Henk Alkemade, Gustavo Candela, Steven Claeyssens, Selda Eren, Maria Eskevich, Nuno Freire, Antoine Isaac, Jörg Lehmann, Giulia Osti & Mari Wigham
July 15, 2026