Editors’ Choice: LLMBench: A Comparative Close Reading Workbench For Large Language Models

By: BerryDMApril 22, 2026April 22, 2026

Editors’ Summary: In this post, the author provides a detailed overview of the functions of the new tool, LLMbench. Berry points to Google PAIR’s LLM Comparator as a useful tool for side-by-side evaluation of models from the perspective of model developers, but the tool lacked the ability to do comparative close reading. LLMbench is a browser-based tool, like Voyant, that treats the text itself as a probabilistic object. The author details the six modes of LLMbench and how they can be utilized for humanistic research, including the ‘Compare’ mode that relies on the “logprob” data. He argues that the logprobs are an underutilized tool in humanistic and social scientific readings of AI. The deployed version is available at https://llm-bench-mu.vercel.app/.

See full post.

Editors’ Choice: Mozilla AI at Internet Archive Europe: Owning Your AI Stack
by Beatrice Murch
July 15, 2026
Editors’ Choice: Pre-revolution network connections of the 1989 Polish Round Table participants
by Kelly Bodwin, California Polytechnic State University - San Luis Obispo;, Gregory F. Domber, California Polytechnic State University - San Luis Obispo;, Riley Sanders, California Polytechnic State University - San Luis Obispo
July 15, 2026
Editors’ Choice: Dataset Context(ualisation) in Documentation: Best Practices, Recommendations and Open Questions | Journal of Open Humanities Data
by Henk Alkemade, Gustavo Candela, Steven Claeyssens, Selda Eren, Maria Eskevich, Nuno Freire, Antoine Isaac, Jörg Lehmann, Giulia Osti & Mari Wigham
July 15, 2026