Resource: Waedeker: Wikipedia’s knowledge in a handy Baedeker format
Create a compact offline Wikipedia archive as a ZIM file for any region in the world – perfect for hiking, expeditions or areas without network coverage. See full post.
Create a compact offline Wikipedia archive as a ZIM file for any region in the world – perfect for hiking, expeditions or areas without network coverage. See full post.
Cambridge’s GLAM institutions (galleries, libraries, archives, garden and museums) house millions of objects from across the globe, representing an unparalleled repository of cultural and natural history. However, challenges such as analogue formats, handwritten records, fragmented objects, multilingual sources and complex surfaces make much of this data difficult to access. To address these challenges, the AI […]
A worsening problem I am having is an overall decline in basic digital literacy in my students. Since many of my classes turn on interrogating humanities materials with digital tools, or interrogating the digital from a humanities perspective (ie, DH!) this means I am spending ever more frustrating amounts of time just trying to get […]
The ultimate guide on how to write alt text and image descriptions for the visually impaired, written by someone with low vision who uses alt text. I started writing guides on how to write alt text and image descriptions for the visually impaired people like me who have to guess what is in an image […]
A 23-page standard-size, full-color zine co-created by Quinn Dombrowski, Tessa Walsh, Anna Kijas, Ilya Kreymer, and Amanda Wyatt Visconti. DIY Web Archiving shows you why everyone should participate in preserving the things on the web they care about, and how anyone can do so (no special expertise required!). Based on the 11/25/2024 virtual workshop co-sponsored […]
I’m sharing the product of five hours of work with Claude Cowork spread over a day. I have been using Cowork now for ten days and this is my biggest project to date. I’m an historian, not a coder. Cowork has been a revelation. The product is a static five annotation layer IIIF viewer for […]
The MultiClinAI Track is organized by the Barcelona Supercomputing Center’s NLP for Biomedical Information Analysis group and promoted by European projects such as DataTools4Heart and AI4HF. MultiClinAI is a shared task focused on the creation of comparable multilingual corpora via annotation projection, as well as the multilingual extraction of clinical concepts. See full post.
I want to share an interesting pattern – get claude code on the web (or any other autonomous agent + sandbox) to do R&D on a coding question. Based on these blog posts – https://simonwillison.net/2026/Feb/6/pydantic-monty/ and https://simonwillison.net/2025/Nov/6/async-code-research/. I wanted to learn about WASM and Pyodide as I don’t know much about these technologies. I asked […]
Artificial Intelligence (AI) is now a regular topic of conversation in archives. Managers and stakeholders are asking whether AI can speed up description, identify sensitive content, or provide new forms of access. This document offers practical guidance on how to prepare archival collections for AI in ways that remain true to archival principles and ethical […]
Extracting Keywords from Crowdsourced Collections was a Digital Scholarship @ Oxford (DiSc) Research Development Grant-funded project based in the Faculty of English at the University of Oxford. Using the Their Finest Hour Online Archive, a digital collection of 2,000+ records and 26,000+ files related to the Second World War, as a case study, this project […]