Our goals:

This project aims to aggregate, parametrise, and synthesise primary datasets that inform the evolution of societies in the Ancient Mediterranean (ie. examine the links between social connectivity and complexity manifest in material and textual evidence). In the process, the team will develop a suite of digital approaches and tools, inspired by large-scale ecological projects like Ocean Health Index (Lowndes et al. 2017). The two mains outputs of this project will be

  • a comparative study of proxies for evolution of social complexity in the Ancient Mediterranean
  • digital tools, workflows and processes that scale and that historians and archaeologists can use in their own research.

In its digital aim, this project will deliver the next component of a digital ecosystem for ‘small-data’ domains like the humanities, namely research infrastructure for combining 'messy' and heterogeneous legacy data with 'clean' digitally born data, allowing for their streamlining and analysis in a collaborative environment.

In its empirical mission, this project will evaluate existing hypotheses of Ancient Mediterranean dynamics by aggregating and synthesising available digital datasets.


The Social Dynamics in the Ancient Mediterranean (SDAM) research group is supported by the the Aarhus University Research Foundation in 2019-2022.

  • First peak at multigraph

    Climbing the Mountain Some while ago, I saw this comic about the never-ending story of learning programming. The never-ending story applies to any collaborative discovery endeavour. At SDAM, we have been agonizing over what can we actually do with all the large data, once we aggregated it. It’s a question that’s been giving us anxiety, because until you harvest, streamline and wrangle your data, you cannot even start answering it. Just like a potter who collects clay from an unknown source without knowing whether it will be good for anything. You knead and levigate the cold, hard mass and only...

  • Lemmatization and POStagging of Ancient Greek Texts

    Introduction In the second half of January, while I was working with Vojtěch Linka on a quantitative text analysis of the concept of pain in Corpus Aristotelicum and Corpus Hippocraticum, we realized that the data which I regularly use for this type of tasks (i.e. Lemmatized Ancient Greek dataset, LAG) are substantially incomplete. We noticed this fact once we were working with Aristotelian texts (tlg0086), as we realized that the dataset completely misses such works as Ethica Nicomachea and Ethica Eudemia - works really crucial for our task at hands! This was quite surprising finding, since from the LAG documentation...

  • Epigraphic Database Heidelberg using R

    This post is about accessing the “Epigraphic Database Heidelberg” (EDH), which is one of the longest running database projects in digital Latin epigraphy. The [EDH] database started as early as year 1986, and in 1997 the Epigraphic Database Heidelberg website was launched at https:/edh-www.adw.uni-heidelberg.de where inscriptions, images, bibliographic and geographic records can be searched and browsed online. Despite the possibility of accessing the [EDH] database through a Web browser, it is many times convenient to get the Open Data Repository by the [EDH] through its public Application Programming Interface (API). For inscriptions, the generic search pattern Uniform Resource Identifier (URI)...

  • In search of good practices in digital research

    In the world of digital scholarship, I count myself among rather late adopters. It was during my doctoral studies when I first tried to learn some programming stuff. It took me another three years until I was able to write and run my own first programming scripts in Python. However, since I reached that stage a couple of years ago, I quickly became a programming-enthusiast, fully immersed in the exciting world of web scraping, exploring countless datasets available via GitHub, and always experimenting with new Python packages and libraries. Here I differentiate between two idealized types of behavior: computational skills...

  • Short report: User Acceptance Testing

    User Acceptance Test or UAT is a type of testing performed by the Client to certify the system with respect to the requirements that was agreed upon. This report is to illustrate the application of UAT in developing, testing, and releasing a new or updated R package. User Acceptance Test for R package release In the DTAP process (development → test → acceptance → production), which is part of a software development process, the release of a new version of an R package is at the final step where the deployment needs to meet certain requirements. The “acceptance” portion in...