About this project
Origin
Roman Letters grows out of a simple observation in Patrick Wyman's 2016 USC dissertation, Letters, Mobility, and the Fall of the Roman Empire: the late Roman world left behind an extraordinary volume of surviving correspondence. Senators, bishops, monks, and imperial officials all relied on letters to maintain relationships across vast distances, and many of those letters still exist, scattered across digital archives and critical editions.
This project collects that scattered corpus into a single, structured database and provides tools for exploring the communication networks it reveals.
The dataset
The letters in the database span roughly 97 to 800 AD, covering the transition from the unified Roman Empire to the early medieval kingdoms of western Europe. Major collections include the letters of Augustine, Gregory the Great, Symmachus, Basil of Caesarea, Jerome, Cassiodorus, and Sidonius Apollinaris, among others.
Methodology
Text collection
Texts were collected by scraping and parsing freely available digital sources. Each source required a custom parser to handle its markup, encoding, and structure. Latin and Greek originals came primarily from The Latin Library, Tertullian.org, Perseus Digital Library, and OpenGreekAndLatin's First1KGreek project (CSEL XML editions). English translations came primarily from New Advent (the Nicene and Post-Nicene Fathers series) and Tertullian.org. Additional volumes were drawn from Internet Archive scans of MGH and CSEL print editions, Latin Wikisource, the Fordham Medieval Sourcebook, Livius.org, and Demonax.info. Where OCR-sourced text was used (particularly Patrologia Graeca volumes from Internet Archive), the raw text was cleaned to remove scanning artifacts before import.
Translations
English translations are drawn from established public-domain scholarly sources: the Nicene and Post-Nicene Fathers series (New Advent), the Attalus project, Livius.org, Tertullian.org (Roger Pearse), the Fordham Medieval Sourcebook, and CELT. Where no English translation exists, the original Latin or Greek text is provided. The original text is preserved alongside every translation.
A quality note: bulk translations of Greek collections (particularly Isidore of Pelusium and Libanius) are thematic renderings rather than precise philological translations. OCR-sourced texts from Patrologia Graeca may contain scanning artifacts in the Latin and Greek originals. Corrections from domain experts are welcome.
Recipient and sender identification
Sender and recipient names were extracted by automated parsing of letter headers (e.g., "To Eusebius", "Augustine to Paulinus") and then reconciled against a shared people table. Common variants, nicknames, and titles were normalized during a manual review pass. Where a letter's addressee is unknown or disputed, the recipient is recorded as "Unknown" or left blank.
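The header parsing described above can be sketched as two regular expressions plus an alias lookup. This is a minimal illustration, not the project's actual parser: the function names, the `Header` tuple, and the alias table are all hypothetical, and real headers will need many more patterns.

```python
import re
from typing import Dict, NamedTuple, Optional


class Header(NamedTuple):
    sender: Optional[str]
    recipient: Optional[str]


# "To Eusebius" -> recipient only
TO_RECIPIENT = re.compile(r"^to\s+(?P<recipient>.+)$", re.IGNORECASE)
# "Augustine to Paulinus" -> sender and recipient
SENDER_TO_RECIPIENT = re.compile(
    r"^(?P<sender>.+?)\s+to\s+(?P<recipient>.+)$", re.IGNORECASE
)


def parse_header(header: str) -> Header:
    """Extract sender and recipient names from a letter header, if present."""
    text = header.strip().rstrip(".")
    match = TO_RECIPIENT.match(text)
    if match:
        return Header(None, match.group("recipient").strip())
    match = SENDER_TO_RECIPIENT.match(text)
    if match:
        return Header(
            match.group("sender").strip(), match.group("recipient").strip()
        )
    return Header(None, None)


def reconcile(name: Optional[str], aliases: Dict[str, str]) -> str:
    """Map a parsed name to its canonical form; missing names become "Unknown"."""
    if name is None or not name.strip():
        return "Unknown"
    cleaned = name.strip()
    # Alias keys are lowercase variants (hypothetical table built during review).
    return aliases.get(cleaned.lower(), cleaned)
```

For example, `parse_header("Augustine to Paulinus")` yields a sender of `"Augustine"` and a recipient of `"Paulinus"`, while a header that matches neither pattern yields no names and would be left for manual review.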
Location assignment and confidence levels
Each letter's origin and destination was assigned geographic coordinates together with one of three confidence tiers, following Wyman's methodology:
- Strong - historically established location, confirmed by prosopographic data or explicit mention in the letter
- Approximate - inferred from collection context, known residence periods, or regional information
- Unknown - no reliable location data; coordinates not used in distance calculations
Confidence levels are displayed on individual letter pages and used to filter the map and network visualizations.
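One way to picture this filtering is as a small enumeration plus a predicate. The sketch below is illustrative only: the `Confidence` enum, the `Letter` fields, and the rule that both endpoints must pass the filter are assumptions, not the project's actual schema.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Iterable, List, Set


class Confidence(Enum):
    STRONG = "strong"
    APPROXIMATE = "approximate"
    UNKNOWN = "unknown"


@dataclass(frozen=True)
class Letter:
    # Hypothetical record layout, for illustration only.
    letter_id: str
    origin_confidence: Confidence
    destination_confidence: Confidence


def filter_letters(letters: Iterable[Letter], allowed: Set[Confidence]) -> List[Letter]:
    """Keep letters whose origin and destination tiers are both in `allowed`."""
    return [
        letter
        for letter in letters
        if letter.origin_confidence in allowed
        and letter.destination_confidence in allowed
    ]
```

Passing `{Confidence.STRONG}` would show only the best-attested routes, while adding `Confidence.APPROXIMATE` widens the view to inferred locations as well.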
Distance calculation
Straight-line distances between sender and recipient locations are computed using the haversine formula (great-circle distance between two latitude/longitude coordinates). Only letters with "strong" or "approximate" confidence on both endpoints are included in distance calculations.
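For readers unfamiliar with the haversine formula, a minimal sketch follows. The function name and the choice of 6371.0088 km as the mean Earth radius are assumptions made for this example; the project's own code may use a different constant or unit.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius (assumed value for this sketch)


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    delta_phi = phi2 - phi1
    delta_lambda = math.radians(lon2 - lon1)

    # Haversine of the central angle between the two points.
    a = (
        math.sin(delta_phi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(delta_lambda / 2) ** 2
    )
    # Clamp to 1.0 to guard against floating-point overshoot near antipodes.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))
```

Because the result is a distance over the surface of a sphere, it understates how far a letter actually travelled by road or sea; it is best read as a lower bound on the journey.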
Road routing
The map timelapse can optionally display routed paths along the ancient Roman road network rather than straight-line arcs. Road data comes from the Ancient World Mapping Center (AWMC) Barrington Atlas road network, provided as GeoJSON. Paths are computed using BFS (breadth-first search) over the road graph, snapping letter endpoints to the nearest road node. Where no road path can be found, the display falls back to a straight-line arc.
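The routing step can be sketched as follows, assuming the road GeoJSON has already been converted into an adjacency mapping and a table of node coordinates. All names here (`nearest_node`, `bfs_path`, `route`) and the data layout are hypothetical, and the nearest-node search uses a simple flat approximation rather than whatever metric the site actually uses.

```python
import math
from collections import deque
from typing import Dict, Hashable, List, Optional, Sequence, Tuple

Coord = Tuple[float, float]  # (latitude, longitude) in degrees


def nearest_node(point: Coord, nodes: Dict[Hashable, Coord]) -> Hashable:
    """Snap a point to the closest road node (flat-earth approximation)."""
    lat, lon = point
    scale = math.cos(math.radians(lat))  # shrink longitude differences away from the equator

    def squared_distance(node_id: Hashable) -> float:
        node_lat, node_lon = nodes[node_id]
        return (node_lat - lat) ** 2 + ((node_lon - lon) * scale) ** 2

    return min(nodes, key=squared_distance)


def bfs_path(
    graph: Dict[Hashable, Sequence[Hashable]], start: Hashable, goal: Hashable
) -> Optional[List[Hashable]]:
    """Breadth-first search; returns a path with the fewest edges, or None."""
    if start == goal:
        return [start]
    previous: Dict[Hashable, Optional[Hashable]] = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, ()):
            if neighbor in previous:
                continue
            previous[neighbor] = node
            if neighbor == goal:
                # Walk the predecessor links back to the start.
                path = [goal]
                while previous[path[-1]] is not None:
                    path.append(previous[path[-1]])
                return path[::-1]
            queue.append(neighbor)
    return None


def route(
    origin: Coord,
    destination: Coord,
    graph: Dict[Hashable, Sequence[Hashable]],
    nodes: Dict[Hashable, Coord],
) -> List[Coord]:
    """Return road-node coordinates along the route, or a straight line as fallback."""
    path = bfs_path(graph, nearest_node(origin, nodes), nearest_node(destination, nodes))
    if path is None:
        return [origin, destination]
    return [nodes[node_id] for node_id in path]
```

Note that breadth-first search minimizes the number of road segments traversed, which is not necessarily the shortest route in kilometres; a weighted search such as Dijkstra's algorithm would be needed to minimize distance.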
Sources
Digital text sources
- New Advent - English translations of patristic letters from the Nicene and Post-Nicene Fathers series (public domain)
- The Latin Library - Latin originals from standard critical editions
- Tertullian.org (Roger Pearse) - Latin texts and English translations for several minor collections
- Perseus Digital Library - Greek and Latin originals with TEI XML markup
- Internet Archive - OCR scans of Patrologia Graeca volumes, MGH volumes, and CSEL print editions
- Latin Wikisource - Braulio of Zaragoza and other Iberian authors
- OpenGreekAndLatin / First1KGreek - CSEL XML editions of Ennodius, Ruricius of Limoges, Avitus of Vienne, and Paulinus of Nola
- dMGH (Monumenta Germaniae Historica) - Avitus of Vienne (MGH edition)
- Fordham Medieval Sourcebook - Selected letters of Boniface
- Livius.org and Demonax.info - Synesius of Cyrene
Map and geographic data
- DARE (Digital Atlas of the Roman Empire) - Historical map tiles (CC BY 4.0)
- AWMC (Ancient World Mapping Center) - Barrington Atlas road network GeoJSON (ODbL)
Scholarly references
- Patrick Wyman, Letters, Mobility, and the Fall of the Roman Empire (PhD dissertation, University of Southern California, 2016) - the primary scholarly framework for network analysis and geographic scope
- Cristiana Sogno, Bradley K. Storin, and Edward J. Watts (eds.), Late Antique Letter Collections: A Critical Introduction and Reference Guide (University of California Press, 2017) - reference for collection scope and authorship context
License
- Code - MIT License. Source available on GitHub.
- Data and translations - CC BY 4.0. Attribution: Roman Letters / romanletters.org.
- Map tiles - DARE tiles are CC BY 4.0. Credit: Johan Ahlfeldt, Digital Atlas of the Roman Empire.
- Road data - AWMC Barrington Atlas road network, available under ODbL.
Credits
- Created by Craig Vander Galien
- Inspired by Patrick Wyman and his dissertation on late Roman epistolary networks
Open source
The full dataset, scraping scripts, and this website are open source. The dataset is permanently archived and available on multiple platforms.
Cite this dataset: Vander Galien, Craig. (2026). Roman Letters: 7,049 Letters from the Late Roman World (100-800 AD). Zenodo. doi.org/10.5281/zenodo.19142059