A plaintext version of the Lewis-Short Latin-English lexicon
821bd8fc — Peter Aronoff 3 years ago



# A Latin Dictionary (edd. Lewis & Short) as plain text


This repo contains two plain text versions of A Latin Dictionary (edd. Charlton T. Lewis and Charles Short). These files are based on the XML version that the Perseus Digital Library has made available under a Creative Commons license. The only difference between the two files here is that one replaces straight quotes with curly quotes. (The text already contains other Unicode features, so it seemed worth doing.)
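
For readers curious what a straight-to-curly quote conversion can look like, here is a minimal Python sketch. It is not the method used to produce the files in this repo; the function name `curl_quotes` and the heuristic (a quote at the start of the text, or after whitespace or an opening bracket, is treated as an opening quote, and every other quote as a closing quote or apostrophe) are assumptions made for illustration.

```python
import re


def curl_quotes(text):
    """Heuristically replace straight quotes with curly quotes.

    Assumption: a quote at the start of the text, or right after
    whitespace or an opening bracket, opens a quotation; any other
    quote closes one (or is an apostrophe).
    """
    # Opening double quotes first, then treat the rest as closing.
    text = re.sub(r'(^|[\s(\[{])"', "\\1\u201c", text)
    text = text.replace('"', "\u201d")
    # Same idea for single quotes; leftovers cover apostrophes too.
    text = re.sub(r"(^|[\s(\[{])'", "\\1\u2018", text)
    text = text.replace("'", "\u2019")
    return text


if __name__ == "__main__":
    print(curl_quotes('"amo," he said, and it\'s true'))
```

A heuristic like this will mishandle some edge cases (for example, a single quote nested directly inside a double quote), so the output of any such pass is worth spot-checking.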


This dictionary is widely available online in various ebook formats and as XML, but not, as far as I know, as plain text. Plain text is much easier to work with than those other formats. In addition, the PerseusDL XML file leaves the Greek in beta code format.

My motivation is to use this version of Lewis and Short to create tools to help myself and other teachers create glossaries for students. (As those tools develop, I'll put a link here.) However, I hope that other people find these files useful for lots of other projects. See below for license.


I used a tool provided by PerseusDL to transform the beta code in the XML file into Unicode Greek, and a small set of sed commands to transform all of the remaining character entities into Unicode. (E.g., `&amacr;` becomes ā.) Finally, a Python script extracted the content from the XML. (I've included the sed and Python scripts in case anyone wants to adapt or improve them.)
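
The last two steps can be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the scripts included in this repo: the entity table is a tiny hypothetical sample (the real sed commands cover many more entities), the helper names `replace_entities` and `extract_entries` are made up here, and the `entryFree` element name and the sample entry are assumptions about the shape of the XML.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, deliberately tiny entity table for illustration only.
ENTITIES = {
    "amacr": "\u0101",   # a with macron
    "emacr": "\u0113",   # e with macron
    "abreve": "\u0103",  # a with breve
}

ENTITY_RE = re.compile(r"&([A-Za-z]+);")


def replace_entities(text):
    """Replace known named entities; leave unknown ones (e.g. &amp;) as-is."""
    def substitute(match):
        return ENTITIES.get(match.group(1), match.group(0))
    return ENTITY_RE.sub(substitute, text)


def extract_entries(xml_text, tag="entryFree"):
    """Return the plain text of each entry element, one string per entry.

    Assumption: each dictionary entry is wrapped in an element named `tag`.
    """
    root = ET.fromstring(xml_text)
    entries = []
    for entry in root.iter(tag):
        text = "".join(entry.itertext())
        # Collapse runs of whitespace left over from the markup.
        entries.append(" ".join(text.split()))
    return entries


# Made-up sample entry, not copied from the dictionary.
SAMPLE = """<body>
<entryFree key="mater"><orth>m&amacr;ter</orth>, <gen>f.</gen> <sense>a mother</sense></entryFree>
</body>"""

if __name__ == "__main__":
    for line in extract_entries(replace_entities(SAMPLE)):
        print(line)
```

Replacing the entities before parsing matters in this sketch because `xml.etree.ElementTree` rejects named entities it does not know about.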


Text provided by Perseus Digital Library, with funding from The National Endowment for the Humanities.

Original version available for viewing and download at http://www.perseus.tufts.edu/


The two Lewis and Short text files are licensed under the CC BY-SA 3.0 license. The Python script and sed commands are licensed under the BSD 3-Clause license.