Though, as noted earlier, we will be unable to release the first edition of the Dictionary at the end of this month as originally planned, nevertheless we are continuing to make good progress on the data-collection and -entry side of things. I thought it would be fun to take stock of things as they stand, and put up a few teaser statistics.
There are currently 297 entries ready for publication: Each of these entries includes the etymological derivation of the name; brief notes about any major royalty (kings/queens; emperors/empresses), popes, or saints who bore the name; and any other relevant information concerning the linguistic development of the name, references to secondary literature, or cross-references to other application dictionary entries. 176 of the entries are masculine names, ranging from Adolf to Zdyslav, and the remaining 121 are feminine, ranging from Accorsa to Zoete.
There are 3880 citations distributed over these 297 entries, making an average of just over 13 citations per entry. Of course, the reality is much different: Many of the entries have only a single citation, and a bare handful of entries have hundreds. Such minimal data is already indicative of the larger sample being confirmative of Zipf’s Law; one exciting consequence of the Dictionary is that research concerning empirical matters such as Zipf’s Law will be much easier to undertake as a huge body of data will all be gathered in a single place.
The citations are taken from records from the Czech Republic, Germany, England, France, Italy, Scotland, Spain, and Sweden. The earliest citations are from 779, a handful of masculine names from a Carolingian charter; the latest are from 1600 and are drawn from English parish records. Approximately 1850 citations are from Latin-language records; the remainder are from various vernaculars.
I rather like nice little statistics: Maybe we’ll do updates like this the beginning of every month!