WFL is presenting a poster on how to browse the resource and what kind of research questions can the webapp help with. These are some passages from the DH2017 Book of Abstracts:
Word Formation Latin (WFL) is a derivational morphology resource for Classical Latin, where lemmas are analysed into their formative components, and relationships between them are established on the basis of Word Formation Rules (WFRs). For example amo (to love) and amator (lover) are connected with a relationship that describes a change from a verb to a noun through the addition of a suffix (-a-tor) that in itself bears semantic information (in this case it characterises agentive and instrumental nouns, i.e. someone or something performing an action).
The lexical basis used for the resource comprises the whole 69,682 lemmas featured in the morphological analyser for Latin LEMLAT 3.0 (http://www.lemlat3.eu/).
The word formation lexicon is built in two steps:
- Word formation rules (WFRs) are detected using a mixture of previous literature on Latin derivational morphology (Jenks, 1911; Fruyt, 2011; Oniga, 1988) and semi-automatic procedures (Passarotti and Mambrini 2012).
- WFRs are applied to lexical data: lemmas and WFRs are paired using a MySQL relational database, and a number of MySQL queries provide the candidate lemmas for each WFR. Input and output pairs are then checked manually, in order to clear out false friends and duplicate results due to homography.
This poster will describe the resource, and, more specifically, it will illustrate the web application that is being developed to easily access the data.
The WFL dataset is both integral part of Lemlat (https://github.com/CIRCSE/LEMLAT3), and used in a standalone web application (http://wfl.marginalia.it). The database will be made available for download, so that extensive queries can be run and the data can be used and reused at will. The web application is intuitive and user-friendly. It supports those scholars and students that are not familiar with database querying languages such as SQL, but also Classicists with specific scientific questions.
The lexicon can be browsed either by WFR, affix, input and output Part-of-Speech (PoS) or lemma. Drop-down menus provide the available options for each selection, such as the list of affixes and lemmas. Results are visualised as lists of lemmas and tree graphs, whose nodes are lemmas and edges are WFRs. Trees are interactive. Clicking on a node shows the full derivational tree (“word formation cluster”) for the lemma reported in that node. For example, figure 1 shows the word formation cluster for the lemma computo, ‘to calculate’. Clicking on an edge shows the lemmas built by the WFR described by that edge.
Methodological motivations will be given for each browsing option together with suggestions for potential uses of the web to investigate Latin derivational processes.
Figure 1. Word formation cluster for computo, ‘to calculate’.
Four browsing choices can help the scholar with an array of linguistic investigations.
- By WFR – opens research questions on a specific word formation behaviour; for example, it is possible to view and download a list of all verbs that derive from a noun with a conversive derivation process (e.g. radix ‘root’ -> radicor ‘to grow roots’).
- By Affix – acts similarly as above, but works more specifically on affixal behaviour: for example, it is possible to see all agentive nouns in -tor and verify how many correspond to a female equivalent in -trix.
- By PoS – useful for studies on macro-categories, such as nominalisation or verbalisation.
- By Lemma – useful when studying the productivity of one specific morphological family (like the one for bellum above) or a group of morphological families.
These explorations lead in many directions through investigations on derivational production and semantics (Can semantic identification of outputs help to show which WFRs are more morphotactically transparent? Which inputs produce a certain kind of outputs? Etc.).
Forcellini, A. “Lexicon totius latinitatis ab Aegidio Forcellini seminarii Patavini alumno lucubratum, deinde a Iosepho Furlanetto eiusdem seminarii alumno emendatum et auctum, nunc vero curantibus Francisco Corradini et Iosepho Perin seminarii Patavini item alumnis emendatius et auctius melioremque in formam redactum.” Tom. I AG, Patavii, Typis Seminarii (1940).
Georges, Karl Ernst. Ausführliches lateinisch-deutsches und deutsch-lateinisches Handwörterbuch... Vol. 2. Hahn’sche Verlags-buchhandlung, 1880.
Glare, Peter GW. Oxford latin dictionary. Clarendon Press. Oxford University Press, 1982.
Gradenwitz, Otto. Laterculi vocum Latinarum: voces Latinas et a fronte et a tergo ordinandas. S. Hirzel, 1904.
Jenks, Paul Rockwell. A manual of Latin word formation for secondary schools. DC Heath & Company, 1911.
Fruyt, Michèle. “Word‐Formation in Classical Latin.” A Companion to the Latin Language (2011): 157-175.
Oniga, Renato. I composti nominali latini: una morfologia generativa. Vol. 29. Pàtron, 1988.
Passarotti, Marco Carlo. “Development and perspectives of the Latin morphological analyser LEMLAT.” Linguistica computazionale 20, no. A (2004): 397-414.
Passarotti, Marco, and Francesco Mambrini. “First Steps towards the Semi-automatic Development of a
Wordformation-based Lexicon of Latin.” In Eighth International Conference on Language Resources and Evaluation, (LREC 2012), pp. 852-859. European Language Resources Association (ELRA), 2012.
This is the poster presented at DH2017: