Call for applications: internship/position on LMF-TEI

Short-term research internship/position — Comparing and Improving Lexical Representation Standards

Inria is looking for a highly motivated young researcher (PhD holder or PhD student) to provide an in-depth analysis of ISO 24613 (Lexical Markup Framework, LMF) and its current application in both the Text Encoding Initiative (TEI) guidelines and the W3C OntoLex initiative.

The researcher’s background and skills should include:

– Research interest in lexical information (from a lexicographic, corpus linguistic or computational linguistic viewpoint). The researcher may work on their own data.

– Understanding of data modeling in XML and/or OWL, with basic skills in XSLT. Experience with XML schema languages is a plus.

The core activity of the short research stay will be to examine how well the LMF meta-model is reflected in the TEI guidelines and the current OntoLex specification, in order to create a customisation of the TEI guidelines that has at least the same coverage as the OntoLex specification. The work will include gathering lexical samples that could serve as a proof of concept for this customisation.

Salary: may range from 1100€ to 2100€ (after deductions) depending on status and experience

Duration: 5-month employment, starting as soon as possible

Location: Berlin (Germany), with the work contract established in France. Depending on the applicant's current location and constraints, the precise organisation of work can be agreed separately.

Contact: applications, comprising a research CV and a motivation letter, should be sent to laurent.romary@inria.fr

Background reading:

Laurent Romary. TEI and LMF crosswalks. Stefan Gradmann and Felix Sasaki. Digital Humanities: Wissenschaft vom Verstehen, Humboldt Universität zu Berlin, to appear. http://hal.inria.fr/hal-00762664

Laurent Romary, Werner Wegstein. Consistent modelling of heterogeneous lexical structures. Journal of the Text Encoding Initiative, TEI Consortium, 2012. http://hal.inria.fr/hal-00704511

Lothar Lemnitzer, Laurent Romary, Andreas Witt. Representing human and machine dictionaries in Markup languages. Ulrich Heid. Dictionaries. An International Encyclopedia of Lexicography. Supplementary volume: Recent developments with special focus on computational lexicography, Mouton de Gruyter, 2014. http://hal.inria.fr/inria-00441215

John McCrae, Dennis Spohr, Philipp Cimiano. Linking Lexical Resources and Ontologies on the Semantic Web with Lemon. The Semantic Web: Research and Applications, Lecture Notes in Computer Science, volume 6643, 2011, pp. 245-259