Student research opportunities
Finding Definitions for Complex Terms
Project Code: CECS_935
This project is available at the following levels:
Masters, PhD
Please note that this project is only for higher degree (postgraduate) applicants.
Keywords:
Natural Language Processing, Text Accessibility, Text Simplification, Monolingual Text Alignment, Definition Extraction
Supervisor:
Dr Gabriela FerraroOutline:
Text Simplification is a hot topic in Natural Language Processing (NLP). The aim of Text Simplification is to make a text more accessible. In order to simplify a text several NLP techniques can be applied, for instance, text segmentation, terminology linking to definitions, lexical simplification, syntactic simplification, paraphrasing, among others. This projects deals with linking terminology to definitions in the scope of a patent document. Patents documents are very difficult to comprehend, especially for their complex terminology. Thus, means that helps to understand those terms are highly desirable.
Goals of this project
In same cases, the terms in a patent are explicitly defined in the patent document itself. The aim of this project is to align terms with their definitions in the scope of a document by applying Natural Language Processing techniques. Three main tasks are distinguish, i) terminology identification, ii) definition extraction and iii) term-definition alignment.
Requirements/Prerequisites
Good coding skills in Java.
Background Literature
Foundations of Statistical Natural Language Processing by Christopher D. Manning and Hinrich Schutze
About the state-of-the-art in Definition Extraction
About Monolingual Text Alignment



