BroadSem | Induction of Broad-Coverage Semantic Parsers

Summary

Over the last two decades, language technology has achieved a number of important successes, for example, producing functional machine translation systems and beating humans at quiz games. The key bottleneck preventing further progress in these and many other natural language processing (NLP) applications (e.g., text summarization, information retrieval, opinion mining, dialog and tutoring systems) is the lack of accurate methods for producing meaning representations of texts. Accurately predicting such meaning representations in an open domain with an automatic parser is a challenging and unsolved problem, primarily because of language variability and ambiguity. Current performance is unsatisfactory because existing methods rely on supervised learning (learning from annotated resources), and the amount of annotation required for accurate open-domain parsing exceeds what is practically feasible. Moreover, the representations defined in these resources typically do not provide abstractions suitable for reasoning.

In this project, we will induce semantic representations from large amounts of unannotated data (i.e., text that has not been labeled by humans), guided by information contained in human-annotated data and other forms of linguistic knowledge. This will allow us to scale our approach to many domains and across languages. We will specialize meaning representations for reasoning by modeling relations (e.g., facts) appearing across sentences in texts (document-level modeling), across different texts, and across texts and knowledge bases. Learning to predict this linked data is closely related to learning to reason, including learning the notions of semantic equivalence and entailment. We will jointly induce semantic parsers (e.g., log-linear feature-rich models) and reasoning models (latent factor models) from this data, thus ensuring that the semantic representations are informative for applications requiring reasoning.

More information & hyperlinks

Web resources:	https://cordis.europa.eu/project/id/678254
Start date:	01-05-2016
End date:	30-04-2022
Total budget - Public funding:	1 457 185,00 Euro - 1 457 185,00 Euro

Cordis data

Status:	CLOSED
Url:	https://cordis.europa.eu/project/id/678254
Call topic:	ERC-StG-2015
Update Date:	27-04-2024
