MultiConvAI | Enabling Multilingual Conversational AI

Summary
In recent years, Conversational Artificial Intelligence (AI) has made major advances, thanks to the availability of big data and increasingly powerful deep learning. Task-based statistical dialogue systems (SDSs) are now viable, embedded in popular commercial applications (e.g. Apple’s Siri, Amazon’s Echo, Google’s Assistant) and cost-effective in many scenarios (e.g. customer support, call centre service, searching, booking). Yet current SDSs are only available for a handful of resource-rich languages, leaving the majority of the world’s languages and their speakers behind. Our project will develop the first prototype system for scaling conversational AI to multiple languages. It will be based on a new methodology that learns multilingual word representations (i.e. word embeddings, WEs) without the need for expensive training data, using a process called semantic specialisation that complements WEs with common-sense and linguistic knowledge from external knowledge graphs. Building on our promising pilot studies, we will develop Natural Language Understanding (NLU) modules for SDSs via 1) more effective semantic specialisation based on joint multi-source multi-target training; and 2) a focus on typologically diverse languages. We foresee a pioneering use of selective sharing and structural adaptation for obtaining WEs, with optimisation for the target languages guided by typological knowledge. The best resulting technology will be integrated in a demo prototype system which users and industries can deploy to generate multilingual NLU input for more widely portable SDSs. Since we also plan to explore the possibility of forming a start-up company, we will use the system to demonstrate its potential to our network of industry contacts and potential customers. On a larger scale, extending the multilingual scope of SDSs can have major socioeconomic benefits: it can broaden the global reach of conversational AI and enhance its commercial viability.
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/957356
Start date: 01-01-2021
End date: 30-06-2022
Total budget - Public funding: - 150 000,00 Euro
Cordis data

Status: CLOSED
Call topic: ERC-2020-POC
Update date: 27-04-2024
Structured mapping
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2020
ERC-2020-PoC