SALSA | efficient Syntactic Analysis for Large-scale Sentiment Analysis

Summary
One of the key aspects of any successful business is knowing how customers feel about its brand and products. For this purpose, sentiment analysis or opinion mining tools could be paramount in helping companies to succeed. However, the current state of the art sentiment analysis solutions (both commercial or academic) present important drawbacks including low accuracy, low performance (response time around 100-1000 ms), high computational cost and/or high price (around €500/month on average) relegating these solutions to consolidated big brands, social listening agencies or consulting firms offering social listening services.

SALSA aims to democratize the analysis and transformation of internet/social data into knowledge creation for decision-makers, making large-scale sentiment analysis technology viable for small entities without massive computational power. SALSA will explore the potential of the powerful models and algorithms developed within ERC Starting Grant FASTPARSE to create the first AI-based syntax-guided sentiment analysis engine which is: a) accurate, due to using syntactic information to infer the opinions contained in each sentence from its structure and the relationship between its words, rather than shallow methods that consider words in isolation and b) cost-effective, due to employing fast parsers that have a throughput in the order of 1000 sentences per second on consumer-grade hardware, and that can work without time- and memory-hungry large language models.

SALSA will follow an open-source software business model in which we will explore several sources of revenue, most based on service-level agreement. This will highly contribute to the competitiveness of the EU technological market by reducing their dependency on the oligopoly of technological giants (mostly American and Chinese) that currently have a dominant position in language technologies, largely thanks to their enormous computational resources.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101100615
Start date: 01-02-2023
End date: 31-07-2024
Total budget - Public funding: - 150 000,00 Euro
Cordis data

Original description

One of the key aspects of any successful business is knowing how customers feel about its brand and products. For this purpose, sentiment analysis or opinion mining tools could be paramount in helping companies to succeed. However, the current state of the art sentiment analysis solutions (both commercial or academic) present important drawbacks including low accuracy, low performance (response time around 100-1000 ms), high computational cost and/or high price (around €500/month on average) relegating these solutions to consolidated big brands, social listening agencies or consulting firms offering social listening services.

SALSA aims to democratize the analysis and transformation of internet/social data into knowledge creation for decision-makers, making large-scale sentiment analysis technology viable for small entities without massive computational power. SALSA will explore the potential of the powerful models and algorithms developed within ERC Starting Grant FASTPARSE to create the first AI-based syntax-guided sentiment analysis engine which is: a) accurate, due to using syntactic information to infer the opinions contained in each sentence from its structure and the relationship between its words, rather than shallow methods that consider words in isolation and b) cost-effective, due to employing fast parsers that have a throughput in the order of 1000 sentences per second on consumer-grade hardware, and that can work without time- and memory-hungry large language models.

SALSA will follow an open-source software business model in which we will explore several sources of revenue, most based on service-level agreement. This will highly contribute to the competitiveness of the EU technological market by reducing their dependency on the oligopoly of technological giants (mostly American and Chinese) that currently have a dominant position in language technologies, largely thanks to their enormous computational resources.

Status

SIGNED

Call topic

ERC-2022-POC2

Update Date

09-02-2023
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon Europe
HORIZON.1 Excellent Science
HORIZON.1.1 European Research Council (ERC)
HORIZON.1.1.0 Cross-cutting call topics
ERC-2022-POC2 ERC PROOF OF CONCEPT GRANTS2
HORIZON.1.1.1 Frontier science
ERC-2022-POC2 ERC PROOF OF CONCEPT GRANTS2