ViDaR | ViDaR: R-enabled large-scale data analytics in ViDa

Summary
In this project we will build ViDaR, an interface for integrating R with ViDa. R is among the leading data analytics environments (the leading open-source), and is heavily used by data and domain scientists and data analysts in their daily routine. ViDa, developed by an ERC grant, is the state-of-the-art query engine for raw data. It relies on data virtualization, i.e., abstracting data out of its form and manipulating it regardless of the way it is stored or structured, to enable efficient, scalable, querying and manipulation of data in-situ, at their raw format and shape. Integration of ViDa with R will have a positive impact on both systems. For ViDa, it will provide capabilities for data exploration, visualization, mining and analytics, as well as powerful libraries for numerical and statistical computing, thereby substantially growing its user base. For R, it will increase its scale and performance, and reduce the time and effort spent by data scientists on tedious data management tasks. The resulting solution will serve as a proof-of-concept of ViDa’s performance, capabilities, and flexibility for integration with any third-party software that needs to manage vast amounts of raw data.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/768910
Start date: 01-01-2018
End date: 30-06-2019
Total budget - Public funding: 150 000,00 Euro - 150 000,00 Euro
Cordis data

Original description

In this project we will build ViDaR, an interface for integrating R with ViDa. R is among the leading data analytics environments (the leading open-source), and is heavily used by data and domain scientists and data analysts in their daily routine. ViDa, developed by an ERC grant, is the state-of-the-art query engine for raw data. It relies on data virtualization, i.e., abstracting data out of its form and manipulating it regardless of the way it is stored or structured, to enable efficient, scalable, querying and manipulation of data in-situ, at their raw format and shape. Integration of ViDa with R will have a positive impact on both systems. For ViDa, it will provide capabilities for data exploration, visualization, mining and analytics, as well as powerful libraries for numerical and statistical computing, thereby substantially growing its user base. For R, it will increase its scale and performance, and reduce the time and effort spent by data scientists on tedious data management tasks. The resulting solution will serve as a proof-of-concept of ViDa’s performance, capabilities, and flexibility for integration with any third-party software that needs to manage vast amounts of raw data.

Status

CLOSED

Call topic

ERC-2017-PoC

Update Date

27-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.1. EXCELLENT SCIENCE - European Research Council (ERC)
ERC-2017
ERC-2017-PoC