SHEPHERD | Cross-layer resilience for rack-scale disaggregated memory

Summary
Data-intensive applications, such as search, social connectivity, and e-commerce, are increasingly relying on computer memory for storing and processing a large amount of data with low latency. Emerging disaggregated memory meets this growing demand by leveraging recent technology advances in high-density, byte-addressable non-volatile memory and low-latency interconnects to provide a large memory pool shared across micro-server compute nodes. With a larger memory density and capacity though, memory errors will become more frequent. Addressing such errors becomes paramount as they can cripple service availability and cause data loss. However, established hardware techniques for tolerating memory errors transparently to application software, originally developed for DRAM-based main-memory systems, become impractical at the scale of disaggregated memory due to increasing performance, energy and area overheads. This research will investigate and develop cross-layer memory resilience techniques where software and hardware work together cooperatively to prevent service unavailability and data loss due to memory errors in disaggregated memory with low overhead. It capitalizes on the existing strengths of the experienced researcher (ER) in the area of system software design for non-volatile memory and expertise of the host on memory fault tolerance and micro-server architectures. It will lead to several publications in top-quality venues and will inform forthcoming industrial efforts to develop resilient disaggregated memory micro-server architectures, thus contributing to the research output, visibility, and economy of Europe. Overall, the fellowship will give the ER the opportunity to reintegrate into a long-term academic position in Europe, after a long and fruitful research career in the academic and industrial sectors of the United States. The ER will emerge from the project with new skills and the capability to effectively lead his own research group.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101029391
Start date: 01-09-2021
End date: 30-12-2024
Total budget - Public funding: 145 941,12 Euro - 145 941,00 Euro
Cordis data

Original description

Data-intensive applications, such as search, social connectivity, and e-commerce, are increasingly relying on computer memory for storing and processing a large amount of data with low latency. Emerging disaggregated memory meets this growing demand by leveraging recent technology advances in high-density, byte-addressable non-volatile memory and low-latency interconnects to provide a large memory pool shared across micro-server compute nodes. With a larger memory density and capacity though, memory errors will become more frequent. Addressing such errors becomes paramount as they can cripple service availability and cause data loss. However, established hardware techniques for tolerating memory errors transparently to application software, originally developed for DRAM-based main-memory systems, become impractical at the scale of disaggregated memory due to increasing performance, energy and area overheads. This research will investigate and develop cross-layer memory resilience techniques where software and hardware work together cooperatively to prevent service unavailability and data loss due to memory errors in disaggregated memory with low overhead. It capitalizes on the existing strengths of the experienced researcher (ER) in the area of system software design for non-volatile memory and expertise of the host on memory fault tolerance and micro-server architectures. It will lead to several publications in top-quality venues and will inform forthcoming industrial efforts to develop resilient disaggregated memory micro-server architectures, thus contributing to the research output, visibility, and economy of Europe. Overall, the fellowship will give the ER the opportunity to reintegrate into a long-term academic position in Europe, after a long and fruitful research career in the academic and industrial sectors of the United States. The ER will emerge from the project with new skills and the capability to effectively lead his own research group.

Status

SIGNED

Call topic

MSCA-IF-2020

Update Date

28-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.3. EXCELLENT SCIENCE - Marie Skłodowska-Curie Actions (MSCA)
H2020-EU.1.3.2. Nurturing excellence by means of cross-border and cross-sector mobility
H2020-MSCA-IF-2020
MSCA-IF-2020 Individual Fellowships