STEP-RL | Specializing TEmporal Planning using Reinforcement Learning

Summary
"Planning - devising a strategy to achieve a desired objective - is one of the basic forms of intelligence. Temporal planning studies the automated synthesis of strategies when time and temporal constraints matter. Temporal planning is one of the most strategic fields of Artificial Intelligence, with applications in autonomous robotics, logistics, flexible production, and many other fields.

Historically, the research on temporal planning follows a general-purpose framework: a generic engine searches for the strategy by reasoning on the problem statement (i.e. the starting condition and the desired objective), as well as on a formal model of the domain (i.e. the possible actions). Despite substantial progress in the recent years, domain-independent temporal planning still suffers from scalability issues, and fails to deal with real-word problems. The alternative is to devise ad-hoc, domain-specific solutions that, although efficient, are costly to develop, rigid to maintain, and often inapplicable in non-nominal situations.

STEP-RL will study the foundations of a new approach to Temporal Planning, that is domain-independent and efficient at the same time. The idea is to adopt a framework based on Reinforcement Learning, where a domain-independent temporal planner is specialized with respect to the domain at hand. STEP-RL continuously improves its ability to solve temporal planning problems by learning from experience, thus becoming increasingly efficient by means of self-adaptation.

STEP-RL will advance the state of the art in temporal planning beyond the ""efficiency vs flexibility"" dilemma, that I had to personally face in the many industrial projects I worked on."
Results, demos, etc. Show all and search (0)
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101115870
Start date: 01-01-2024
End date: 31-12-2028
Total budget - Public funding: 1 493 750,00 Euro - 1 493 750,00 Euro
Cordis data

Original description

"Planning - devising a strategy to achieve a desired objective - is one of the basic forms of intelligence. Temporal planning studies the automated synthesis of strategies when time and temporal constraints matter. Temporal planning is one of the most strategic fields of Artificial Intelligence, with applications in autonomous robotics, logistics, flexible production, and many other fields.

Historically, the research on temporal planning follows a general-purpose framework: a generic engine searches for the strategy by reasoning on the problem statement (i.e. the starting condition and the desired objective), as well as on a formal model of the domain (i.e. the possible actions). Despite substantial progress in the recent years, domain-independent temporal planning still suffers from scalability issues, and fails to deal with real-word problems. The alternative is to devise ad-hoc, domain-specific solutions that, although efficient, are costly to develop, rigid to maintain, and often inapplicable in non-nominal situations.

STEP-RL will study the foundations of a new approach to Temporal Planning, that is domain-independent and efficient at the same time. The idea is to adopt a framework based on Reinforcement Learning, where a domain-independent temporal planner is specialized with respect to the domain at hand. STEP-RL continuously improves its ability to solve temporal planning problems by learning from experience, thus becoming increasingly efficient by means of self-adaptation.

STEP-RL will advance the state of the art in temporal planning beyond the ""efficiency vs flexibility"" dilemma, that I had to personally face in the many industrial projects I worked on."

Status

SIGNED

Call topic

ERC-2023-STG

Update Date

12-03-2024
Images
No images available.
Geographical location(s)