grant

Discovery Projects - Grant ID: DP240103278 [ 2024-01-01 - 2026-12-31 ]

Research Grant

[Cite as https://purl.org/au-research/grants/arc/DP240103278]

Researchers: M. Ehsan Abbasnejad (Chief Investigator) ,  Prof Javen Qinfeng Shi (Chief Investigator)

Brief description Learning to Reason in Reinforcement Learning. Deep Reinforcement Learning (RL) uses deep neural networks to represent and learn optimal decision-making policies for intelligent agents in complex environments. However, most RL approaches require millions of episodes to converge to good policies, making it difficult for RL to be applied in real-world scenarios taking significant resources. This project aims to equip RL with capabilities such as counterfactual reasoning and outcome anticipation to significantly reduce the number of interactions required, improve generalisation, and provide the agent with the capability to consider the cause-effects. These improvements would narrow the gap between AI and human capabilities and broaden the adoption of RL in real-world applications.

Funding Amount $544,551

Funding Scheme Discovery Projects

View this grant in the ARC Data Portal

Click to explore relationships graph
Identifiers
Viewed: [[ro.stat.viewed]]