Reinforcement Learning In A Physics-Inspired Semi-Markov Environment. (arXiv:2004.07333v1 [cs.LG])

[Submitted on 15 Apr 2020]

Abstract: Reinforcement learning (RL) has been demonstrated to have great potential in
many applications of scientific discovery and design. Recent work includes, for
example, the design of new structures and compositions of molecules for
therapeutic drugs. Much of the existing work related to the application of RL
to scientific domains, however, assumes that the available state representation
obeys the Markov property. For reasons associated with time, cost, sensor
accuracy, and gaps in scientific knowledge, many scientific design and
discovery problems do not satisfy the Markov property. Thus, something other
than a Markov decision process (MDP) should be used to plan / find the optimal
policy. In this paper, we present a physics-inspired semi-Markov RL
environment, namely the phase change environment. In addition, we evaluate the
performance of value-based RL algorithms for both MDPs and partially observable
MDPs (POMDPs) on the proposed environment. Our results demonstrate deep
recurrent Q-networks (DRQN) significantly outperform deep Q-networks (DQN), and
that DRQNs benefit from training with hindsight experience replay. Implications
for the use of semi-Markovian RL and POMDPs for scientific laboratories are
also discussed.

Submission history

From: Colin Bellinger [view email]
[v1]
Wed, 15 Apr 2020 20:43:29 UTC (1,078 KB)

Source: http://arxiv.org/abs/2004.07333

Generative Data Intelligence

Reinforcement Learning in a Physics-Inspired Semi-Markov Environment. (arXiv:2004.07333v1 [cs.LG])

Submission history

Top Altcoins Poised For Major Drop Next Week

Top Altcoins Poised For Major Drop Next Week

Latest Intelligence

Top Altcoins To Hit All-Time Highs This Week

Top Altcoins To Hit All-Time Highs This Week

Ripple’s Q1 2024 Report: XRP Ledger’s Groundbreaking Innovations and Soaring Activity

Ripple’s Q1 2024 Report: XRP Ledger’s Groundbreaking Innovations and Soaring Activity

25 Fastest Blockchain Networks Revealed, Bitcoin and Ethereum Not in Top 15: CoinGecko

Prominent Analyst Sees Dogecoin Poised for Explosive Growth