Maximum a posteriori policy optimisation
Authors: Abbas Abdolmaleki, Jost Tobias Springenberg, Nicolas Heess, Yuval Tassa, Remi Munos
We introduce a new algorithm for reinforcement learning...
Learning navigation without building maps
We depart from the traditional approaches which rely on explicit mapping and exploration (like a cartographer who tries to...
Reading Time: 2 minutes
Spamming is the act of sending unsolicited messages via electronic messaging systems. Unsolicited or unwanted mails not only consume your...
The modern healthcare system is necessarily a set of dependent and independent institutions. Because of its mission-critical nature, the system’s evolution involves a...
Solving real problems in infrastructure, finance and e-commerce
It’s exactly two years since we published “Avoiding the pointless blockchain project”, a checklist of questions...