OpenAI: Hindsight Experience Replay — RL technique for sparse-reward learning efficiency | SignalBreak | SignalBreak