Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgge [P]
Our take
![Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgge [P]](https://external-preview.redd.it/noWmQ9-3r-JtsFMmjT3hF1M_H3SrjiBqHxoaSpDykgs.jpeg?width=320&crop=smart&auto=webp&s=dc5e4bca3bed0b9704897094fd1b26ff8bdf684c)
| Code of Project: https://github.com/paulo101977/notebooks-rl/tree/main/re_requiem I’ve been working on training an agent to play a segment of Resident Evil Requiem, focusing on a fast-paced, semi-linear escape sequence with enemies and time pressure. Instead of going fully reinforcement learning from scratch, I used a hybrid approach:
The environment is based on gameplay capture, where I map controller inputs into a discretized action space. Observations are extracted directly from frames (with some preprocessing), and the agent learns to mimic and then refine behavior over time. One of the main challenges was the instability early on — especially when the agent deviates slightly from the demonstrated trajectories (classic BC issue). HG-DAgger helped a lot by correcting those off-distribution states. Another tricky part was synchronizing actions with what’s actually happening on screen, since even small timing mismatches can completely break learning in this kind of game. After training, the agent is able to:
I’m still experimenting with improving robustness and generalization (right now it’s quite specialized to this segment). Happy to share more details (training setup, preprocessing, action space, etc.) if anyone’s interested. [link] [comments] |
Read on the original site
Open the publisher's page for the full experience
Related Articles
- I Trained an AI to Beat Final Fight… Here’s What Happened [p]Hey everyone, I’ve been experimenting with Behavior Cloning on a classic arcade game (Final Fight), and I wanted to share the results and get some feedback from the community. The setup is fairly simple: I trained an agent purely from demonstrations (no reward shaping initially), then evaluated how far it could go in the first stage. I also plan to extend this with GAIL + PPO to see how much performance improves beyond imitation. A couple of interesting challenges came up: Action space remapping (MultiBinary → emulator input) Trajectory alignment issues (obs/action offset bugs 😅) LSTM policy behaving differently under evaluation vs manual rollout Managing rollouts efficiently without loading everything into memory The agent can already make some progress, but still struggles with consistency and survival. I’d love to hear thoughts on: Improving BC performance with limited trajectories Best practices for transitioning BC → PPO Handling partial observability in these environments Here’s the code if you want to see the full process and results: notebooks-rl/final_fight at main · paulo101977/notebooks-rl Any feedback is very welcome! submitted by /u/AgeOfEmpires4AOE4 [link] [comments]
- I Trained an AI to Beat Final Fight… Here’s What Happened [P]Hey everyone, I’ve been experimenting with Behavior Cloning on a classic arcade game (Final Fight), and I wanted to share the results and get some feedback from the community. The setup is fairly simple: I trained an agent purely from demonstrations (no reward shaping initially), then evaluated how far it could go in the first stage. I also plan to extend this with GAIL + PPO to see how much performance improves beyond imitation. A couple of interesting challenges came up: Action space remapping (MultiBinary → emulator input) Trajectory alignment issues (obs/action offset bugs 😅) LSTM policy behaving differently under evaluation vs manual rollout Managing rollouts efficiently without loading everything into memory The agent can already make some progress, but still struggles with consistency and survival. I’d love to hear thoughts on: Improving BC performance with limited trajectories Best practices for transitioning BC → PPO Handling partial observability in these environments Here’s the code if you want to see the full process and results: notebooks-rl/final_fight at main · paulo101977/notebooks-rl Any feedback is very welcome! submitted by /u/AgeOfEmpires4AOE4 [link] [comments]