Bogacsovics, GergőKhandoker, Riman Faradis2026-02-122026-02-122025https://hdl.handle.net/2437/404429This thesis explores reinforcement learning in a Unity environment, moving step by step from simple energy collection to complex obstacle avoidance. It uses Unity ML‑Agents to train agents and tests how reward design affects learning. The project shows that small mistakes in rewards can cause unusual behaviors, like the agent freezing or crashing on purpose. Through iterative tuning and debugging, high‑performing agents with high success rates across all levels was achieved. Overall, the thesis showcases strong technical execution, clear documentation of challenges, and valuable insights into designing adaptive and interpretable RL training setups.54enReinforcement LearningUnity ML‑AgentsDesigning and implementing a survival agent using reinforcement learning in UnityInformatics::Computer ScienceHozzáférhető a 2022 decemberi felsőoktatási törvénymódosítás értelmében.