Rati Devidze
(რატი დევიძე*)
I am a Reinforcement Learning & Machine Learning Engineer at minds.ai focused on applying Reinforcement Learning (RL) and Preference Learning to semiconductor fabrication (FAB) optimization.
I completed my PhD in the Machine Teaching group at the Max Planck Institute for Software Systems (MPI-SWS) in Saarbrücken, Germany. I was also a member of the Saarbrücken Graduate School of Computer Science. My doctoral research was supervised by Prof. Adish Singla.
My work centers on developing principled learning frameworks for complex, stochastic, and highly constrained environments. In particular, I study reward design, aiming to construct informative and interpretable signals that guide agent behavior toward optimal decisions. These approaches enable efficient learning in settings where explicit reward specification is challenging, while aligning agent behavior with domain-specific objectives such as throughput maximization, cycle time reduction, and resource efficiency.
More broadly, my research interests include Reinforcement Learning and its subfields, such as reward design, inverse reinforcement learning, and meta-reinforcement learning, as well as preference learning for sequential decision-making.
For more information about me, please see my [C.V.].
news
| Date | News |
|---|---|
| March 2025 | I successfully defended my PhD. |
| June 2024 | I joined minds.ai. |
| May 2024 | I submitted my PhD thesis: Reward Design for Reinforcement Learning Agents. |
| Jan 19, 2024 | Our paper Ethics in Action: Training Reinforcement Learning Agent for Moral Decision-making In Text-based Adventure Games was accepted to AISTATS'24. |
| Dec 15, 2023 | Our paper Informativeness of Reward Functions in Reinforcement Learning was accepted to AAMAS'24. |
| Sep 14, 2022 | Our paper Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards was accepted to NeurIPS'22. |
| Sep 28, 2021 | Our paper Explicable Reward Design for Reinforcement Learning Agents was accepted to NeurIPS'21. |
Campus E1.5, Room 336
Max Planck Institute for Software Systems
66123, Saarbrücken