Applied Reinforcement Learning • Decision Systems • From Theory to Implementation
I work on applied reinforcement learning and sequential decision-making bridging theory to reproducible implementations (PyTorch, custom environments, evaluation).
Reinforcement Learning • Decision-Making Systems • Reward Design • Safe & Robust Learning • Multi-Agent / Distributed Optimization
Rewriting my earlier work in energy-aware optimization into an RL formulation and building an implementable research prototype.
I am an applied ML researcher focused on building reinforcement learning systems that can be tested, measured, and extended. I enjoy designing environments, implementing agents, and running structured evaluations to answer research questions with reproducible code.
M.Sc. in Business Management
International Marketing & Sales
Hochschule der Wirtschaft für Management (HDWM), Germany (2023–2025)
M.Sc. in Software Engineering
University of Najaf Abad Azad, Iran (2014–2018)
B.Sc. in Hardware Engineering
University of Najaf Abad Azad, Iran (2008–2014)
My research focuses on designing reinforcement learning systems that move beyond theoretical formulation toward robust, reproducible implementations. I am particularly interested in how environment modeling, reward design, and stability mechanisms influence long-term decision performance.
I aim to develop learning-based optimization frameworks for distributed and strategic decision-making problems, with an emphasis on reliability, evaluation methodology, and practical scalability.
This paper proposes an energy-aware clustering protocol to extend network lifetime in wireless sensor networks through optimized cluster-head selection mechanisms.
This research transforms the Business Chessboard Strategy (BCS) into an AI-executable framework using Hierarchical Reinforcement Learning. By modeling strategic decisions as a Hierarchical Markov Decision Process (H-MDP), the project enables RL agents to simulate, evaluate, and optimize long-term strategies under uncertainty. A custom simulation environment (BCS-Simulator) is developed to train and analyze intelligent agents, combining deep RL methods with explainable AI tools. The goal is to bridge human strategic theory and computational decision optimization.
Built a custom 4×4 mini-chess environment and trained a DQN agent end-to-end using Python + PyTorch. The project demonstrates environment modeling, reward shaping, illegal move masking, training loops, and evaluation.
View DetailsTranslating an earlier energy optimization / clustering idea into an RL problem definition (state/action/reward), and planning reproducible experiments and baselines.
Selected practical projects focused on automation, data, and applied AI.
CLI-based tool for cleaning raw CSV sales data and generating a structured summary report. Built for fast, reusable data cleanup workflows.
python src/run.py --input data/raw_sales.csv --outdir output
A practical and intuitive introduction to reinforcement learning, guiding readers from bandit problems to Deep Q-Networks (DQN). The book emphasizes environment design, reward modeling, instability analysis, and hands-on implementation.
View on Amazon
A strategic guide to building successful businesses through structured decision-making, foresight, and chess-inspired thinking. Focused on practical frameworks for sustainable growth.
View on Amazon
Email: negarinarianfar@gmail.com
GitHub: github.com/negarinarianfar
Linkedin: linkedin.com/in/negar-arianfar