Research

Research Interests

I am a generalist researcher as opposed to a specialist. In short, a specialist tends to focus on a relatively narrow domain and become an expert in it. A generalist, however, broadens the domain at the expense of losing depth. While this might hinder a generalist from becoming a true expert in one domain, it allows him/her to connect and join ideas from different disciplines and act as a bridge between specialists. As a generalist, I try to be topic- and method-agnostic, but I have priorities in my research, nonetheless.

Since 2018, my advisors (Prof. Nick Street and Prof. Barrett Thomas) and I started exploring the application of Reinforcement Learning (a machine learning technique that can be used to learn how to make an optimal sequence of decisions) in Health Analytics. More specifically, we look at the dosing of a commonly used anticoagulant called warfarin.

In the first part of this work, we showed that one can achieve a better dosing protocol using Deep Q-Networks (DQN). However, the model acts as a black box and the user might not trust its recommendations. In the second phase, we focused on the maintenance dosing protocol. In the maintenance phase, we adjust the dose as a percent of change of the current dose. We implemented the work using a Policy Gradient method (Proximal Policy Optimization). We used some Action Forging techniques to reduce the number of dose changes and learn the dose change options instead of pre-selecting them. Finally, Policy Distillation and Decision Trees helped us turn the dosing protocol into a table. The final dosing protocol performs better and is explainable and individualized.

Both of these works only prescribe the dose for weekly use. However, in reality, the physician should prescribe the dose as well as the next time for blood tests and dose adjustment. Adding duration to the decision-making expands the action space dramatically. As part of my PhD dissertation, we proposed a sequential architecture in which the dose decision is made first, and the duration is decided next. This way, we reduce the action space and improve the training process.

My other areas of interest include:

Machine Learning
Health Analytics
AI in Optimization
Ethics of AI, Fairness, Interpretability, Explainability

Current Research

Anzabi Zadeh, S., Street, W., & Thomas, B. "An Explainable Deep Reinforcement Learning Model for Warfarin Maintenance Dosing Using Policy Distillation and Action Forging", 2024, DOI: 10.48550/arXiv.2404.17187.

Publications

Anzabi Zadeh, S., Street, W., & Thomas, B. "Optimizing Warfarin Dosing using Deep Reinforcement Learning." Journal of Biomedical Informatics, (in press), DOI: 10.1016/j.jbi.2022.104267.
Ashrafi, M., & Anzabi Zadeh, S. (2017). "Lifecycle risk assessment of a technological system using dynamic Bayesian networks." Quality and Reliability Engineering International, 33(8), 2497-2520, DOI: 10.1002/qre.2213.

Presentations

Anzabi Zadeh, S., Street, W., & Thomas, B. (October 2022). Optimal Dosing of Warfarin Using Policy Gradient Methods, INFORMS Annual Meeting, Indianapolis, IN.
Anzabi Zadeh, S. (March 2022). Optimizing warfarin dosing using deep reinforcement learning, Tippie Advisory Council, Iowa City, IA.
Anzabi Zadeh, S., Street, W., & Thomas, B. (October 2021). Optimal Dosing Protocol For Warfarin Using Deep Reinforcement Learning, INFORMS Annual Meeting, Anaheim, CA.
Anzabi Zadeh, S., Street, W., & Thomas, B. (November 2020). Application Of Deep Reinforcement Learning In Optimal Warfarin Dosing. INFORMS Annual Meeting, Virtual.
Anzabi Zadeh, S., Street, W., & Thomas, B. (October 2019). Optimal Warfarin Dosing Using Reinforcement Learning. INFORMS Annual Meeting, Seattle, WA.
Namdar, J., Zhao, K., Anzabi Zadeh, S., Blackhurst, J. (October 2019). Modeling and Analysis of Cascading Disruptions in Chain Network: A Cascading Simulation Model, INFORMS Annual Conference, Seattle, WA.

Software

For my dissertation, I developed a Reinforcement Learning package in Python called ReiL. It is not the best, the fastest, the most efficient, or even the right way of doing RL, but it is available for use under MIT License. You may install it from PyPI (pip install ReiL), or fork the source code from the repository here: https://research-git.uiowa.edu/sanzabizadeh/Reinforcement-Learning

Most of the code is fully type-annotated and documented, and I am dedicated to continuing to improve and support it.

Not[e]worthy!

Publications

َAnzabi Zadeh, S., & Binesh, M. (2016) "Organizational Knowledge Creation Theory, and few questions." Auto Industry, 208, 20-25 (in Farsi).
Wootton, S., & Horne, T. (2013) Strategic Thinking: A Step-by-step Approach to Strategy and Leadership (N. Dehghan, A. Moghaddar, S. Anzabi Zadeh, Trans.) Fozhan (Original Work Published 2000).
Anzabi Zadeh, S., Saniee, S., & Yekta Kooshali, A. (2005) “Resources and Their Role in Corporate Strategy”, MASAF quarterly journal (in Farsi).

Research

The Effect of Integration on Supply Chain Performance (Supervisor: Prof. S.H. Zegordi).
Solving Fuzzy Decompositions (SoT and ToS) using Electromagnetic Metaheuristic Method (Supervisor: Prof. M.H. Fazel Zarandi).
Member of MASAF Strategic Management Research Group, Amirkabir University of Technology (2004 - 05).

Page updated

Google Sites

Report abuse