Skip to content

snehalmahajan2912/MARL-A-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

MARL-A-1

Optimal Value Function:

Value Iteration: Hostel: 19.704912026114062

Academic Building: 21.65262962191563

Canteen: 20.783236612706798

Optimal Policy:

Policy Iteration: Hostel: Eat Food

Academic Building: Attend Class

Canteen: Attend Class

  1. Both value iteration and policy iteration have provided optimal solutions, but they may slightly differ due to the convergence patterns of the algorithms.
  2. The optimal value function obtained from both methods indicates the expected cumulative rewards from each state, with Academic Building having the highest value followed by Canteen and then Hostel.
  3. The optimal policy derived from policy iteration suggests that the student should Eat Food in Hostel, Attend Class in the Academic Building, and Attend Class in the Canteen.
  4. The results indicate that attending class in the Academic Building and Canteen is more beneficial compared to staying in the Hostel, which aligns with the rewards assigned to each action and state.
  5. The differences in the optimal value function and policy obtained from the two methods can be attributed to the iterative nature of policy iteration and the direct computation approach of value iteration.
  6. In conclusion, both policy iteration and value iteration have provided effective solutions to the MDP problem, offering insights into the optimal values and policies for each state. The discrepancies in the results highlight the nuances in the convergence paths of the algorithms, but both approaches have successfully identified the best actions for the student in each location on the campus.
State Action Next State Transition Probability Reward
Hostel Attend Class Academic Building 0.5 3
Hostel Attend Class Hostel 0.5 -1
Hostel Eat Food Canteen 1.0 +1
Academic Building Attend Class Academic Building 0.7 3
Academic Building Attend Class Canteen 0.3 +1
Academic Building Eat Food Canteen 0.8 +1
Academic Building Eat Food Academic Building 0.2 3
Canteen Attend Class Academic Building 0.6 3
Canteen Attend Class Hostel 0.3 -1
Canteen Attend Class Canteen 0.1 +1
Canteen Eat Food Canteen 1.0 +1

Diagram:

WhatsApp Image 2024-08-28 at 11 58 31_d88c473d

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages