Skip to content
This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Latest commit

 

History

History

a2c

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 

A2C Examples (and Tutorial)

The examples of A2C implementations illustrate the different capabilities of the library and act as a tutorial. To understand all the different aspects, you have to read the examples in the following order:

  • Mono CPU - The simplest implementation
  • Mono CPU 2 - Splitting the agent in three to illustrate the modularity of salina, seperating the critic from the policy.
  • Multi CPUs - Making use of the RemoteAgent to execute over multiple CPUs in parallel
  • Asynchronous Evaluation - Making use of the asynchronous functionnality to evaluate policies in parallel without slowing down the learning
  • GPU - Making use of a GPU for the loss computation
  • Complete - The reference implementation of A2C used for benchmarking. It contains different policies architectures to illustrate how one can define complex policies.