Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

PyBrain

Compare

  Analyzed about 2 months ago

PyBrain is a modular Machine Learning Library for Python. It's goal is to offer flexible, easy-to-use yet still powerful algorithms for Machine Learning Tasks and a variety of predefined environments to test and compare your algorithms. PyBrain is short for Python-Based Reinforcement Learning ... [More] , Artificial Intelligence and Neural Network Library. It's the Swiss army knife for machine learning and neural networking. [Less]

36K lines of code

0 current contributors

about 1 year since last commit

6 users on Open Hub

Activity Not Available
5.0
 
I Use This

libpgrl

Compare

  Analyzed about 1 year ago

LibPGThe PG library was intended to be a high-performance policy-gradient reinforcement learning library. Since the first version it has been extended to a number of value based RL algorithms, so the name is only historical. It is now a general RL library which implements, for example, natural actor ... [More] critic, and least squares policy iteration. It has been designed with large distributed RL systems in mind. It's not perfect, but it is pretty fast. API documentation and examples are provided. What libpg does NOT provide is model based planning algorithms such as value iteration, or real-time dynamic programming, or exact policy gradient. There is limited support for belief state tracking in the simulators/Cassandra/ directory (named because we use the POMDP file format created by Anthony Cassandra). One day I'd like to extend it to these situations, but that will require some uptake of the library. Project goalsProvide easy to use implementations of state-of-the-art RL algorithms for the non RL savvy, allowing immediate application to difficult industry problems. High performance, especially on multi-agent RL problems Extensible plug'n'play algorithms for research purposes Main algorithms implementedRL algorithms: SARSA QLearning Vanilla Policy-Gradient (online or batch, including line search) Natural actor-critic Least Squares TD-Q(\lambda) (for LSPI) Least Squares Policy Iteration Misc supporting algorithms: Line searches for batch mode Tikhonov Regularisation HMM based POMDP state estimation Finite history transformation [Less]

0 lines of code

0 current contributors

over 9 years since last commit

0 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: MPL-1.1