For a reinforcement learning project on a small gridworld example with many simulated episodes, I want to do the following in R:
- 0a. As a burn-in, I simulate 1000 episodes with a random strategy.
- 0b. For each possible action, I train a model on the already simulated runs, with the state as my features.
- 1. I simulate a new episode, choosing the actions epsilon-greedily with the current state as the features.
- 2. I evaluate the model so far by handing out a reward based on its performance.
- 3. With the newly learned episode I update the models and then, for the next episode, start again at step 1, until I have enough episodes overall and the performance no longer improves significantly (a sketch of this loop follows the list).
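A minimal sketch of that loop, where `simulate_episode()`, `fit_model()` and `update_models()` are hypothetical placeholders for the gridworld simulator and the per-action models (the action names, epsilon and episode counts are assumptions too):

```r
# Sketch of the training loop described above. simulate_episode(),
# fit_model() and update_models() are hypothetical helpers standing in
# for the gridworld simulator and the per-action models.
actions      <- c("up", "down", "left", "right")  # assumed gridworld actions
n_burnin     <- 1000
epsilon      <- 0.1
max_episodes <- 5000

# 0a. burn-in: episodes generated with a purely random policy
burnin_data <- do.call(rbind, lapply(seq_len(n_burnin), function(i)
  simulate_episode(policy = "random")))

# 0b. one model per possible action: expected reward ~ state
models <- lapply(actions, function(a)
  fit_model(burnin_data[burnin_data$action == a, ]))
names(models) <- actions

for (ep in seq_len(max_episodes)) {
  # 1. simulate a new episode, choosing actions epsilon-greedily
  new_data <- simulate_episode(policy = "epsilon_greedy",
                               models = models, epsilon = epsilon)
  # 2./3. the episode's observed rewards are used to update each
  # action's model before the next episode starts
  models <- update_models(models, new_data)
  # (stopping rule on overall performance omitted here)
}
```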
For this I first thought about using a decision tree to model expected reward ~ state, but a tree has no "update" concept, so after each step the model would have to be retrained completely from scratch.
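To illustrate that point with `rpart` (the data frame names are placeholders): the only way to incorporate new observations is a full refit on all data accumulated so far.

```r
library(rpart)

# all_data / new_episode_data are placeholder data frames containing
# the state features plus a reward column
tree <- rpart(reward ~ ., data = all_data)

# a new episode arrives: rpart cannot update the existing tree, so the
# whole model is refit on the enlarged data set
all_data <- rbind(all_data, new_episode_data)
tree     <- rpart(reward ~ ., data = all_data)
```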
Because neural networks are trained sequentially, observation by observation, with stochastic gradient descent, I now want to train a neural network on the burn-in observations and then evaluate and update it after each episode (each episode can create multiple observations per action, so multiple update steps may have to be performed).
The question now is whether anyone knows an R package that can fit a neural net and then update it with one or more new observations.
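To make the kind of interface I am after concrete, here is a minimal sketch with the `keras` package: a compiled Keras model keeps its weights between calls, so calling `fit()` again continues training from the current state. The network size, the two-column state encoding and the dummy data are assumptions for a small gridworld.

```r
library(keras)

# dummy data standing in for the burn-in observations: states as a
# two-column matrix (x/y coordinates), one expected-reward target
burnin_states  <- matrix(runif(2000), ncol = 2)
burnin_rewards <- runif(1000)

# one small network per action: expected reward ~ state
model <- keras_model_sequential() %>%
  layer_dense(units = 16, activation = "relu", input_shape = 2) %>%
  layer_dense(units = 1)

model %>% compile(loss = "mse", optimizer = "sgd")

# initial fit on the burn-in data
model %>% fit(burnin_states, burnin_rewards, epochs = 20, verbose = 0)

# after a new episode: call fit() again on the new observations only;
# Keras continues from the current weights, i.e. an incremental update
new_states  <- matrix(runif(10), ncol = 2)
new_rewards <- runif(5)
model %>% fit(new_states, new_rewards, epochs = 1, verbose = 0)
```

As far as I know, `nnet::nnet()` also accepts a `Wts` argument with initial weights, so a refit could at least be warm-started from the previous parameters if installing a full deep learning backend is too heavy.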
I once wrote a neural net myself in R, but it is not the fastest implementation, and I am pretty sure faster and better ones exist.
Thanks a lot for your help!