Reinforcement Learning - [Q_learning on multiple use case environment set up: Traffic Light Control System & Trading agent)

Defining and Solving RL Environments

Worked on Traffic Light Control Scenario for both Stochastic and Deterministic environment using Q learning; Q-learning function: Q(s,a)←Q(s,a)+α[r+γargmaxQ(s′,a′)−Q(s,a)] This was updated eg previously the agent allowed only 1 car to pass at a time, which isn't effective in real world, the controller has been updated to allow all cars from opposite poles to pass at a time, this better models the real world.

Then further applied another Tabular learning method of Monte Carlo(MC) Control Method, using the EveryVist MC Method Monte-Carlo Learnin function Q(s,a)←Q(s,a)+α[Gt−Q(s,a)]

Also an agent was trained on the NVDA stock prices within a period and the agent traded with additional details and directions In evaluating the agent, I had to introduce some randomness especially where the agent goes for greedy actions as against the agent always repeating same start day and policy More interesting, the nvda stock agent actually performed better with this bit of randomness

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
fall25_rl_assignment1_part3_nvda.ipynb		fall25_rl_assignment1_part3_nvda.ipynb
part1_and_part2_traffic_light_control.ipynb		part1_and_part2_traffic_light_control.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reinforcement Learning - [Q_learning on multiple use case environment set up: Traffic Light Control System & Trading agent)

About

Uh oh!

Releases

Packages

Languages

Mach-A/RL_use_case

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning - [Q_learning on multiple use case environment set up: Traffic Light Control System & Trading agent)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages