Framework of Cart Pole w/ Reinforcement Learning

I am working on a side project that models the inverted pendulum (cart-pole) problem and solves it with a reinforcement learning algorithm, specifically Q-learning. I have already built a simple MDP solver for a grid world, which was straightforward.

However, after days of scouring research papers, I am still struggling to work out how to set up the cart-pole problem. Nothing I have found explains how to build a framework for representing it.

When modelling the problem, can a standard Markov Decision Process be used? Or must it be a POMDP?

What is represented in each state (i.e. what state information is passed to the agent)? The cart's position and velocity, the angle of the pendulum, etc.?

What actions can the agent take? Is it a continuous range of velocities in the +x or -x direction?
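To make the question concrete, here is a minimal sketch of the framing I am currently guessing at, so please correct it where it is wrong. It assumes the state is fully observable as the 4-tuple (cart position, cart velocity, pole angle, pole angular velocity), that each variable is bucketed into a few bins so a Q-table can be used, and that there are only two discrete actions (push left, push right). The bin edges, the names `BINS`, `discretize`, `choose_action` and `q_update`, and the learning parameters are all made up for illustration.

```python
import random
from collections import defaultdict

# Hypothetical bin edges for each state variable (illustrative values only).
BINS = {
    "x": [-0.8, 0.8],
    "x_dot": [-0.5, 0.5],
    "theta": [-0.1, 0.0, 0.1],
    "theta_dot": [-0.87, 0.87],
}
ACTIONS = (0, 1)  # assumed action set: 0 = push cart left, 1 = push cart right


def bucket(value, edges):
    """Return the index of the bin that value falls into."""
    return sum(value > e for e in edges)


def discretize(x, x_dot, theta, theta_dot):
    """Map the continuous 4-tuple to a discrete state usable as a table key."""
    return (
        bucket(x, BINS["x"]),
        bucket(x_dot, BINS["x_dot"]),
        bucket(theta, BINS["theta"]),
        bucket(theta_dot, BINS["theta_dot"]),
    )


def choose_action(q, state, epsilon=0.1):
    """Epsilon-greedy action selection over the Q-table."""
    if random.random() < epsilon:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """Standard tabular Q-learning update for one observed transition."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error


# One hand-written transition, just to exercise the functions.
q = defaultdict(float)
s = discretize(0.0, 0.0, 0.05, 0.0)
s_next = discretize(0.0, 0.1, -0.04, -0.2)
q_update(q, s, 1, 1.0, s_next)
```

The physics simulation that produces the next state and the reward is deliberately left out, since that is part of what I am unsure how to structure.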

Advice on this is greatly appreciated.
