Reinforcement Learning

Rabatt

Von:

Satyanarayana S

Thayyaba Khatoon Md

N V Madhu Bindu

Beschreibung

Format auswählen

Kartonierter EinbandCHF 68.40

Kartonierter Einband

CHF 76.00

CHF68.40

-10%Sie sparen CHF 7.60

76.00

CHF68.40

Print on Demand - Exemplar wird für Sie besorgt.

Kostenlose Lieferung

Beschreibung

This book is structured into five units, offering a holistic learning experience. The journey starts with an introduction to bandit algorithms, exploring core concepts like the Upper Confidence Bound (UCB) and Probably Approximately Correct (PAC) algorithms. The next unit introduces the full Reinforcement Learning (RL) framework, going beyond bandit algorithms to consider agent-environment interactions over multiple time steps. Markov Decision Processes (MDPs) are introduced as a fundamental framework for modeling sequential decision-making tasks. The fourth unit covers Dynamic Programming methods, Temporal Difference (TD) methods, and the Bellman Optimality equation in RL. These concepts empower agents to effectively plan, learn, and optimize their actions. The final unit explores advanced RL techniques, such as Eligibility Traces, Function Approximation, Least Squares Methods, Fitted Q-learning, Deep Q-Network (DQN), and Policy Gradient algorithms.

Autorentext

Satyanarayana S. est un scientifique émérite, un professeur, un stratège en trading d'algos, un inventeur, un auteur et un chef d'entreprise né en Inde. Il est titulaire d'une maîtrise en sciences et ingénierie informatiques et d'un doctorat en algorithmes d'apprentissage automatique des graphes informatiques, obtenus respectivement à l'université Acharya Nagarjuna et à l'université Dravidian.