Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning

$42.30

List Price: ~~$50.00~~

Save: $7.70 (15%)

Last update: 2025-06-02 19:59:50.186

Description

This monograph introduces value-based approaches to the policy evaluation problem in online reinforcement learning (RL), where the goal is to learn the value function of a given policy under a single Markov decision process (MDP). The approaches differ according to whether they are implemented on-policy or off-policy. In the on-policy setting, where the policy is evaluated using data generated by that same policy, the monograph covers classical techniques such as TD(0), TD(λ), and their extensions with function approximation or variance reduction. For off-policy evaluation, where samples are collected under a different behavior policy, it introduces gradient-based two-timescale algorithms such as GTD2, TDC, and variance-reduced TDC, which are designed to minimize the mean-squared projected Bellman error (MSPBE). The monograph also discusses the finite-sample convergence upper bounds and sample complexity of these methods.
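
To make the on-policy case concrete, here is a minimal sketch of TD(0) with linear function approximation on a toy three-state Markov chain. It is an illustration only and is not taken from the monograph: the function name `td0_linear`, the chain `P`, the rewards `r`, the features `phi`, and the step size and discount values are all assumptions chosen for the example.

```python
import numpy as np

def td0_linear(P, r, phi, gamma=0.5, alpha=0.02, steps=50_000, seed=0):
    """On-policy TD(0) with a linear value estimate V(s) ~ phi[s] @ theta.

    P   : (n, n) transition matrix induced by the policy being evaluated
    r   : (n,) reward received when leaving each state
    phi : (n, d) feature matrix, one row per state
    All argument names and defaults are hypothetical, chosen for this sketch.
    """
    rng = np.random.default_rng(seed)
    n, d = phi.shape
    theta = np.zeros(d)
    s = 0  # arbitrary start state
    for _ in range(steps):
        # Sample the next state by following the evaluated policy's chain.
        s_next = rng.choice(n, p=P[s])
        # Temporal-difference error for the observed transition.
        td_error = r[s] + gamma * (phi[s_next] @ theta) - phi[s] @ theta
        # Semi-gradient update along the features of the current state.
        theta += alpha * td_error * phi[s]
        s = s_next
    return theta

# Toy chain (assumed for illustration): reward 1 only when leaving state 2.
P = np.array([[0.50, 0.50, 0.00],
              [0.25, 0.50, 0.25],
              [0.00, 0.50, 0.50]])
r = np.array([0.0, 0.0, 1.0])
phi = np.eye(3)  # one-hot features, i.e. the tabular special case

theta = td0_linear(P, r, phi)

# Exact values from the Bellman equation V = r + gamma * P V, for comparison.
v_true = np.linalg.solve(np.eye(3) - 0.5 * P, r)
print("TD(0) estimate:", theta)
print("exact values:  ", v_true)
```

With a constant step size the estimate keeps fluctuating around the exact solution rather than settling on it, which is one reason to care about step-size schedules, variance reduction, and finite-sample bounds. The off-policy algorithms named above (GTD2, TDC) additionally maintain a second parameter vector updated on a different timescale, which this sketch does not cover.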

Product Details

Brand: Now Publishers
Pub Date: Aug 15, 2024
ISBN-10: 1638283702
ISBN-13: 9781638283706
Language: English
Dimensions: 9.21 in × 0.12 in × 6.14 in
Weight: 0 lb
