Files in this item

FilesDescriptionFormat

application/pdf

application/pdfYAN-THESIS-2020.pdf (536kB)
(no description provided)PDF

Description

Title:Bregman Augmented Lagrangian Method: Convergence, acceleration, and applications in reinforcement learning
Author(s):Yan, Shen
Advisor(s):He, Niao
Department / Program:Industrial&Enterprise Sys Eng
Discipline:Industrial Engineering
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:M.S.
Genre:Thesis
Subject(s):Bregman Augmented Lagrangian Method (BALM)
Bregman Proximal Point Method (BPP)
Acceleration (acc-BPP, acc-BALM)
Reinforcement Learning
Abstract:In this thesis, the algorithm Bergman proximal point method (BPP), and its application to Bregman augmented Lagrangian method(BALM) is considered. Unlike classical augmented Lagrangian method (ALM ), whose convergence rate and its relation with the proximal point method is well-understood, the convergence rate for BALM has not yet been thoroughly studied in the literature. We analyze, in this thesis, the convergence rates of BALM in terms of the primal objective as well as the feasibility violation. We show that the algorithm can also be applied to variational inequality problems with convex constraints, and fully characterize the iteration complexity of the algorithm derived from the inexact version of BALM. Furthermore, we develop, for the first time, an accelerated Bregman proximal point method, that improves the convergence rate from $\cO(1/\sum_{k=0}^{T-1}\eta_k)$ to $\cO(1/(\sum_{k=0}^{T-1}\sqrt{\eta_k})^2)$, where $\{\eta_k\}_{k=0}^{T-1}$ is the sequence of proximal parameters. When applied to the dual of convex constrained convex programs, this leads to the construction of an accelerated BALM, that achieves the improved rates for both primal and dual convergences. Finally, numerical experiments comparing the performance of different Bregman divergences as well as the acceleration versions, with applications to Markov decision problems/reinforcement learning are presented at the end.
Issue Date:2020-12-07
Type:Thesis
URI:http://hdl.handle.net/2142/109433
Rights Information:Copyright 2020 Shen Yan
Date Available in IDEALS:2021-03-05
Date Deposited:2020-12


This item appears in the following Collection(s)

Item Statistics