Articles by Brendan O'Donoghue
Probabilistic Inference in Reinforcement Learning Done Right
22 November 2023 by Jean Tarbouriech and others at Google
Machine Learning, Artificial Intelligence
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization
4 June 2023 by Brendan O'Donoghue
Machine Learning, Artificial Intelligence
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
6 March 2023 by Ted Moskovitz and others
Machine Learning
Optimistic Meta-Gradients
9 January 2023 by Sebastian Flennerhag and others
Machine Learning, Artificial Intelligence
POMRL: No-Regret Learning-to-Plan with Increasing Horizons
30 December 2022 by Khimya Khetarpal and others
Artificial Intelligence, Machine Learning
On the connection between Bregman divergence and value in regularized Markov decision processes
25 October 2022 by Brendan O'Donoghue
Machine Learning, Artificial Intelligence
The Neural Testbed: Evaluating Predictive Distributions
1 February 2022 by Ian Osband and others
Machine Learning, Artificial Intelligence
Practical Large-Scale Linear Programming using Primal-Dual Hybrid Gradient
7 January 2022 by David Applegate and others
Optimization and Control
Discovering Diverse Nearly Optimal Policies with Successor Features
4 January 2022 by Tom Zahavy and others
Artificial Intelligence, Machine Learning
Discovering a set of policies for the worst case reward
10 December 2021 by Tom Zahavy and others
Artificial Intelligence, Machine Learning
Variational Bayesian Optimistic Sampling
29 October 2021 by Brendan O'Donoghue and Tor Lattimore
Machine Learning
Reward is enough for convex MDPs
27 October 2021 by Tom Zahavy and others
Artificial Intelligence, Machine Learning
Solving Mixed Integer Programs Using Neural Networks
23 July 2021 by Vinod Nair and others
Optimization and Control, Artificial Intelligence
Matrix games with bandit feedback
12 June 2021 by Brendan O'Donoghue and others
Machine Learning, Computation
Sample Efficient Reinforcement Learning with REINFORCE
24 December 2020 by Junzi Zhang and others
Machine Learning, Optimization and Control
Operator splitting for a homogeneous embedding of the linear complementarity problem
14 September 2020 by Brendan O'Donoghue
Optimization and Control
Making Sense of Reinforcement Learning and Probabilistic Inference
14 February 2020 by Brendan O'Donoghue and others
Machine Learning, Artificial Intelligence
Hamiltonian descent for composite objectives
17 November 2019 by Brendan O'Donoghue and Chris Maddison
Optimization and Control
Variational Bayesian Reinforcement Learning with Regret Bounds
1 July 2019 by Brendan O'Donoghue
Machine Learning, Artificial Intelligence
Verification of Non-Linear Specifications for Neural Networks
25 February 2019 by Chongli Qin and others
Machine Learning
Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles
22 November 2018 by Edward Grefenstette and others
Neural and Evolutionary Computing, Cryptography and Security
Hamiltonian Descent Methods
13 September 2018 by Chris Maddison and others
Optimization and Control, Machine Learning
Globally Convergent Type-I Anderson Acceleration for Non-Smooth Fixed-Point Iterations
12 August 2018 by Junzi Zhang and others
Optimization and Control
Adversarial Risk and the Dangers of Evaluating Against Weak Attacks
12 June 2018 by Jonathan Uesato and others
Machine Learning, Cryptography and Security
The Uncertainty Bellman Equation and Exploration
8 June 2018 by Brendan O'Donoghue and others
Artificial Intelligence, Machine Learning
Training verified learners with learned verifiers
29 May 2018 by Krishnamurthy Dvijotham and others
Machine Learning
PGQ: Combining policy gradient and Q-learning
6 March 2017 by Brendan O'Donoghue and others
Machine Learning, Artificial Intelligence
Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding
25 July 2016 by Brendan O'Donoghue and others
Optimization and Control
Large-Scale Convex Optimization for Dense Wireless Cooperative Networks
2 June 2015 by Yuanming Shi and others
Information Theory, Optimization and Control
Adaptive Restart for Accelerated Gradient Schemes
18 April 2012 by Brendan O'Donoghue and Emmanuel Candès
Optimization and Control