Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Claim page
Ian Osband
Follow
Activity
Upvotes
Folders
Articles
33
OpenAI o1 System Card
21 December 2024 by
Openai
and
others
Artificial Intelligence
GPT-4o System Card
25 October 2024 by
Openai
and
others
Computation and Language
,
Artificial Intelligence
Fine-Tuning Language Models via Epistemic Neural Networks
10 May 2023 by
Ian Osband
and
others
Computation and Language
,
Artificial Intelligence
Approximate Thompson Sampling via Epistemic Neural Networks
18 February 2023 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Robustness of Epinets against Distributional Shifts
1 July 2022 by
Xiuyuan Lu
and
others
Machine Learning
Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
8 June 2022 by
Vikranth Dwaracherla
and
others
Machine Learning
,
Artificial Intelligence
Epistemic Neural Networks
13 April 2022 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Evaluating High-Order Predictive Distributions in Deep Learning
28 February 2022 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
From Predictions to Decisions: The Importance of Joint Predictive Distributions
1 February 2022 by
Zheng Wen
and
others
Machine Learning
The Neural Testbed: Evaluating Predictive Distributions
1 February 2022 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Matrix games with bandit feedback
12 June 2021 by
Brendan O'Donoghue
and
others
Machine Learning
,
Computation
Reinforcement Learning, Bit by Bit
14 March 2021 by
Xiuyuan Lu
and
others
Machine Learning
,
Artificial Intelligence
Hypermodels for Exploration
12 June 2020 by
Vikranth Dwaracherla
and
others
Machine Learning
,
Optimization and Control
Making Sense of Reinforcement Learning and Probabilistic Inference
14 February 2020 by
Brendan O'Donoghue
and
others
Machine Learning
,
Artificial Intelligence
Behaviour Suite for Reinforcement Learning
13 August 2019 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Meta-learning of Sequential Strategies
18 July 2019 by
Pedro Ortega
and
others
Machine Learning
,
Artificial Intelligence
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
16 December 2018 by
Maria Dimakopoulou
and
others
Machine Learning
,
Artificial Intelligence
Randomized Prior Functions for Deep Reinforcement Learning
15 November 2018 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
The Uncertainty Bellman Equation and Exploration
8 June 2018 by
Brendan O'Donoghue
and
others
Artificial Intelligence
,
Machine Learning
Deep Exploration via Randomized Value Functions
5 June 2018 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Noisy Networks for Exploration
15 February 2018 by
Meire Fortunato
and
others
Machine Learning
Gaussian-Dirichlet Posterior Dominance in Sequential Learning
1 February 2018 by
Ian Osband
and
Benjamin Van Roy
Machine Learning
,
Probability
A Tutorial on Thompson Sampling
19 November 2017 by
Daniel Russo
and
others
at
Google
Machine Learning
Learning from Demonstrations for Real World Reinforcement Learning
18 July 2017 by
Todd Hester
and
others
Artificial Intelligence
,
Machine Learning
Minimax Regret Bounds for Reinforcement Learning
1 July 2017 by
Mohammad Gheshlaghi Azar
and
others
Machine Learning
,
Artificial Intelligence
On Optimistic versus Randomized Exploration in Reinforcement Learning
13 June 2017 by
Ian Osband
and
Benjamin Van Roy
Machine Learning
On Lower Bounds for Regret in Reinforcement Learning
9 August 2016 by
Ian Osband
and
Benjamin Van Roy
Machine Learning
Posterior Sampling for Reinforcement Learning Without Episodes
9 August 2016 by
Ian Osband
and
Benjamin Van Roy
Machine Learning
Why is Posterior Sampling Better than Optimism for Reinforcement Learning
22 July 2016 by
Ian Osband
and
Benjamin Van Roy
Machine Learning
,
Artificial Intelligence
Deep Exploration via Bootstrapped DQN
1 July 2016 by
Ian Osband
and
others
Machine Learning
,
Artificial Intelligence
Load more
This is an AI-generated summary
Key points
Topics
Machine Learning
Artificial Intelligence
Optimization and Control
Systems and Control
Computation and Language
Computation
Probability