Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Claim page
Daniele Calandriello
Follow
Activity
Upvotes
Folders
Articles
30
Building Math Agents with Multi-Turn Iterative Preference Learning
27 February 2025 by
Wei Xiong
and
others
Machine Learning
On Teacher Hacking in Language Model Distillation
4 February 2025 by
Daniil Tiapkin
and
others
Machine Learning
,
Artificial Intelligence
Multi-turn Reinforcement Learning from Preference Human Feedback
2 December 2024 by
Lior Shani
and
others
Machine Learning
Nash Learning from Human Feedback
11 June 2024 by
Rémi Munos
and
others
at
Inria Rocquencourt
Machine Learning
,
Artificial Intelligence
Demonstration-Regularized RL
10 June 2024 by
Daniil Tiapkin
and
others
Machine Learning
Offline Regularised Reinforcement Learning for Large Language Models Alignment
29 May 2024 by
Pierre Harvey Richemond
and
others
at
Google
Machine Learning
,
Artificial Intelligence
Generalized Preference Optimization: A Unified Approach to Offline Alignment
28 May 2024 by
Yunhao Tang
and
others
Machine Learning
,
Artificial Intelligence
Decoding-time Realignment of Language Models
24 May 2024 by
Tianlin Liu
and
others
Machine Learning
,
Artificial Intelligence
Understanding the performance gap between online and offline alignment algorithms
14 May 2024 by
Yunhao Tang
and
others
Machine Learning
,
Artificial Intelligence
Human Alignment of Large Language Models through Online Preference Optimisation
13 March 2024 by
Daniele Calandriello
and
others
Machine Learning
,
Artificial Intelligence
A General Theoretical Paradigm to Understand Learning from Human Preferences
1
22 November 2023 by
Mohammad Gheshlaghi Azar
and
others
at
Google
Artificial Intelligence
,
Machine Learning
Model-free Posterior Sampling via Learning Rate Randomization
27 October 2023 by
Daniil Tiapkin
and
others
Machine Learning
Fast Rates for Maximum Entropy Exploration
6 June 2023 by
Daniil Tiapkin
and
others
Machine Learning
Unlocking the Power of Representations in Long-term Novelty-based Exploration
2 May 2023 by
Alaa Saade
and
others
Machine Learning
Understanding Self-Predictive Learning for Reinforcement Learning
6 December 2022 by
Yunhao Tang
and
others
Machine Learning
,
Artificial Intelligence
ParK: Sound and Efficient Kernel Ridge Regression by Feature Space Partitions
17 October 2022 by
Luigi Carratino
and
others
at
University of Genoa
Machine Learning
Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
28 September 2022 by
Daniil Tiapkin
and
others
Machine Learning
BYOL-Explore: Exploration by Bootstrapped Prediction
16 June 2022 by
Zhaohan Daniel Guo
and
others
Machine Learning
,
Artificial Intelligence
Information-theoretic Online Memory Selection for Continual Learning
10 April 2022 by
Shengyang Sun
and
others
Machine Learning
Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times
30 January 2022 by
Daniele Calandriello
and
others
Machine Learning
On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning
29 April 2021 by
Diego Ferigo
and
others
Robotics
,
Machine Learning
Sampling from a
k
-DPP without looking at all items
30 June 2020 by
Daniele Calandriello
and
others
Machine Learning
,
Data Structures and Algorithms
Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
26 February 2020 by
Daniele Calandriello
and
others
Machine Learning
Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret
27 August 2019 by
Daniele Calandriello
and
others
Machine Learning
Statistical and Computational Trade-Offs in Kernel K-Means
27 August 2019 by
Daniele Calandriello
and
Lorenzo Rosasco
Machine Learning
,
Data Structures and Algorithms
Exact sampling of determinantal point processes with sublinear time preprocessing
8 July 2019 by
Michał Dereziński
and
others
Machine Learning
On Fast Leverage Score Sampling and Optimal Learning
24 January 2019 by
Alessandro Rudi
and
others
Machine Learning
,
Data Structures and Algorithms
Distributed Adaptive Sampling for Kernel Matrix Approximation
27 March 2018 by
Daniele Calandriello
and
others
Machine Learning
,
Data Structures and Algorithms
Second-Order Kernel Online Convex Optimization with Adaptive Sketching
15 June 2017 by
Daniele Calandriello
and
others
Machine Learning
Analysis of Kelner and Levin graph sparsification algorithm for a streaming setting
13 September 2016 by
Daniele Calandriello
and
others
Machine Learning
,
Data Structures and Algorithms
This is an AI-generated summary
Key points
Topics
Machine Learning
Artificial Intelligence
Data Structures and Algorithms
Computer Science and Game Theory
Multiagent Systems
Computation and Language
Robotics