Articles by Pierre Harvey Richemond | Synthical
Pierre Harvey Richemond
Articles (5)
Scaling Instructable Agents Across Many Simulated Worlds
11 October 2024 by SIMA Team and others at Google
Robotics, Artificial Intelligence
Offline Regularised Reinforcement Learning for Large Language Models Alignment
29 May 2024 by Pierre Harvey Richemond and others at Google
Machine Learning, Artificial Intelligence
Generalized Preference Optimization: A Unified Approach to Offline Alignment
28 May 2024 by Yunhao Tang and others
Machine Learning, Artificial Intelligence
Human Alignment of Large Language Models through Online Preference Optimisation
13 March 2024 by Daniele Calandriello and others
Machine Learning, Artificial Intelligence
Understanding Self-Predictive Learning for Reinforcement Learning
6 December 2022 by Yunhao Tang and others
Machine Learning, Artificial Intelligence
Topics
Machine Learning
Artificial Intelligence
Robotics
Human-Computer Interaction