Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Claim page
Colin Raffel
Follow
Activity
Upvotes
Folders
Articles
72
AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution
22 November 2024 by
Fengyuan Liu
and
others
Machine Learning
Realistic Evaluation of Model Merging for Compositional Generalization
26 September 2024 by
Derek Tam
and
others
Machine Learning
,
Computation and Language
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
13 August 2024 by
Prateek Yadav
and
others
Machine Learning
,
Artificial Intelligence
A Survey on Data Selection for Language Models
2 August 2024 by
Alon Albalak
and
others
at
UC Santa Barbara
Computation and Language
,
Machine Learning
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
25 June 2024 by
Guilherme Penedo
and
others
Computation and Language
Soft Merging of Experts with Adaptive Routing
13 May 2024 by
Mohammed Muqeeth
and
others
Machine Learning
Merging by Matching Models in Task Parameter Subspaces
13 April 2024 by
Derek Tam
and
others
at
University of North Carolina at Chapel Hill
Machine Learning
,
Computation and Language
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
8 April 2024 by
Bowen Pan
and
others
Machine Learning
,
Artificial Intelligence
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
16 February 2024 by
Ajay Patel
and
others
at
University of Pennsylvania
Computation and Language
,
Machine Learning
Learning to Route Among Specialized Experts for Zero-Shot Generalization
8 February 2024 by
Mohammed Muqeeth
and
others
Machine Learning
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
2 January 2024 by
Haikang Deng
and
Colin Raffel
Computation and Language
Distributed Inference and Fine-tuning of Large Language Models Over The Internet
13 December 2023 by
Alexander Borzunov
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Efficient Online Data Mixing For Language Model Pre-Training
5 December 2023 by
Alon Albalak
and
others
at
UC Santa Barbara
Computation and Language
,
Machine Learning
Evaluating the Factual Consistency of Large Language Models Through News Summarization
2 December 2023 by
Derek Tam
and
others
Computation and Language
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
22 November 2023 by
Prateek Yadav
and
others
Machine Learning
,
Artificial Intelligence
TIES-Merging: Resolving Interference When Merging Models
27 October 2023 by
Prateek Yadav
and
others
at
MIT
Machine Learning
,
Artificial Intelligence
Scaling Data-Constrained Language Models
26 October 2023 by
Niklas Muennighoff
and
others
Computation and Language
,
Artificial Intelligence
Knowledge is a Region in Weight Space for Fine-tuned Language Models
12 October 2023 by
Almog Gueta
and
others
Machine Learning
,
Artificial Intelligence
NPEFF: Non-Negative Per-Example Fisher Factorization
7 October 2023 by
Michael Matena
and
Colin Raffel
Machine Learning
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
3 October 2023 by
Alon Albalak
and
others
Machine Learning
,
Computation and Language
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
19 September 2023 by
Colin Raffel
and
others
at
Google
Machine Learning
,
Computation and Language
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
13 September 2023 by
Shachar Don-Yehiya
and
others
at
Hebrew University of Jerusalem
Machine Learning
,
Computation and Language
Large Language Models Struggle to Learn Long-Tail Knowledge
27 July 2023 by
Nikhil Kandpal
and
others
Computation and Language
,
Machine Learning
Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models
7 June 2023 by
Nikhil Kandpal
and
others
Machine Learning
,
Software Engineering
Crosslingual Generalization through Multitask Finetuning
29 May 2023 by
Niklas Muennighoff
and
others
Computation and Language
,
Artificial Intelligence
Efficient Methods for Natural Language Processing: A Survey
24 March 2023 by
Marcos Treviso
and
others
Computation and Language
Petals: Collaborative Inference and Fine-tuning of Large Models
2 March 2023 by
Alexander Borzunov
and
others
at
University of Washington
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Bidirectional Language Models Are Also Few-shot Learners
6 February 2023 by
Ajay Patel
and
others
Machine Learning
,
Computation and Language
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
11 December 2022 by
Bigscience Workshop
and
others
Computation and Language
What Language Model to Train if You Have One Million GPU Hours?
8 November 2022 by
Teven Scao
and
others
Computation and Language
,
Artificial Intelligence
Load more
This is an AI-generated summary
Key points
Topics
Machine Learning
Computation and Language
Artificial Intelligence
Computer Vision and Pattern Recognition
Audio and Speech Processing
Sound
Distributed, Parallel, and Cluster Computing
Cryptography and Security
Software Engineering
Computers and Society
Symbolic Computation
Mathematical Software
Neural and Evolutionary Computing