Synthical
Niklas Muennighoff
Articles
31
OLMoE: Open Mixture-of-Experts Language Models
3 days ago by Niklas Muennighoff and others at University of Washington
Computation and Language, Artificial Intelligence
KTO: Model Alignment as Prospect Theoretic Optimization
4 days ago by Kawin Ethayarajh and others at Stanford University
Machine Learning, Artificial Intelligence
A Survey on Data Selection for Language Models
2 August 2024 by Alon Albalak and others at UC Santa Barbara
Computation and Language, Machine Learning
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
26 July 2024 by Chaofan Tao and others at University of Hong Kong
Computation and Language, Artificial Intelligence
Consent in Crisis: The Rapid Decline of the AI Data Commons
24 July 2024 by Shayne Longpre and others
Computation and Language, Artificial Intelligence
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
23 July 2024 by Xingyao Wang and others at University of Illinois at Urbana-Champaign
Software Engineering, Artificial Intelligence
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
16 July 2024 by Hongjin Su and others
Computation and Language, Artificial Intelligence
RegMix: Data Mixture as Regression for Language Model Pre-training
1 July 2024 by Qian Liu and others
Computation and Language, Artificial Intelligence
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
26 June 2024 by Terry Yue Zhuo and others
Software Engineering, Artificial Intelligence
DataComp-LM: In search of the next generation of training sets for language models
20 June 2024 by Jeffrey Li and others
Machine Learning, Computation and Language
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
14 June 2024 by Holy Lovenia and others
Computation and Language
KMMLU: Measuring Massive Multitask Language Understanding in Korean
6 June 2024 by Guijin Son and others
Computation and Language
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
4 June 2024 by Kenneth Enevoldsen and others
Computation and Language, Artificial Intelligence
Lessons from the Trenches on Reproducible Evaluation of Language Models
29 May 2024 by Stella Biderman and others at University of York
Computation and Language
C-Pack: Packaged Resources To Advance General Chinese Embedding
12 May 2024 by Shitao Xiao and others at Renmin University of China
Computation and Language, Artificial Intelligence
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
23 April 2024 by Taishi Nakamura and others
Computation and Language, Artificial Intelligence
Generative Representational Instruction Tuning
17 April 2024 by Niklas Muennighoff and others at University of Hong Kong
Computation and Language, Artificial Intelligence
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
10 April 2024 by Bo Peng and others
Computation and Language, Artificial Intelligence
Language models scale reliably with over-training and on downstream tasks
13 March 2024 by Samir Yitzhak Gadre and others at University of Washington
Computation and Language, Machine Learning
StarCoder 2 and The Stack v2: The Next Generation
29 February 2024 by Anton Lozhkov and others at Princeton University
Software Engineering, Artificial Intelligence
OLMo: Accelerating the Science of Language Models
28 February 2024 by Dirk Groeneveld and others at University of Washington
Computation and Language
OctoPack: Instruction Tuning Code Large Language Models
18 February 2024 by Niklas Muennighoff and others
Computation and Language, Artificial Intelligence
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
12 February 2024 by Ahmet Üstün and others
Computation and Language
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
9 February 2024 by Shivalika Singh and others at IT University of Copenhagen
Computation and Language, Artificial Intelligence
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
31 January 2024 by Luca Soldaini and others
Computation and Language
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
1 January 2024 by Terry Yue Zhuo and others at Monash University
Computation and Language, Artificial Intelligence
StarCoder: may the source be with you!
13 December 2023 by Raymond Li and others at Carnegie Mellon University
Computation and Language, Artificial Intelligence
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
4 November 2023 by Shayne Longpre and others at MIT
Computation and Language, Artificial Intelligence
Scaling Data-Constrained Language Models
26 October 2023 by Niklas Muennighoff and others
Computation and Language, Artificial Intelligence
Topics
Computation and Language
Artificial Intelligence
Machine Learning
Software Engineering
Information Retrieval
Programming Languages
Computers and Society
Computer Vision and Pattern Recognition