Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Simple
Original
Articles by
Max Ryabinin
Multilingual Language Model Pretraining using Machine-translated Data
18 February 2025 by
Jiayi Wang
and
others
Computation and Language
Towards Best Practices for Open Datasets for LLM Training
14 January 2025 by
Stefan Baack
and
others
Computers and Society
,
Artificial Intelligence
Label Privacy in Split Learning for Large Models with Parameter-Efficient Training
21 December 2024 by
Philip Zmushko
and
others
Machine Learning
,
Cryptography and Security
INTELLECT-1 Technical Report
2 December 2024 by
Sami Jaghouar
and
others
Distributed, Parallel, and Cluster Computing
RedPajama: an Open Dataset for Training Large Language Models
19 November 2024 by
Maurice Weber
and
others
Computation and Language
,
Machine Learning
Multilingual Pretraining Using a Large Corpus Machine-Translated from a Single Source Language
31 October 2024 by
Jiayi Wang
and
others
Computation and Language
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
4 June 2024 by
Ruslan Svirschevski
and
others
Computation and Language
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
8 April 2024 by
Giwon Hong
and
others
Computation and Language
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
29 February 2024 by
Zhuoming Chen
and
others
at
Carnegie Mellon University
Computation and Language
Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements
22 January 2024 by
Anton Voronov
and
others
at
Moscow Institute of Physics and Technology
Computation and Language
Distributed Inference and Fine-tuning of Large Language Models Over The Internet
13 December 2023 by
Alexander Borzunov
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics
1 November 2023 by
Anton Voronov
and
others
at
Higher School of Economics
Computer Vision and Pattern Recognition
,
Machine Learning
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy
13 October 2023 by
Anton Baryshnikov
and
Max Ryabinin
at
Higher School of Economics
Computer Vision and Pattern Recognition
,
Computation and Language
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
29 June 2023 by
Max Ryabinin
and
others
Distributed, Parallel, and Cluster Computing
,
Machine Learning
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
12 June 2023 by
Ying Sheng
and
others
at
Higher School of Economics
Machine Learning
,
Artificial Intelligence
Petals: Collaborative Inference and Fine-tuning of Large Models
2 March 2023 by
Alexander Borzunov
and
others
at
Higher School of Economics
Machine Learning
,
Distributed, Parallel, and Cluster Computing
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
11 December 2022 by
Bigscience Workshop
and
others
Computation and Language
RuCoLA: Russian Corpus of Linguistic Acceptability
23 October 2022 by
Vladislav Mikhailov
and
others
at
Higher School of Economics
Computation and Language
Training Transformers Together
7 July 2022 by
Alexander Borzunov
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees
22 May 2022 by
Aleksandr Beznosikov
and
others
Machine Learning
It's All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense Reasoning
30 November 2021 by
Alexey Tikhonov
and
Max Ryabinin
Computation and Language
,
Machine Learning
Distributed Deep Learning in Open Collaborations
8 November 2021 by
Michael Diskin
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices
8 November 2021 by
Max Ryabinin
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Secure Distributed Training at Scale
7 October 2021 by
Eduard Gorbunov
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets
14 May 2021 by
Max Ryabinin
and
others
Machine Learning
,
Artificial Intelligence
Embedding Words in Non-Vector Space with Unsupervised Graph Learning
6 October 2020 by
Max Ryabinin
and
others
Computation and Language
,
Machine Learning
Long-term operation of a multi-channel cosmic muon system based on scintillation counters with MRS APD light readout
19 September 2008 by
A. Akindinov
and
others
Instrumentation and Detectors
START as the detector of choice for large-scale muon triggering systems
24 May 2006 by
A. Akindinov
and
others
Instrumentation and Detectors
Scintillation counter with MRS APD light readout
27 March 2006 by
A. Akindinov
and
others
Instrumentation and Detectors