Francesco De Toni
Articles (5)
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
7 March 2023 by Hugo Laurençon and others at Leipzig
Computation and Language, Artificial Intelligence
SantaCoder: don't reach for the stars!
24 February 2023 by Loubna Ben Allal and others
Software Engineering, Artificial Intelligence
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
11 December 2022 by BigScience Workshop and others
Computation and Language
Entities, Dates, and Languages: Zero-Shot on Historical Texts with T0
11 April 2022 by Francesco De Toni and others
Computation and Language
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
25 January 2022 by Angelina McMillan-Major and others
Computation and Language, Databases
Topics
Computation and Language
Artificial Intelligence
Software Engineering
Machine Learning
Databases