Bestand:On the Dangers of Stochastic Parrots Can Language Models Be Too Big.pdf

Beschrijving

Beschrijving	English: The past 3 years of work in NLP have been characterized by the development and deployment of ever larger language models, especially for English. BERT, its variants, GPT-2/3, and others, most recently Switch-C, have pushed the boundaries of the possible both through architectural innovations and through sheer size. Using these pretrained models and the methodology of fine-tuning them for specific tasks, researchers have extended the state of the art on a wide array of tasks as measured by leaderboards on specific benchmarks for English. In this paper, we take a step back and ask: How big is too big? What are the possible risks associated with this technology and what paths are available for mitigating those risks? We provide recommendations including weighing the environmental and financial costs first, investing resources into curating and carefully documenting datasets rather than ingesting everything on the web, carrying out pre-development exercises evaluating how the planned approach fits into research and development goals and supports stakeholder values, and encouraging research directions beyond ever larger language models.
Datum	1 maart 2021
Bron	https://dl.acm.org/doi/abs/10.1145/3442188.3445922 https://doi.org/10.1145/3442188
Auteur	Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret Shmitchell

De gebruiker mag:

Onder de volgende voorwaarden:

naamsvermelding – U moet op een gepaste manier aan naamsvermelding doen, een link naar de licentie geven, en aangeven of er wijzigingen in het werk zijn aangebracht. U mag dit op elke redelijke manier doen, maar niet zodanig dat de indruk wordt gewekt dat de licentiegever instemt met uw werk of uw gebruik van zijn werk.

Korte naam	On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? "1F99C
Omschrijving afbeelding	- Computing methodologies -> Natural language processing.
Auteur	Emily M. Bender; Timnit Gebru; Angelina McMillan-Major; Shmargaret Shmitchell
Id	doi:10.1145/3442188.3445922
Gebruikte software	LaTeX with acmart 2020/11/15 v1.75 Typesetting articles for the Association for Computing Machinery and hyperref 2020-05-15 v7.00e Hypertext links for LaTeX
Conversieprogramma	LuaHBTeX, Version 1.12.0 (TeX Live 2020); modified using iText® 7.1.11-SNAPSHOT ©2000-2020 iText Group NV (AGPL-version)
Versleuteld	no
Papierformaat	612 x 792 pts (letter)
Versie van pdfopmaak	1.6