transformers/docs/source/en
Jason Phang 71e602725b
[WIP] Adding GPT-NeoX-20B (#16659)
* initial

* first try

* working 20B

* 20B tokenizers

* Docs

* Import fixes for missing classes

* Update docs, fixup

* black formatting

* isort

* flake

* dummy objects

* documentation

* Documentation yml

* more docs

* tweaks for tests

* tokenization auto

* fix neox tests

* test

* test

* einsum

* address PR feedback

* Documentation

* Update README.md

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/__init__.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/configuration_gpt_neox.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove undefined LaTeX syntax

* Update to full url to avoid confusion about if that's supposed to refer to the Hub

* fix auto

* move tests

* documentation fix

* more doc fixes

* test refactor

* fix import

* fix import

* fix import

* fix import

* fix import

* style fixes

* More modeling fixes

Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu>
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-24 09:31:10 -04:00
..
internal [WIP] Enable reproducibility for distributed trainings (#16907) 2022-05-11 09:37:13 -04:00
main_classes Use Accelerate in from_pretrained for big model inference (#17341) 2022-05-23 14:32:21 -04:00
model_doc [WIP] Adding GPT-NeoX-20B (#16659) 2022-05-24 09:31:10 -04:00
tasks Update audio examples with MInDS-14 (#16633) 2022-04-08 15:55:42 -05:00
_config.py Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
_toctree.yml [WIP] Adding GPT-NeoX-20B (#16659) 2022-05-24 09:31:10 -04:00
accelerate.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
add_new_model.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
add_new_pipeline.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
autoclass_tutorial.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
benchmarks.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
bertology.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
big_models.mdx Make Trainer compatible with sharded checkpoints (#17053) 2022-05-03 09:55:10 -04:00
community.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
contributing.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
converting_tensorflow_models.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
create_a_model.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
custom_models.mdx Update custom_models.mdx (#16964) 2022-04-27 16:46:55 +02:00
debugging.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
fast_tokenizers.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
glossary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
index.mdx [WIP] Adding GPT-NeoX-20B (#16659) 2022-05-24 09:31:10 -04:00
installation.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
migration.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
model_sharing.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
model_summary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
multilingual.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
notebooks.md Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
pad_truncation.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
perf_hardware.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perf_train_gpu_many.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perf_train_gpu_one.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
performance.mdx [WIP] [doc] performance/scalability revamp (#15723) 2022-05-16 13:36:41 +02:00
perplexity.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
philosophy.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
pipeline_tutorial.mdx docs(transformers): fix typo (#17263) 2022-05-16 17:04:30 -04:00
pr_checks.mdx Add a check on config classes docstring checkpoints (#17012) 2022-04-30 10:40:46 +02:00
preprocessing.mdx Fixing the output of code examples in the preprocessing chapter (#17162) 2022-05-10 12:16:28 -04:00
quicktour.mdx Fix doc test quicktour dataset (#16929) 2022-04-25 16:26:59 +02:00
run_scripts.mdx Fix all docs for accelerate install directions (#17145) 2022-05-09 15:45:18 -04:00
sagemaker.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
serialization.mdx Add CvT (#17299) 2022-05-18 17:47:18 +02:00
task_summary.mdx [Doctests] Correct task summary (#16644) 2022-04-11 14:59:35 +02:00
testing.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
tokenizer_summary.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
training.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
troubleshooting.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00