transformers/tests
Jason Phang 71e602725b
[WIP] Adding GPT-NeoX-20B (#16659)
* initial

* first try

* working 20B

* 20B tokenizers

* Docs

* Import fixes for missing classes

* Update docs, fixup

* black formatting

* isort

* flake

* dummy objects

* documentation

* Documentation yml

* more docs

* tweaks for tests

* tokenization auto

* fix neox tests

* test

* test

* einsum

* address PR feedback

* Documentation

* Update README.md

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/__init__.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_neox/configuration_gpt_neox.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove undefined LaTeX syntax

* Update to full url to avoid confusion about if that's supposed to refer to the Hub

* fix auto

* move tests

* documentation fix

* more doc fixes

* test refactor

* fix import

* fix import

* fix import

* fix import

* fix import

* style fixes

* More modeling fixes

Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu>
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-24 09:31:10 -04:00
..
benchmark [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
deepspeed missing file (#17164) 2022-05-10 10:19:50 -07:00
extended Update self-push workflow (#17177) 2022-05-13 16:28:00 +02:00
fixtures add a warning in SpmConverter for sentencepiece's model using the byte fallback feature (#16629) 2022-04-11 11:06:10 +02:00
generation [Generation] Fix Transition probs (#17311) 2022-05-19 22:17:02 +02:00
models [WIP] Adding GPT-NeoX-20B (#16659) 2022-05-24 09:31:10 -04:00
onnx Add onnx export cuda support (#17183) 2022-05-18 17:52:13 +02:00
optimization [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
pipelines Adding batch_size test to QA pipeline. (#17330) 2022-05-19 14:28:12 -04:00
sagemaker Black preview (#17217) 2022-05-12 16:25:55 -04:00
tokenization [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
trainer [tests] fix copy-n-paste error (#17312) 2022-05-18 16:00:47 -07:00
utils Fix tests of mixed precision now that experimental is deprecated (#17300) 2022-05-17 14:14:17 +01:00
__init__.py GPU text generation: mMoved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_configuration_common.py Black preview (#17217) 2022-05-12 16:25:55 -04:00
test_feature_extraction_common.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_modeling_common.py Use Accelerate in from_pretrained for big model inference (#17341) 2022-05-23 14:32:21 -04:00
test_modeling_flax_common.py [Flax] improve large model init and loading (#16148) 2022-04-19 14:19:55 +02:00
test_modeling_tf_common.py Rewrite TensorFlow train_step and test_step (#17057) 2022-05-17 14:36:23 +01:00
test_sequence_feature_extraction_common.py Some tests misusing assertTrue for comparisons fix (#16771) 2022-04-19 14:44:08 +02:00
test_tokenization_common.py Black preview (#17217) 2022-05-12 16:25:55 -04:00