Sugawara
|
6daa9cb515
|
add GPTNeoXForSequenceClassification (#22671)
* add GPTNeoXForSequenceClassification
* move the labels to logits.device (ref: #22561)
* fix
|
2023-04-10 11:52:23 -04:00 |
|
Seb0
|
074490b2c2
|
fix(docs): fix task guide links in model docs (#22226)
fix(docs): task guide links in model docs
|
2023-03-17 14:30:17 +00:00 |
|
Maria Khalusova
|
78a53d59cb
|
Adding task guides to resources (#21704)
* added resources: links to task guides that support these models
* minor polishing
* conflict resolved
* link fix
* Update docs/source/en/model_doc/vision-encoder-decoder.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2023-02-21 10:35:11 -05:00 |
|
Jason Phang
|
71e602725b
|
[WIP] Adding GPT-NeoX-20B (#16659)
* initial
* first try
* working 20B
* 20B tokenizers
* Docs
* Import fixes for missing classes
* Update docs, fixup
* black formatting
* isort
* flake
* dummy objects
* documentation
* Documentation yml
* more docs
* tweaks for tests
* tokenization auto
* fix neox tests
* test
* test
* einsum
* address PR feedback
* Documentation
* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/gpt_neox/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/gpt_neox/configuration_gpt_neox.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Remove undefined LaTeX syntax
* Update to full url to avoid confusion about if that's supposed to refer to the Hub
* fix auto
* move tests
* documentation fix
* more doc fixes
* test refactor
* fix import
* fix import
* fix import
* fix import
* fix import
* style fixes
* More modeling fixes
Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu>
Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2022-05-24 09:31:10 -04:00 |
|