Sylvain Gugger
7fc6f41d91
Add doc for add-new-model-like command ( #15433 )
2022-01-31 11:10:45 -05:00
Ogundepo Odunayo
282ae123e2
add t5 ner finetuning ( #15432 )
2022-01-31 17:03:06 +01:00
NielsRogge
d4b3e56d64
[Hotfix] Fix Swin model outputs ( #15414 )
...
* Fix Swin model outputs
* Rename pooler
2022-01-31 16:32:14 +01:00
Suraj Patil
38dfb40ae3
import torch.utils.checkpoint ( #15427 )
2022-01-31 15:51:50 +01:00
Jonatas Grosman
f624249d8b
[Robust Speech Challenge] Add missing LR parameter ( #15428 )
2022-01-31 15:50:56 +01:00
Kamal Raj
3254080d45
Update README.md ( #15430 )
...
fix typo
2022-01-31 09:48:20 -05:00
Julien Plu
aa19f478ac
Add (M)Luke model training for Token Classification in the examples ( #14880 )
...
* Add Luke training
* Fix true label tags
* Fix true label tags
* Fix true label tags
* Update the data collator for Luke
* Some training refactor for Luke
* Improve data collator for Luke
* Fix import
* Fix datasets concatenation
* Add the --max_entity_length argument for Luke models
* Remove unused code
* Fix style issues
* Fix style issues
* Move the Luke training into a separate folder
* Fix style
* Fix naming
* Fix filtering
* Fix filtering
* Fix filter
* Update some preprocessing
* Move luke to research_projects
* Checkstyle
* Address comments
* Fix style
2022-01-31 07:58:18 -05:00
François REMY
0094eba363
Fix additional DataTrainingArguments documentation ( #15408 )
...
(This is an editorial change only)
2022-01-31 07:45:11 -05:00
NielsRogge
ee5de66349
Add SegformerFeatureExtractor to Auto API ( #15410 )
2022-01-31 11:38:08 +01:00
Suraj Patil
0f69b924fb
[XGLMTokenizer] fix init and add in AutoTokenizer ( #15406 )
2022-01-30 15:35:53 +01:00
Yih-Dar
f380bf2b61
Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel ( #15298 )
...
* Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel
* overwrite test_loss_computation
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-01-29 15:08:35 +00:00
Soonhwan-Kwon
e09473a817
Add support for XLM-R XL and XXL models by modeling_xlm_roberta_xl.py ( #13727 )
...
* add xlm roberta xl
* add convert xlm xl fairseq checkpoint to pytorch
* fix init and documents for xlm-roberta-xl
* fix indention
* add test for XLM-R xl,xxl
* fix model hub name
* fix some stuff
* up
* correct init
* fix more
* fix as suggestions
* add torch_device
* fix default values of doc strings
* fix leftovers
* merge to master
* up
* correct hub names
* fix docs
* fix model
* up
* finalize
* last fix
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add copied from
* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-29 13:42:37 +01:00
Steven Liu
16d4acbfdb
Get started docs ( #15098 )
...
* clean commit of changes
* apply review feedback, make edits
* fix backticks, minor formatting
* 🖍 make fixup and minor edits
* 🖍 fix # in header
* 📝 update code sample without from_pt
* 📝 final review
2022-01-28 19:01:37 -06:00
Steven Liu
cabd6d26a2
Update model share tutorial ( #15288 )
...
* add model sharing tutorial
* 🖍 apply feedback from review
* 📝 make edits
* 🖍 fix formatting
* 📝 convert from pt checkpoint to flax
* 📝 final review
2022-01-28 18:49:26 -06:00
Sylvain Gugger
c98a6ac211
Use argument for preprocessing workers in run_summarization ( #15394 )
2022-01-28 18:34:10 -05:00
Yih-Dar
db07956740
Fix missing eps arg for LayerNorm in ElectraGeneratorPredictions ( #15332 )
...
* fix missing eps
* Same fix for ConvBertGeneratorPredictions
* Same fix for AlbertMLMHead
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-01-28 18:32:26 -05:00
Stas Bekman
297602c7f4
[deepspeed] saving checkpoint fallback when fp16 weights aren't saved ( #14948 )
...
* [deepspeed] saving checkpoint fallback when fp16 weights aren't saved
* Bump required deepspeed version to match usage when saving checkpoints
* update version
Co-authored-by: Mihai Balint <balint.mihai@gmail.com>
2022-01-28 11:05:47 -08:00
Suraj Patil
d25e25ee2b
Add XGLM models ( #14876 )
...
* add xglm
* update vocab size
* fix model name
* style and tokenizer
* typo
* no mask token
* fix pos embed compute
* fix args
* fix tokenizer
* fix positions
* fix tokenization
* style and dic fixes
* fix imports
* add fast tokenizer
* update names
* add pt tests
* fix tokenizer
* fix typo
* fix tokenizer import
* fix fast tokenizer
* fix tokenizer
* fix converter
* add tokenizer test
* update checkpoint names
* fix tokenizer tests
* fix slow tests
* add copied from comments
* rst -> mdx
* flax model
* update flax tests
* quality
* style
* doc
* update index and readme
* fix copies
* fix doc
* update toctree
* fix indent
* minor fixes
* fix config doc
* don't save embed_pos weights
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* address Sylvain's comments, few doc fixes
* fix check_repo
* align order of arguments
* fix copies
* fix labels
* remove unnecessary mapping
* fix saving tokenizer
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-28 18:55:23 +01:00
Matt
b6b79faa7e
Make links explicit ( #15395 )
...
* Make links explicit
* Removing reference to compute_metrics() since it's kind of PyTorch-specific
2022-01-28 17:31:22 +00:00
Yih-Dar
6df29ba5e6
fix wrong tokenizer checkpoint name in flax marian ( #15391 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-01-28 16:53:25 +01:00
lewtun
507601a5cf
Prepare deprecated ONNX exporter for torch v1.11 ( #15388 )
...
* Prepare deprecated ONNX exporter for PyTorch v1.11
* Add deprecation warning
2022-01-28 16:32:47 +01:00
Ngo Quang Huy
4996922b6d
[docs] fix wrong file name in `pr_check` ( #15380 )
2022-01-28 07:52:01 -05:00
Ngo Quang Huy
8f5d62fdb1
Fix `bad_words_ids` not working with sentencepiece-based tokenizers ( #15343 )
...
* Fix `bad_words_ids` not working with sentencepiece-based tokenizers
* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-28 12:39:55 +01:00
Nicolas Patry
06107541d3
Fixing support `batch_size` and `num_return_Sequences` in `text-generation` pipeline ( #15318 )
...
* Fixing support `batch_size` and `num_return_Sequences` in `text-generation` pipeline
And `text2text-generation` too.
The bug was caused by the batch_size containing both the incoming batch **and** the generated `num_sequences`.
The fix simply consists of splitting both of these again into different dimensions.
* TF support.
* Odd backward compatibility script in the way.
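The splitting described above can be illustrated with a minimal, library-free sketch. It is not the pipeline's actual code: the helper name `split_generated_batch` and the assumed ordering (all sequences for the first input, then all for the second, and so on) are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def split_generated_batch(
    flat_sequences: Sequence[Sequence[int]],
    num_return_sequences: int,
) -> List[List[List[int]]]:
    """Regroup a flat batch of generated sequences per input.

    Assumption of this sketch: `flat_sequences` has length
    in_batch * num_return_sequences and is ordered input by input.
    """
    if num_return_sequences <= 0:
        raise ValueError("num_return_sequences must be positive")
    if len(flat_sequences) % num_return_sequences != 0:
        raise ValueError("flat batch size is not a multiple of num_return_sequences")
    # Walk the flat batch in strides of num_return_sequences so that each
    # chunk holds the candidates generated for one input.
    return [
        [list(seq) for seq in flat_sequences[start : start + num_return_sequences]]
        for start in range(0, len(flat_sequences), num_return_sequences)
    ]


# Two inputs, two generated sequences each, flattened into one dimension.
flat = [[1, 2], [1, 3], [4, 5], [4, 6]]
grouped = split_generated_batch(flat, num_return_sequences=2)
```

Here `grouped` has one entry per input, each holding that input's generated sequences, which is the "different dimensions" layout the commit message refers to.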
2022-01-28 12:15:30 +01:00
Yanming Wang
c4d1fd77fa
Set syncfree AdamW as the default optimizer for xla:gpu device in amp mode ( #15361 )
...
* Use syncfree AdamW for xla:gpu device by default
* Make syncfree AdamW optional
2022-01-27 20:05:31 -05:00
Lysandre Debut
2e4559fa37
Add init to BORT ( #15378 )
...
* Add init to BORT
* BORT should be in init
2022-01-27 15:16:54 -05:00
Steven Liu
f5db6ce76a
Fix code format for Accelerate doc ( #15335 )
...
* 🖍 fix code syntax to external libraries and replace image
* 🔄 revert code formatting, replace image with code block
* 🖍 apply feedback
2022-01-27 13:49:04 -06:00
Sylvain Gugger
0b07230409
Allow relative imports in dynamic code ( #15352 )
...
* Allow dynamic modules to use relative imports
* Add tests
* Add one last test
* Changes
2022-01-27 14:47:59 -05:00
dependabot[bot]
628b59e51d
Bump numpy from 1.19.2 to 1.21.0 in /examples/research_projects/lxmert ( #15369 )
...
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0)
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-27 14:46:15 -05:00
dependabot[bot]
ca0848b2ff
Bump notebook in /examples/research_projects/visual_bert ( #15368 )
...
Bumps [notebook](http://jupyter.org) from 6.1.5 to 6.4.1.
---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2022-01-27 14:45:58 -05:00
dependabot[bot]
7d45a2e81c
Bump numpy in /examples/research_projects/visual_bert ( #15367 )
...
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0)
---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-27 14:45:18 -05:00
Sylvain Gugger
a81fd35524
Fix tests_fetcher ( #15376 )
2022-01-27 14:17:48 -05:00
Lysandre
eab338104d
Docs for version v4.16.0
2022-01-27 13:11:51 -05:00
Lysandre
f87db5e412
Release: v4.16.0
2022-01-27 13:06:33 -05:00
Matt
c43749289d
Example script for PushToHubCallback ( #15375 )
...
* Example script for PushToHubCallback
* Expanding description slightly
2022-01-27 16:16:24 +00:00
Sylvain Gugger
8f6454bfac
Add proper documentation for Keras callbacks ( #15374 )
...
* Add proper documentation for Keras callbacks
* Add dummies
2022-01-27 10:51:38 -05:00
Matt
2de90beeeb
Super-small fix stops us confusing Keras console logging by modifying its logs ( #15373 )
2022-01-27 15:43:43 +00:00
Sylvain Gugger
fa6dce250f
Implement fixes for TrainingArguments doc ( #15370 )
...
Co-authored-by: osanseviero <osanseviero@gmail.com>
Co-authored-by: osanseviero <osanseviero@gmail.com>
2022-01-27 10:25:43 -05:00
SaulLu
ade7371a41
improve saving strategy of sentencepiece tokenizer ( #15328 )
...
* add new test
* add a feature to save the sentencepiece tokenizer model when the init file was deleted
* update marian
* update m2m_100
* fix marian
* update speech to text
* override test for layoutxlm
* fix saving bartpho
* remove hardcoded values bartpho
* special token string version
* finish bartpho
* override layoutxlm test
* add mbart
* move special tokens list
* format
* Revert "format"
This reverts commit 37a40df379.
* simplify list of string of special tokens
* Re-write `self.fairseq_tokens_to_ids` initialization logic with special tokens
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
2022-01-27 16:24:51 +01:00
Anton Lozhkov
196cce6e9b
Add a device argument to the eval script ( #15371 )
...
* Device argument for the eval script
* Default to none
* isort
2022-01-27 15:58:55 +01:00
Matt
6beae766ee
Fix KerasMetricCallback prediction with generate() and inference of column names ( #15351 )
...
* Fix prediction with generate() and the inference of column names
Should now have very few differences with the PyTorch implementation
* Minor edit to parent class
* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Explaining the dict conversion
* Putting main_input_name back
* Fixes to main_input_name
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-27 14:13:23 +00:00
Sylvain Gugger
da5ef25db9
Push to hub save ( #15327 )
...
* Adapt doc and push at every save
* style
2022-01-27 09:00:54 -05:00
Patrick von Platen
9f831bdeaf
[DocTests Speech] Add doc tests for all speech models ( #15031 )
...
* fix_torch_device_generate_test
* remove @
* doc tests
* up
* up
* fix doctests
* adapt files
* finish refactor
* up
* save intermediate
* add more logic
* new change
* improve
* next try
* next try
* next try
* next try
* fix final spaces
* fix final spaces
* improve
* renaming
* correct more bugs
* finish wavlm
* add comment
* run on test runner
* finish all speech models
* adapt
* finish
2022-01-27 14:29:31 +01:00
Sylvain Gugger
4df69506a8
Fix YosoConfig doc ( #15353 )
2022-01-26 21:06:27 +01:00
Stas Bekman
fc8fc400e3
[docs] post-PR merge fix ( #15355 )
...
* [docs] post-PR merge fix
* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-01-26 11:23:32 -08:00
novice
99a2771189
Add YOSO ( #15091 )
...
* Add cookiecutter files
* Add cuda kernels and cpp files
* Update modeling_yoso.py
* Add .h files
* Update configuration_yoso.py
* Updates
* Remove tokenizer
* Code quality
* Update modeling_yoso.py
* Update modeling_yoso.py
* Fix failing test
* Update modeling_yoso.py
* Fix code quality
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Apply suggestions from code review
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply suggestions from code review and fix integration tests
* Update src/transformers/models/yoso/modeling_yoso.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Apply suggestions from code review
* Fix copied from statement
* Fix docstring
* Fix code quality
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Apply suggestions and fix mask
* Apply suggestions from code review
* Fix code quality
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix docstrings
* Fix code quality
* Remove trailing whitespace
* Update yoso.mdx
* Move kernel loading to YosoEncoder
* make style
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/yoso/modeling_yoso.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Add short summary to docs
* Update docs/source/model_doc/yoso.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update yoso.mdx
* Update docs/source/model_doc/yoso.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Remove CausalLM model and add copied from
* Remove autoregressive code
* Remove unused imports
* add copied from for embeddings
* Fix code quality
* Update docs/source/model_doc/yoso.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Apply suggestion from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-26 19:18:29 +01:00
Sylvain Gugger
6292532fd1
Update doc writing guide ( #15350 )
2022-01-26 12:54:11 -05:00
François REMY
19732cc07a
Fix 'eval_split_name' described as defaulting to 'train' ( #15348 )
...
The default is correct (`test`) but the description is not.
2022-01-26 10:19:38 -05:00
Ngo Quang Huy
5d8b98608c
Fix deepspeed docs ( #15346 )
2022-01-26 07:24:33 -05:00
Jacob Deppen
96161ac408
make table into valid Markdown table syntax ( #15337 )
2022-01-26 07:10:00 -05:00