Omar U. Espejel
da47c264f9
Add translating guide ( #17004 )
...
* Add translating guide
2022-04-30 17:43:38 -05:00
Yih-Dar
ede5e04191
Add a check on config classes docstring checkpoints ( #17012 )
...
* Add the check
* add missing ckpts
* add a list to ignore
* call the added check script
* better regex pattern
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-30 10:40:46 +02:00
Sylvain Gugger
7152ed2bae
Result of new doc style with fixes ( #17015 )
...
* Result of new doc style with fixes
* Add last two files
* Bump hf-doc-builder
2022-04-29 17:42:15 -04:00
Sylvain Gugger
18df440709
Replace dict/BatchEncoding instance checks by Mapping ( #17014 )
...
* Replace dict/BatchEncoding instance checks by Mapping
* Typo
2022-04-29 17:20:52 -04:00
Nicolas Patry
b8dffd1f3e
Revert "Updating variable names. ( #16445 )" ( #17011 )
...
This reverts commit 4f3a14e3c2
.
2022-04-29 12:26:45 -04:00
Nicolas Patry
4f3a14e3c2
Updating variable names. ( #16445 )
2022-04-29 17:44:28 +02:00
tarzan
20fb5d51ea
Update README_zh-hans.md ( #16977 )
2022-04-29 11:05:03 -04:00
Pavel Belevich
63fbed5c59
Make create_extended_attention_mask_for_decoder static method ( #16893 )
2022-04-29 10:57:09 -04:00
Joao Gante
fb0ae12947
TF: XLA bad words logits processor and list of processors ( #16974 )
2022-04-29 15:54:58 +01:00
Zachary Mueller
57e6464ac9
Update all require decorators to use skipUnless when possible ( #16999 )
2022-04-29 08:55:38 -04:00
Yih-Dar
e952e049b4
use scale=1.0 in floats_tensor called in speech model testers ( #17007 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-29 14:41:33 +02:00
Sylvain Gugger
e6f00a11d7
Update README to latest release ( #16997 )
2022-04-28 14:17:44 -04:00
Zachary Mueller
3486a92a57
Fix savedir for by epoch ( #16996 )
2022-04-28 13:49:45 -04:00
Yih-Dar
5af5735f62
set eos_token_id to None to generate until max length ( #16989 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-28 19:47:38 +02:00
amyeroberts
01562dac7e
Rename a class to reflect framework pattern AutoModelXxx -> TFAutoModelXxx ( #16993 )
2022-04-28 18:11:54 +01:00
conan1024hao
1be8d56ec6
Add parameter --config_overrides for run_mlm_wwm.py ( #16961 )
...
* dd parameter --config_overrides for run_mlm_wwm.py
* linter
2022-04-28 10:44:55 -04:00
Yih-Dar
1f9e862507
Update check_models_are_tested to deal with Windows path ( #16973 )
...
* fix
* Apply suggestions from code review
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-04-28 15:31:57 +02:00
Dat Quoc Nguyen
dced262409
Update tokenization_bertweet.py ( #16941 )
...
The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e. not consistent with the vocab).
2022-04-27 16:54:31 -04:00
Yih-Dar
992996e9ca
Add -e flag to some GH workflow yml files ( #16959 )
...
* Add -e flag
* add check
* create new keys
* run python setup.py build install
* add comments
* change to develop
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-27 21:44:21 +02:00
Yih-Dar
596afb4297
Fix check_all_models_are_tested ( #16970 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-27 21:18:29 +02:00
Sylvain Gugger
691cdbb7d7
Fix doc notebooks links ( #16969 )
...
* Fix doc notebooks links
* Remove missing section
2022-04-27 14:59:53 -04:00
Zachary Mueller
60e1d883f1
Fixup no_trainer save logic ( #16968 )
...
* Fixup all examples
2022-04-27 14:46:49 -04:00
Sylvain Gugger
c79bbc3ba5
Fix multiple deletions of the same files in save_pretrained ( #16947 )
...
* Fix multiple deletions of the same files in save_pretrained
* Add is_main_process argument
2022-04-27 12:28:42 -04:00
Sylvain Gugger
bfbec17765
Fix add-new-model-like when model doesn't support all frameworks ( #16966 )
2022-04-27 11:15:25 -04:00
Mishig Davaadorj
cf8a7c2490
Update custom_models.mdx ( #16964 )
...
BertModelForSequenceClassification -> BertForSequenceClassification
2022-04-27 16:46:55 +02:00
Antoni Baum
5896b3ecce
Fix distributed_concat
with scalar tensor ( #16963 )
...
* Fix `distributed_concat` with scalar tensor
* Update trainer_pt_utils.py
2022-04-27 10:26:22 -04:00
NielsRogge
084c38c59d
[HF Argparser] Fix parsing of optional boolean arguments ( #16946 )
...
* Add fix
* Apply suggestion from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-04-27 15:00:45 +02:00
Leonid Boytsov
c82e017aa9
Misc. fixes for Pytorch QA examples: ( #16958 )
...
1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed).
2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion).
3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions.
4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
2022-04-27 08:51:39 -04:00
Yih-Dar
49d5bcb0f3
Fix HubertRobustTest PT/TF equivalence test on GPU ( #16943 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-27 10:50:03 +02:00
NielsRogge
479fdc4925
Add semantic script, trainer ( #16834 )
...
* Add first draft
* Improve script and README
* Improve README
* Apply suggestions from code review
* Improve script, add link to resulting model
* Add corresponding test
* Adjust learning rate
2022-04-27 10:12:18 +02:00
Anton Lozhkov
a4a88fa09f
[Research] Speed up evaluation for XTREME-S ( #16785 )
...
* Avoid repeated per-lang filtering
* Language groups and logits preprocessing
* Style
2022-04-27 08:34:21 +02:00
Yongliang Shen
2d91e3c304
use original loaded keys to find mismatched keys ( #16920 )
2022-04-26 17:29:52 -04:00
nikkie
d365f5074f
Fix RuntimeError message format ( #16906 )
2022-04-26 17:08:28 -04:00
Yang Ming
10dfa126b7
documentation: some minor clean up ( #16850 )
2022-04-26 16:56:08 -04:00
Krishna Sirumalla
aaee4038c3
Add onnx config for RoFormer ( #16861 )
...
* add roformer onnx config
2022-04-26 16:51:15 +02:00
Ahmed Elnaggar
8afaaa26f5
FIx Iterations for decoder ( #16934 )
...
FIx Iterations for decoder
2022-04-26 12:54:14 +02:00
Manuel
fa32247406
apply torch int div to layoutlmv2 ( #15457 )
...
* apply torch int div
* black linting fixup
* update path to torch_int_div
* clarify imports
2022-04-26 10:07:51 +02:00
Sylvain Gugger
344b9fb0c6
Limit the use of PreTrainedModel.device ( #16935 )
...
* Limit the use of PreTrainedModel.device
* Fix
2022-04-25 20:58:50 -04:00
code-review-doctor
6568752039
Fix issue probably-meant-fstring found at https://codereview.doctor ( #16913 )
2022-04-25 15:15:00 -04:00
Sanchit Gandhi
fea94d6790
Replace deprecated logger.warn with warning ( #16876 )
2022-04-25 15:12:51 -04:00
Joao Gante
e03966e404
TF: XLA stable softmax ( #16892 )
...
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-04-25 20:10:51 +01:00
Rushi Chaudhari
8246caf3eb
added deit onnx config ( #16887 )
...
* added deit onnx config
2022-04-25 20:50:45 +02:00
Joao Gante
9331b37967
TF: XLA Logits Warpers ( #16899 )
...
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2022-04-25 19:48:08 +01:00
Joao Gante
809dac48f9
TF: XLA logits processors - minimum length, forced eos, and forced bos ( #16912 )
...
* XLA min len, forced eos, and forced bos
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2022-04-25 19:27:53 +01:00
Yih-Dar
f6210c49e2
Fix RemBertTokenizerFast ( #16933 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-25 19:51:50 +02:00
Yih-Dar
32adbb26d6
Fix PyTorch RAG tests GPU OOM ( #16881 )
...
* add torch.cuda.empty_cache in some PT RAG tests
* torch.cuda.empty_cache in tearDownModule()
* tearDown()
* add gc.collect()
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-25 17:33:56 +02:00
Yih-Dar
3e47d19cfc
Add missing ckpt in config docs ( #16900 )
...
* add missing ckpt in config docs
* add more missing ckpt in config docs
* fix wrong ckpts
* fix realm ckpt
* fix s2t2
* fix xlm_roberta ckpt
* Fix for deberta v2
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* use only one checkpoint for DPR
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2022-04-25 17:31:45 +02:00
Patrick von Platen
3a71e94a92
Fix doc test quicktour dataset ( #16929 )
...
* fix doc test
* fix doc test
Co-authored-by: Patrick <patrick@pop-os.localdomain>
2022-04-25 16:26:59 +02:00
Thomas Chaigneau
508baf1943
add bigbird typo fixes ( #16897 )
...
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
2022-04-25 11:32:06 +02:00
Patrick von Platen
72728be3db
[DocTests] Fix some doc tests ( #16889 )
...
* [DocTests] Fix some doc tests
* hacky fix
* correct
2022-04-23 08:40:14 +02:00