Anton Lozhkov
e226a24f84
[xtreme-s] Update Minds14 results ( #16241 )
...
* update results
* per-language metrics
* Format the per-language metrics
2022-03-21 19:33:59 +01:00
Suraj Patil
93d3fd8645
remove jax.ops.index ( #16220 )
2022-03-17 17:51:43 +01:00
Anton Lozhkov
d35e0c6247
Minor fixes to XTREME-S ( #16193 )
...
* Minor fixes
* Fix vocab union
* Update examples/research_projects/xtreme-s/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update README
* unused import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-03-16 17:23:00 +04:00
Sanchit Gandhi
ee27b3d7df
Replace all deprecated jax.ops
operations with jnp's at
( #16078 )
...
* Replace all deprecated `jax.ops` operations with jnp's `at`
* np to jnp scores
* suggested changes
2022-03-16 09:08:55 +00:00
Patrick von Platen
c2dc89be62
[Xtreme-S] fix some namings ( #16183 )
2022-03-16 01:21:31 +01:00
Anton Lozhkov
99fd3eb4a5
Add the XTREME-S fine-tuning example ( #15985 )
...
* CTC+classification draft
* CTC+classification draft
* style
* multilingual runs
* Fix race condition during processor.from_reatrained
* Merge covost experiments
* Add README
* Quality
* Switch to .all configs
* Fix typos
2022-03-16 00:21:06 +01:00
Stas Bekman
580dd87c55
[Deepspeed] add support for bf16 mode ( #14569 )
...
* [WIP] add support for bf16 mode
* prep for bf16
* prep for bf16
* fix; zero2/bf16 is ok
* check bf16 is available
* test fixes
* enable zero3_bf16
* config files
* docs
* split stage_dtype; merge back to non-dtype-specific config file
* fix doc
* cleanup
* cleanup
* bfloat16 => bf16 to match the PR changes
* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
* test fixes/skipping
* move
* fix
* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* backticks
* cleanup
* cleanup
* cleanup
* new version
* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-03-11 17:53:53 -08:00
Sylvain Gugger
19597998f6
Don't compute metrics in LM examples on TPU ( #16029 )
2022-03-10 07:44:51 -05:00
Sanchit Gandhi
6c9010ef63
Update README.md
2022-03-10 10:20:37 +01:00
Shotaro Ishihara
8feede229c
Fix broken code blocks in README.md ( #15967 )
...
at transformers/examples/pytorch/contrastive-image-text
2022-03-09 17:07:52 +01:00
Joao Gante
e7f34ccd4f
Swag example: Update doc format ( #16014 )
2022-03-09 13:25:34 +00:00
Joao Gante
62d847602a
Update TF multiple choice example ( #15868 )
2022-03-08 13:16:34 +00:00
Yeb Havinga
91fb62d01c
Speedup training by using numpy instead of jnp for batch shuffling ( #15963 )
...
Speedup training by using numpy instead of jnp for batch shuffling
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
2022-03-08 12:18:38 +01:00
Patrick von Platen
10b76987fc
[FlaxT5 Example] fix flax t5 example pretraining ( #15835 )
2022-03-04 17:04:43 +01:00
Sanchit Gandhi
b71474895d
Update README.md
2022-03-04 09:58:45 +01:00
davidleonfdez
c0281feb50
Fix #15898 ( #15928 )
2022-03-03 14:41:03 -05:00
Sylvain Gugger
79d28e80b6
v4.18.0.dev.0
2022-03-03 10:19:58 -05:00
Ross Johnstone
e535c389aa
Fix tiny typo ( #15884 )
2022-03-02 15:37:05 +01:00
Joao Gante
05c237ea94
Update TF QA example ( #15870 )
2022-03-02 10:38:13 +00:00
Joao Gante
3f2e636850
Update TF LM examples ( #15855 )
2022-03-01 14:12:58 +00:00
Suraj Patil
bf1fe32824
[examples/summarization and translation] fix readme ( #15833 )
2022-02-25 17:28:16 +01:00
Lysandre Debut
29c10a41d0
[Test refactor 1/5] Per-folder tests reorganization ( #15725 )
...
* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>
2022-02-23 15:46:28 -05:00
Yongrae Jo
3db2e8f92b
Fix typo on examples/pytorch/question-answering ( #15644 )
...
cna -> can
2022-02-22 13:51:07 -05:00
Joao Gante
3956b133b6
TF text classification examples ( #15704 )
...
* Working example with to_tf_dataset
* updated text_classification
* more comments
2022-02-21 17:17:59 +00:00
Suraj Patil
86119c1154
add VisionTextDualEncoder and CLIP fine-tuning script ( #15701 )
...
* begin script
* update script
* fix features and data args
* main
* add requirements
* add column name args
* fix captions
* don't jit transforms
* fix caption
* fix labels, handle attention mask
* convert pixel values to numpy
* labels => input_ids
* transform images on the fly
* use AutoModel class, create the hybird model outside of the script
* fix version message
* add readme
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* adderss review comments
* add more comments
* allow freezing vision and text models
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-02-21 16:10:59 +01:00
Ivan Agarský
5444687f0f
Fix minor comment typos ( #15740 )
2022-02-21 12:41:27 +01:00
Simon Sardorf
a63bd3675f
Remove input and target reset after preprocessing ( #15741 )
...
Remove input and target reset after preprocessing
2022-02-21 11:10:15 +01:00
NielsRogge
57882177be
Add SimMIM ( #15586 )
...
* Add first draft
* Make model importable
* Make SwinForMaskedImageModeling importable
* Fix imports
* Add missing inits
* Add support for Swin
* Fix bug
* Fix bug
* Fix another bug
* Fix Swin MIM implementation
* Fix default encoder stride
* Fix Swin
* Add print statements for debugging
* Add image_size data argument
* Fix Swin
* Fix image_size
* Add print statements for debugging
* Fix print statement
* Remove print statements
* Improve reshaping of bool_masked_pos
* Add support for DeiT, fix tests
* Improve docstrings
* Apply new black version
* Improve script
* Fix bug
* Improve README
* Apply suggestions from code review
* Remove DS_Store and add to gitignore
* Apply suggestions from code review + fix BEiT Flax
* Revert BEiT changes
* Improve README
* Fix code quality
* Improve README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-02-17 19:44:55 +01:00
NielsRogge
0e91f885c3
Add image classification notebook ( #15667 )
...
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-02-17 13:14:01 +01:00
Shamane Siri
80f1a59168
updated with latest PL and Ray ( #15653 )
2022-02-15 16:53:05 +01:00
Stas Bekman
fcb0f74397
[research_projects] deal with security alerts ( #15594 )
...
* [research_projects] deal with security alerts
* add a note of the original PL ver and warning
2022-02-11 14:31:09 -05:00
Patrick von Platen
3d5dea9bf0
Add example batch size to all commands ( #15596 )
2022-02-10 08:52:07 -05:00
Lysandre Debut
7732d0fe7a
Upgrade black to version ~=22.0 ( #15565 )
...
* Upgrade black to version ~=22.0
* Check copies
* Fix code
2022-02-09 09:28:57 -05:00
Anton Lozhkov
a459f7f97d
Add ASR CTC streaming example ( #15309 )
...
* Single-epoch run
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Infinite dataset
* Trainer fix + distributed benchmark
* Benchmark fix
* unused import
* interleaved splits
* interleaved splits
* has_length util
* Move to research projects
* Leftover Sized checks
* Bump min version
* Unused import
* Revert trainer changes
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-02-07 18:35:37 +03:00
davidleonfdez
f1a4c4ead5
[WIP] Add preprocess_logits_for_metrics Trainer param ( #15473 )
...
* Add preprocess_logits_for_metrics Trainer param
* Compute accuracy in LM examples
* Improve comments
2022-02-03 12:07:20 -05:00
Sylvain Gugger
45cac3fade
Fix labels stored in model config for token classification examples ( #15482 )
...
* Playing
* Properly set labels in model config for token classification example
* Port to run_ner_no_trainer
* Quality
2022-02-02 14:23:43 -05:00
Sylvain Gugger
d0b5ed110a
Harder check for IndexErrors in QA scripts ( #15438 )
...
* Harder check for IndexErrors in QA scripts
* Make test stronger
2022-02-01 15:49:13 -05:00
Kamal Raj
d2749cf72e
Update README.md ( #15462 )
...
fix typo
2022-02-01 10:04:30 -05:00
Suraj Patil
87918d3221
[examples/Flax] add a section about GPUs ( #15198 )
...
* add a section about GPUs
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2022-01-31 19:20:53 +01:00
Jonatas Grosman
f624249d8b
[Robust Speech Challenge] Add missing LR parameter ( #15428 )
2022-01-31 15:50:56 +01:00
Julien Plu
aa19f478ac
Add (M)Luke model training for Token Classification in the examples ( #14880 )
...
* Add Luke training
* Fix true label tags
* Fix true label tags
* Fix true label tags
* Update the data collator for Luke
* Some training refactor for Luke
* Improve data collator for Luke
* Fix import
* Fix datasets concatenation
* Add the --max_entity_length argument for Luke models
* Remove unused code
* Fix style issues
* Fix style issues
* Move the Luke training into a separate folder
* Fix style
* Fix naming
* Fix filtering
* Fix filtering
* Fix filter
* Update some preprocessing
* Move luke to research_projects
* Checkstyle
* Address comments
* Fix style
2022-01-31 07:58:18 -05:00
François REMY
0094eba363
Fix additional DataTrainingArguments documentation ( #15408 )
...
(This is an editorial change only)
2022-01-31 07:45:11 -05:00
Sylvain Gugger
c98a6ac211
Use argument for preprocessing workers in run_summairzation ( #15394 )
2022-01-28 18:34:10 -05:00
Matt
b6b79faa7e
Make links explicit ( #15395 )
...
* Make links explicit
* Removing reference to compute_metrics() since it's kind of PyTorch-specific
2022-01-28 17:31:22 +00:00
dependabot[bot]
628b59e51d
Bump numpy from 1.19.2 to 1.21.0 in /examples/research_projects/lxmert ( #15369 )
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-27 14:46:15 -05:00
dependabot[bot]
ca0848b2ff
Bump notebook in /examples/research_projects/visual_bert ( #15368 )
...
Bumps [notebook](http://jupyter.org ) from 6.1.5 to 6.4.1.
---
updated-dependencies:
- dependency-name: notebook
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2022-01-27 14:45:58 -05:00
dependabot[bot]
7d45a2e81c
Bump numpy in /examples/research_projects/visual_bert ( #15367 )
...
Bumps [numpy](https://github.com/numpy/numpy ) from 1.19.2 to 1.21.0.
- [Release notes](https://github.com/numpy/numpy/releases )
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt )
- [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0 )
---
updated-dependencies:
- dependency-name: numpy
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-27 14:45:18 -05:00
Lysandre
eab338104d
Docs for version v4.16.0
2022-01-27 13:11:51 -05:00
Lysandre
f87db5e412
Release: v4.16.0
2022-01-27 13:06:33 -05:00
Anton Lozhkov
196cce6e9b
Add a device argument to the eval script ( #15371 )
...
* Device argument for the eval script
* Default to none
* isort
2022-01-27 15:58:55 +01:00