Sylvain Gugger
18ca0e9140
Fix #6575 ( #6596 )
2020-08-19 13:04:33 -04:00
Suraj Patil
7581884dee
[BartTokenizerFast] add prepare_seq2seq_batch ( #6543 )
2020-08-19 10:37:48 -04:00
Patrick von Platen
8bcceaceff
fix model outputs test ( #6593 )
2020-08-19 16:18:51 +02:00
Sam Shleifer
9a86321b11
tf generation utils: remove unused kwargs ( #6591 )
2020-08-19 09:37:45 -04:00
Pradhy729
2a7402cbd3
Feed forward chunking others ( #6365 )
...
* Feed forward chunking for Distilbert & Albert
* Added ff chunking for many other models
* Change model signature
* Added chunking for XLM
* Cleaned up by removing some variables.
* remove test_chunking flag
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
2020-08-19 14:31:10 +02:00
Patrick von Platen
fe0b85e77a
[EncoderDecoder] Add functionality to tie encoder decoder weights ( #6538 )
...
* start adding tie encoder to decoder functionality
* finish model tying
* make style
* Apply suggestions from code review
* fix t5 list including cross attention
* apply sams suggestions
* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add max depth break point
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-19 14:23:45 +02:00
Sam Shleifer
ab42d74850
Fix bart base test ( #6587 )
2020-08-18 21:28:10 -04:00
Sam Shleifer
1529bf9680
add BartConfig.force_bos_token_to_be_generated ( #6526 )
...
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-18 19:15:50 -04:00
Patrick von Platen
974bb4af26
[Model card] Bert2GPT2 EncoderDecoder model ( #6569 )
...
* Bert2GPT2 EncoderDecoder model
* Update README.md
2020-08-18 19:28:17 +02:00
Suraj Patil
6f972e1423
update xnli-mt url ( #6580 )
2020-08-18 13:10:47 -04:00
Suraj Patil
fb6844aff5
[Pegasus Doc] minor typo ( #6579 )
...
Minor typo correction
@sshleifer
2020-08-18 12:47:47 -04:00
Manuel Romero
aaab9ab187
Create README.md ( #6556 )
2020-08-18 12:43:20 -04:00
Manuel Romero
1dfce0f08a
Create README.md ( #6557 )
2020-08-18 12:42:14 -04:00
Romain Rigaux
7516bcf273
[docs] Fix number of 'ug' occurrences in tokenizer_summary ( #6574 )
2020-08-18 10:23:25 -04:00
Romain Rigaux
5a5af22ed5
[docs] Fix wrong newline in the middle of a paragraph ( #6573 )
2020-08-18 10:22:43 -04:00
Stas Bekman
7659a8eb37
fix incorrect codecov reports ( #6553 )
...
As discussed at https://github.com/huggingface/transformers/issues/6317 codecov currently sends an invalid report when it fails to find a code coverage report for the base it checks against, so this gets fixed by:
- require_base: yes # don't report if there is no base coverage report
let's add this for clarity, this supposedly is already the default.
- require_head: yes # don't report if there is no head coverage report
and perhaps no point reporting on doc changes as they don't make any difference and it just generates noise:
- require_changes: true # only comment if there was change in coverage
2020-08-18 10:21:13 -04:00
Stefan Schweter
cfa26d2b41
github: add @stefan-it to bug-report template for all token-classification related bugs ( #6489 )
2020-08-18 08:38:54 -04:00
Philip May
1fdf372f8c
Small typo fixes for model card: electra-base-german-uncased ( #6555 )
...
* Update README.md
* Update model_cards/german-nlp-group/electra-base-german-uncased/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-18 08:21:52 -04:00
Ali Modarressi
5a81195ea9
Fixed label datatype for STS-B ( #6492 )
...
* fixed label datatype for sts-b
* naming update
* make style
* make style
2020-08-18 08:09:39 -04:00
Sam Shleifer
12d7624199
[marian] converter supports models from new Tatoeba project ( #6342 )
2020-08-17 23:55:42 -04:00
Jim Regan
fb7330b30e
update with #s of sentences/tokens ( #6546 )
2020-08-17 16:48:05 -04:00
onepointconsulting
63144701ed
Added first model card ( #6530 )
...
* Added first model card
* Add metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-17 16:24:10 -04:00
Ikram Ali
98ee802023
[model_cards] Add model cards for Urduhack model (roberta-urdu-small) ( #6536 )
...
* [model_cards] roberta-urdu-small added.
* [model_cards] typo fixed.
* Tweak license format (yaml expects a simple string)
Co-authored-by: Ikram Ali <mrikram1989>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-08-17 16:04:29 -04:00
Jim Regan
3a302904cb
[model_cards] Add a new model for Irish ( #6544 )
2020-08-17 15:56:56 -04:00
Julien Chaumond
07971d8b18
[model_cards] Fix yaml for cedpsam/chatbot_fr
2020-08-17 21:33:32 +02:00
Suraj Patil
407da12ef1
[T5Tokenizer] add prepare_seq2seq_batch method ( #6122 )
...
* tests
2020-08-17 13:57:19 -04:00
Suraj Patil
c9564f5343
[Doc] add more MBart and other doc ( #6490 )
...
* add mbart example
* add Pegasus and MBart in readme
* typo
* add MBart in Pretrained models
* add pre-proc doc
* add DPR in readme
* fix indent
* doc fix
2020-08-17 12:30:26 -04:00
Stas Bekman
f68c873100
replace _ with __ rst links ( #6541 )
2020-08-17 12:27:02 -04:00
sgugger
7ca6ab67fc
Fix CI
2020-08-17 12:20:40 -04:00
Stas Bekman
b732e7e111
[doc] multiple corrections to "Summary of the tasks" ( #6509 )
...
* [doc] multiple corrections to "Summary of the tasks"
* fix indentation
* correction
* fix links, add links to examples/seq2seq/README.md instead of non-existing script
2020-08-17 11:49:16 -04:00
Suraj Patil
2a77813d53
[BartTokenizer] add prepare s2s batch ( #6212 )
...
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
2020-08-17 11:44:46 -04:00
Stas Bekman
84d33317ae
[doc] make the text more readable, fix some typos, add some disambiguation ( #6508 )
...
* [doc] make the text more readable, fix some typos, add some disambiguation
* Update docs/source/glossary.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-17 11:07:58 -04:00
Joe Davison
d0c2389f48
add custom datasets tutorial ( #6466 )
...
* add custom datasets tutorial
* python -> bash code blocks
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* minor review feedback changes
* add working native QA snippet
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-17 09:15:34 -04:00
Sam Shleifer
d2da2cb232
allow spaces in bash args with "$@" ( #6521 )
2020-08-17 09:06:35 -04:00
Funtowicz Morgan
b41cc0b86a
Fix flaky ONNX tests ( #6531 )
2020-08-17 09:04:35 -04:00
Stas Bekman
39c3b1d9de
[sched] polynomial_decay_schedule use default power=1.0 ( #6473 )
2020-08-17 08:33:12 -04:00
Stas Bekman
9dbe4094f2
[testing] a new TestCasePlus subclass + get_auto_remove_tmp_dir() ( #6494 )
...
* [testing] switch to a new TestCasePlus + get_auto_remove_tmp_dir() for auto-removal of tmp dirs
* respect after=True for tempfile, simplify code
* comments
* comment fix
* put `before` last in args, so can make debug even faster
2020-08-17 08:12:19 -04:00
Patrick von Platen
36010cb1e2
fix pegasus doc ( #6533 )
2020-08-17 12:24:43 +02:00
Kevin Canwen Xu
37709b5909
Remove deprecated assertEquals ( #6532 )
...
`assertEquals` is deprecated: https://stackoverflow.com/questions/930995/assertequals-vs-assertequal-in-python/931011
This PR replaces these deprecated methods.
2020-08-17 17:13:58 +08:00
Stas Bekman
49d8076fa2
[doc] Summary of the models fixes ( #6511 )
...
* [doc] Summary of the models fixes
* correction
2020-08-17 16:04:53 +08:00
Cahya Wirawan
72911c893a
Create model cards for indonesian models ( #6522 )
...
* added model cards for indonesian gpt2-small, bert-base and roberta-base models
* removed bibtex entries
2020-08-17 15:42:25 +08:00
Masatoshi Suzuki
48c6c6139f
Support additional dictionaries for BERT Japanese tokenizers ( #6515 )
...
* Update BERT Japanese tokenizers
* Update CircleCI config to download unidic
* Specify to use the latest dictionary packages
2020-08-17 12:00:23 +08:00
Stas Bekman
423eb5b1d7
[doc] fix invalid env vars ( #6504 )
...
- remove invalid `ENV_` prefix.
- add a few ':' while at it
2020-08-17 11:11:40 +08:00
Philip May
3c72f5584b
Add Model Card for electra-base-german-uncased ( #6496 )
...
* Add Model Card for electra-base-german-uncased
* Update README.md
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
2020-08-17 11:02:32 +08:00
Stas Bekman
df15c7c226
typos ( #6505 )
2020-08-17 10:57:36 +08:00
fabiocapsouza
6d38ab1cc3
Update bert-base-portuguese-cased and bert-large-portuguese-cased model cards ( #6527 )
...
Co-authored-by: Fabio Souza <fabiosouza@neuralmind.ai>
2020-08-17 10:49:49 +08:00
Sam Shleifer
84c265ffcc
[lightning_base] fix s2s logging, only make train_loader once ( #6404 )
2020-08-16 22:49:41 -04:00
Sam Shleifer
72add6c98f
[s2s] docs, document desired filenames nicely ( #6525 )
2020-08-16 20:31:22 -04:00
Kyle Piira
2060181126
Fixes paths with spaces in seq2seq example ( #6493 )
2020-08-16 13:36:38 -04:00
Kevin Canwen Xu
fe61c05b85
Add examples/bert-loses-patience who can help ( #6499 )
2020-08-16 16:30:16 +08:00