Suraj Patil
1c8c2d9ab3
[WIP][examples/seq2seq] move old s2s scripts to legacy ( #10136 )
...
* move old s2s scripts to legacy
* add the tests back
* proper rename
* restore
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2021-02-15 10:48:02 -08:00
Stas Bekman
769948fad2
json to jsonlines, and doc, and typo ( #10043 )
2021-02-07 17:51:34 -08:00
Magdalena Biesialska
8f6c12d306
Fix fine-tuning translation scripts ( #9809 )
2021-01-26 11:30:31 -05:00
Sylvain Gugger
411c582109
Fixes to run_seq2seq and instructions ( #9734 )
...
* Fixes to run_seq2seq and instructions
* Add more defaults for summarization
2021-01-22 10:03:57 -05:00
Manuel Romero
37d6fb5d04
Fix link to bertabs/README.md ( #9255 )
2020-12-22 11:41:23 -05:00
Stas Bekman
14c79c3e31
native amp leak fix landed in 1.7.1 ( #9115 )
...
update README with good news that the leak fix has been applied to pytorch-1.7.1.
2020-12-15 09:10:41 -05:00
Sylvain Gugger
783d7d2629
Reorganize examples ( #9010 )
...
* Reorganize example folder
* Continue reorganization
* Change requirements for tests
* Final cleanup
* Finish regroup with tests all passing
* Copyright
* Requirements and readme
* Make a full link for the documentation
* Address review comments
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Add symlink
* Reorg again
* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
* Adapt title
* Update to new strucutre
* Remove test
* Update READMEs
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
2020-12-11 10:07:02 -05:00
Stas Bekman
df311a5ccf
[seq2seq] document the caveat of leaky native amp ( #8930 )
...
* document the caveat of leaky native amp
* Update examples/seq2seq/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-12-04 15:43:35 -08:00
Stas Bekman
4c3d98dddc
[s2s finetune_trainer] add instructions for distributed training ( #8884 )
2020-12-03 16:05:55 -08:00
Stas Bekman
ddf3c64654
potpurri of small fixes ( #8807 )
2020-11-26 14:06:27 -08:00
Sumithra Bhakthavatsalam
81ebd70671
[s2s] distill t5-large -> t5-small ( #8376 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-11-11 17:58:45 -05:00
Leandro von Werra
17450397a7
Docs bart training ref ( #8330 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-11-05 17:20:57 -05:00
Sam Shleifer
49e4fece5c
[s2s] distillBART docs for paper replication ( #8150 )
2020-10-29 12:01:15 -04:00
Stas Bekman
0e24e4c136
[s2s] create doc for pegasus/fsmt replication ( #7934 )
2020-10-20 15:07:52 -04:00
Sam Shleifer
827c519494
[examples] bump pl=0.9.0 ( #7053 )
2020-10-11 16:39:38 -04:00
Sam Shleifer
297233fa92
[s2s] Switch README urls to cdn ( #7670 )
2020-10-08 21:22:22 -04:00
Suraj Patil
eab5f59682
[s2s] add create student script ( #7290 )
...
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-27 15:10:46 -04:00
Suraj Patil
415071b4c2
doc changes ( #7385 )
2020-09-25 08:00:36 -04:00
Suraj Patil
9e68d075a4
Seq2SeqTrainer ( #6769 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-24 18:46:58 -04:00
Sam Shleifer
d9d0f1140b
[s2s] distributed eval allows num_return_sequences > 1 ( #7254 )
2020-09-24 17:30:09 -04:00
Stas Bekman
eadd870b2f
[seq2seq] make it easier to run the scripts ( #7274 )
2020-09-24 15:23:48 -04:00
Sam Shleifer
25b0463d0b
[s2s] add supported architecures to MD ( #7252 )
2020-09-22 13:09:35 -04:00
Sam Shleifer
a5638b2b3a
[s2s] dynamic batch size with --max_tokens_per_batch ( #7030 )
2020-09-17 15:19:34 -04:00
Sam Shleifer
0203ad43bc
[s2s] distributed eval cleanup ( #7186 )
2020-09-16 15:38:37 -04:00
Stas Bekman
fdaf8ab349
[s2s run_eval] new features ( #7109 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-16 13:59:57 -04:00
brett koonce
b8e4906c97
tweak tar command in readme ( #6919 )
2020-09-03 09:29:01 -04:00
Sam Shleifer
dfa10a41ba
[s2s README] Add more dataset download instructions ( #6737 )
2020-08-30 16:29:24 -04:00
Sam Shleifer
9336086ab5
prepare_seq2seq_batch makes labels/ decoder_input_ids made later. ( #6654 )
...
* broken test
* batch parity
* tests pass
* boom boom
* boom boom
* split out bart tokenizer tests
* fix tests
* boom boom
* Fixed dataset bug
* Fix marian
* Undo extra
* Get marian working
* Fix t5 tok tests
* Test passing
* Cleanup
* better assert msg
* require torch
* Fix mbart tests
* undo extra decoder_attn_mask change
* Fix import
* pegasus tokenizer can ignore src_lang kwargs
* unused kwarg test cov
* boom boom
* add todo for pegasus issue
* cover one word translation edge case
* Cleanup
* doc
2020-08-28 11:15:17 -04:00
Sam Shleifer
72add6c98f
[s2s] docs, document desired filenames nicely ( #6525 )
2020-08-16 20:31:22 -04:00
Sam Shleifer
be1520d3a3
rename prepare_translation_batch -> prepare_seq2seq_batch ( #6103 )
2020-08-11 15:57:07 -04:00
Sam Shleifer
2804fff839
[s2s]Use prepare_translation_batch for Marian finetuning ( #6293 )
...
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-08-06 14:58:38 -04:00
Sam Shleifer
57eb1cb68d
[s2s] Document better mbart finetuning command ( #6229 )
...
* Document better MT command
* improve multigpu command
2020-08-03 18:22:31 -04:00
Stas Bekman
d8dbf3b75d
[s2s] clean up + doc ( #6184 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-08-01 14:51:07 -04:00
Stas Bekman
3212b8850d
[s2s] add support for overriding config params ( #6149 )
2020-07-30 01:09:46 -04:00
Stas Bekman
f0c70085c2
link to README.md ( #6068 )
...
* add a link to README.md
* Update README.md
2020-07-28 20:34:58 +08:00
Sam Shleifer
3c7fbf35a6
MBART: support summarization tasks where max_src_len > max_tgt_len ( #6003 )
...
* MBART: support summarization tasks
* fix test
* Style
* add tokenizer test
2020-07-28 08:18:11 -04:00
Sam Shleifer
7a68d40138
[s2s] Don't mention packed data in README ( #6079 )
2020-07-27 20:07:21 -04:00
Sam Shleifer
1e00ef681d
[s2s] dont document packing because it hurts performance ( #6077 )
2020-07-27 18:26:00 -04:00
Sam Shleifer
c3206eef44
[test] partial coverage for train_mbart_enro_cc25.sh ( #5976 )
2020-07-22 14:34:49 -04:00
Sam Shleifer
5b193b39b0
[examples/seq2seq]: add --label_smoothing option ( #5919 )
2020-07-21 16:51:39 -04:00
Aditya Soni
ccbf74a685
typos in seq2seq/readme ( #5937 )
2020-07-21 09:44:59 -04:00
Sam Shleifer
09a2f40684
Seq2SeqDataset uses linecache to save memory by @Pradhy729 ( #5792 )
...
Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>
2020-07-18 13:57:33 -04:00
Sam Shleifer
dad5e12e54
[seq2seq] distillation.py accepts trainer arguments ( #5865 )
2020-07-18 07:43:57 -04:00
Sam Shleifer
ba2400189b
[seq2seq] MAX_LEN env var for MT commands ( #5837 )
2020-07-17 22:51:31 -04:00
Nathan Raw
529850ae7b
Lightning Updates for v0.8.5 ( #5798 )
...
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-07-17 22:43:06 -04:00
Sam Shleifer
353b8f1e7a
Add mbart-large-cc25, support translation finetuning ( #5129 )
...
improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
2020-07-07 13:23:01 -04:00
Sam Shleifer
27a7fe7a8d
examples/seq2seq: never override $WANDB_PROJECT ( #5407 )
2020-06-30 15:29:13 -04:00
Kevin Canwen Xu
331d8d2936
Upload DistilBART artwork ( #5394 )
2020-06-30 18:11:11 +08:00
Sam Shleifer
a316a6aaa8
[seq2seq docs] Move evaluation down, fix typo ( #5365 )
2020-06-29 10:36:04 -04:00
Sam Shleifer
393b8dc09a
examples/seq2seq/run_eval.py fixes and docs ( #5322 )
2020-06-26 19:20:43 -04:00