Sam Shleifer
|
49e4fece5c
|
[s2s] distillBART docs for paper replication (#8150)
|
2020-10-29 12:01:15 -04:00 |
|
Stas Bekman
|
0e24e4c136
|
[s2s] create doc for pegasus/fsmt replication (#7934)
|
2020-10-20 15:07:52 -04:00 |
|
Sam Shleifer
|
827c519494
|
[examples] bump pl=0.9.0 (#7053)
|
2020-10-11 16:39:38 -04:00 |
|
Sam Shleifer
|
297233fa92
|
[s2s] Switch README urls to cdn (#7670)
|
2020-10-08 21:22:22 -04:00 |
|
Suraj Patil
|
eab5f59682
|
[s2s] add create student script (#7290)
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-09-27 15:10:46 -04:00 |
|
Suraj Patil
|
415071b4c2
|
doc changes (#7385)
|
2020-09-25 08:00:36 -04:00 |
|
Suraj Patil
|
9e68d075a4
|
Seq2SeqTrainer (#6769)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-09-24 18:46:58 -04:00 |
|
Sam Shleifer
|
d9d0f1140b
|
[s2s] distributed eval allows num_return_sequences > 1 (#7254)
|
2020-09-24 17:30:09 -04:00 |
|
Stas Bekman
|
eadd870b2f
|
[seq2seq] make it easier to run the scripts (#7274)
|
2020-09-24 15:23:48 -04:00 |
|
Sam Shleifer
|
25b0463d0b
|
[s2s] add supported architecures to MD (#7252)
|
2020-09-22 13:09:35 -04:00 |
|
Sam Shleifer
|
a5638b2b3a
|
[s2s] dynamic batch size with --max_tokens_per_batch (#7030)
|
2020-09-17 15:19:34 -04:00 |
|
Sam Shleifer
|
0203ad43bc
|
[s2s] distributed eval cleanup (#7186)
|
2020-09-16 15:38:37 -04:00 |
|
Stas Bekman
|
fdaf8ab349
|
[s2s run_eval] new features (#7109)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-09-16 13:59:57 -04:00 |
|
brett koonce
|
b8e4906c97
|
tweak tar command in readme (#6919)
|
2020-09-03 09:29:01 -04:00 |
|
Sam Shleifer
|
dfa10a41ba
|
[s2s README] Add more dataset download instructions (#6737)
|
2020-08-30 16:29:24 -04:00 |
|
Sam Shleifer
|
9336086ab5
|
prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654)
* broken test
* batch parity
* tests pass
* boom boom
* boom boom
* split out bart tokenizer tests
* fix tests
* boom boom
* Fixed dataset bug
* Fix marian
* Undo extra
* Get marian working
* Fix t5 tok tests
* Test passing
* Cleanup
* better assert msg
* require torch
* Fix mbart tests
* undo extra decoder_attn_mask change
* Fix import
* pegasus tokenizer can ignore src_lang kwargs
* unused kwarg test cov
* boom boom
* add todo for pegasus issue
* cover one word translation edge case
* Cleanup
* doc
|
2020-08-28 11:15:17 -04:00 |
|
Sam Shleifer
|
72add6c98f
|
[s2s] docs, document desired filenames nicely (#6525)
|
2020-08-16 20:31:22 -04:00 |
|
Sam Shleifer
|
be1520d3a3
|
rename prepare_translation_batch -> prepare_seq2seq_batch (#6103)
|
2020-08-11 15:57:07 -04:00 |
|
Sam Shleifer
|
2804fff839
|
[s2s]Use prepare_translation_batch for Marian finetuning (#6293)
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2020-08-06 14:58:38 -04:00 |
|
Sam Shleifer
|
57eb1cb68d
|
[s2s] Document better mbart finetuning command (#6229)
* Document better MT command
* improve multigpu command
|
2020-08-03 18:22:31 -04:00 |
|
Stas Bekman
|
d8dbf3b75d
|
[s2s] clean up + doc (#6184)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-08-01 14:51:07 -04:00 |
|
Stas Bekman
|
3212b8850d
|
[s2s] add support for overriding config params (#6149)
|
2020-07-30 01:09:46 -04:00 |
|
Stas Bekman
|
f0c70085c2
|
link to README.md (#6068)
* add a link to README.md
* Update README.md
|
2020-07-28 20:34:58 +08:00 |
|
Sam Shleifer
|
3c7fbf35a6
|
MBART: support summarization tasks where max_src_len > max_tgt_len (#6003)
* MBART: support summarization tasks
* fix test
* Style
* add tokenizer test
|
2020-07-28 08:18:11 -04:00 |
|
Sam Shleifer
|
7a68d40138
|
[s2s] Don't mention packed data in README (#6079)
|
2020-07-27 20:07:21 -04:00 |
|
Sam Shleifer
|
1e00ef681d
|
[s2s] dont document packing because it hurts performance (#6077)
|
2020-07-27 18:26:00 -04:00 |
|
Sam Shleifer
|
c3206eef44
|
[test] partial coverage for train_mbart_enro_cc25.sh (#5976)
|
2020-07-22 14:34:49 -04:00 |
|
Sam Shleifer
|
5b193b39b0
|
[examples/seq2seq]: add --label_smoothing option (#5919)
|
2020-07-21 16:51:39 -04:00 |
|
Aditya Soni
|
ccbf74a685
|
typos in seq2seq/readme (#5937)
|
2020-07-21 09:44:59 -04:00 |
|
Sam Shleifer
|
09a2f40684
|
Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792)
Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>
|
2020-07-18 13:57:33 -04:00 |
|
Sam Shleifer
|
dad5e12e54
|
[seq2seq] distillation.py accepts trainer arguments (#5865)
|
2020-07-18 07:43:57 -04:00 |
|
Sam Shleifer
|
ba2400189b
|
[seq2seq] MAX_LEN env var for MT commands (#5837)
|
2020-07-17 22:51:31 -04:00 |
|
Nathan Raw
|
529850ae7b
|
Lightning Updates for v0.8.5 (#5798)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-07-17 22:43:06 -04:00 |
|
Sam Shleifer
|
353b8f1e7a
|
Add mbart-large-cc25, support translation finetuning (#5129)
improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
|
2020-07-07 13:23:01 -04:00 |
|
Sam Shleifer
|
27a7fe7a8d
|
examples/seq2seq: never override $WANDB_PROJECT (#5407)
|
2020-06-30 15:29:13 -04:00 |
|
Kevin Canwen Xu
|
331d8d2936
|
Upload DistilBART artwork (#5394)
|
2020-06-30 18:11:11 +08:00 |
|
Sam Shleifer
|
a316a6aaa8
|
[seq2seq docs] Move evaluation down, fix typo (#5365)
|
2020-06-29 10:36:04 -04:00 |
|
Sam Shleifer
|
393b8dc09a
|
examples/seq2seq/run_eval.py fixes and docs (#5322)
|
2020-06-26 19:20:43 -04:00 |
|
Sam Shleifer
|
5543b30aa6
|
[pl_examples] default warmup steps=0 (#5316)
|
2020-06-26 15:03:41 -04:00 |
|
Sam Shleifer
|
e008d520bb
|
[examples/seq2seq] more README improvements (#5274)
|
2020-06-25 10:13:01 -04:00 |
|
Sam Shleifer
|
40457bcebb
|
examples/seq2seq supports translation (#5202)
|
2020-06-24 23:58:11 -04:00 |
|