Sean Naren
|
5e24982e58
|
Upgrade PyTorch Lightning to 1.0.2 (#7852)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-10-28 14:59:14 -04:00 |
|
Stas Bekman
|
8b38173398
|
[seq2seq testing] multigpu test run via subprocess (#7281)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-10-21 17:20:53 -04:00 |
|
Sam Shleifer
|
827c519494
|
[examples] bump pl=0.9.0 (#7053)
|
2020-10-11 16:39:38 -04:00 |
|
Sam Shleifer
|
500be01c5d
|
[s2s] save first batch to json for debugging purposes (#6810)
|
2020-10-06 16:11:56 -04:00 |
|
Sam Shleifer
|
48f23f92a8
|
[s2sTrainer] test + code cleanup (#7467)
|
2020-10-01 00:33:01 -04:00 |
|
Stas Bekman
|
eadd870b2f
|
[seq2seq] make it easier to run the scripts (#7274)
|
2020-09-24 15:23:48 -04:00 |
|
Sam Shleifer
|
78387cc63e
|
[s2s] only save metrics.json from rank zero (#7331)
|
2020-09-22 18:27:28 -04:00 |
|
Stas Bekman
|
af4b98ed97
|
[s2s] adjust finetune + test to work with fsmt (#7263)
|
2020-09-21 15:13:19 -04:00 |
|
Stas Bekman
|
7cbf0f722d
|
examples/seq2seq/__init__.py mutates sys.path (#7194)
|
2020-09-20 16:54:42 -04:00 |
|
Sam Shleifer
|
67d9fc50d9
|
[s2s] remove double assert (#7223)
|
2020-09-17 18:32:31 -04:00 |
|
Sam Shleifer
|
a5638b2b3a
|
[s2s] dynamic batch size with --max_tokens_per_batch (#7030)
|
2020-09-17 15:19:34 -04:00 |
|
Sam Shleifer
|
77950c485a
|
[wip/s2s] DistributedSortishSampler (#7056)
|
2020-09-10 15:23:44 -04:00 |
|
Sam Shleifer
|
e9a2f772bc
|
[s2s] --eval_max_generate_length (#7018)
|
2020-09-10 14:11:34 -04:00 |
|
Sam Shleifer
|
ce37be9d94
|
[s2s] warn if --fp16 for torch 1.6 (#6977)
|
2020-09-06 20:41:29 -04:00 |
|
Sam Shleifer
|
e95d262f25
|
[s2s] support early stopping based on loss, rather than rouge (#6927)
|
2020-09-03 17:31:35 -04:00 |
|
Sam Shleifer
|
207ed8cb78
|
[s2s] use --eval_beams command line arg (#6926)
|
2020-09-03 12:42:09 -04:00 |
|
Sam Shleifer
|
39ed68d597
|
[s2s] allow task_specific_params=summarization_xsum (#6923)
|
2020-09-03 11:11:40 -04:00 |
|
Sam Shleifer
|
b9772897ec
|
[s2s] command line args for faster val steps (#6833)
|
2020-08-31 16:16:10 -04:00 |
|
Sam Shleifer
|
9336086ab5
|
prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654)
* broken test
* batch parity
* tests pass
* boom boom
* boom boom
* split out bart tokenizer tests
* fix tests
* boom boom
* Fixed dataset bug
* Fix marian
* Undo extra
* Get marian working
* Fix t5 tok tests
* Test passing
* Cleanup
* better assert msg
* require torch
* Fix mbart tests
* undo extra decoder_attn_mask change
* Fix import
* pegasus tokenizer can ignore src_lang kwargs
* unused kwarg test cov
* boom boom
* add todo for pegasus issue
* cover one word translation edge case
* Cleanup
* doc
|
2020-08-28 11:15:17 -04:00 |
|
Sam Shleifer
|
0344428f79
|
[s2s] round bleu, rouge to 4 digits (#6704)
|
2020-08-25 00:33:11 -04:00 |
|
Sylvain Gugger
|
a573777901
|
Update repo to isort v5 (#6686)
* Run new isort
* More changes
* Update CI, CONTRIBUTING and benchmarks
|
2020-08-24 11:03:01 -04:00 |
|
Sam Shleifer
|
84c265ffcc
|
[lightning_base] fix s2s logging, only make train_loader once (#6404)
|
2020-08-16 22:49:41 -04:00 |
|
Sam Shleifer
|
2804fff839
|
[s2s] Use prepare_translation_batch for Marian finetuning (#6293)
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2020-08-06 14:58:38 -04:00 |
|
Sam Shleifer
|
dafa296c95
|
[s2s] Delete useless method, log tokens_per_batch (#6081)
|
2020-07-28 11:24:23 -04:00 |
|
Sam Shleifer
|
3c7fbf35a6
|
MBART: support summarization tasks where max_src_len > max_tgt_len (#6003)
* MBART: support summarization tasks
* fix test
* Style
* add tokenizer test
|
2020-07-28 08:18:11 -04:00 |
|
Suraj Patil
|
d1d15d6f2d
|
[examples (seq2seq)] fix preparing decoder_input_ids for T5 (#5994)
|
2020-07-27 10:10:43 -04:00 |
|
Sam Shleifer
|
9dab39feea
|
seq2seq/run_eval.py can take decoder_start_token_id (#5949)
|
2020-07-21 16:58:45 -04:00 |
|
Sam Shleifer
|
5b193b39b0
|
[examples/seq2seq]: add --label_smoothing option (#5919)
|
2020-07-21 16:51:39 -04:00 |
|
Sam Shleifer
|
09a2f40684
|
Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792)
Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>
|
2020-07-18 13:57:33 -04:00 |
|
Nathan Raw
|
529850ae7b
|
Lightning Updates for v0.8.5 (#5798)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
|
2020-07-17 22:43:06 -04:00 |
|
Sam Shleifer
|
353b8f1e7a
|
Add mbart-large-cc25, support translation finetuning (#5129)
improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
|
2020-07-07 13:23:01 -04:00 |
|
Sam Shleifer
|
27a7fe7a8d
|
examples/seq2seq: never override $WANDB_PROJECT (#5407)
|
2020-06-30 15:29:13 -04:00 |
|
Sam Shleifer
|
5543b30aa6
|
[pl_examples] default warmup steps=0 (#5316)
|
2020-06-26 15:03:41 -04:00 |
|
Sam Shleifer
|
40457bcebb
|
examples/seq2seq supports translation (#5202)
|
2020-06-24 23:58:11 -04:00 |
|