Sam Shleifer
a504cb49ec
[examples] fix summarization do_predict ( #3866 )
2020-04-20 10:49:56 -04:00
Sam Shleifer
f0c96fafd1
[examples] summarization/bart/finetune.py supports t5 ( #3824 )
...
renames `run_bart_sum.py` to `finetune.py`
2020-04-16 15:15:19 -04:00
Sam Shleifer
c59b1e682d
[examples] unit test for run_bart_sum ( #3544 )
...
- adds pytorch-lightning dependency
2020-04-15 18:35:01 -04:00
Patrick von Platen
01c37dcdb5
[Config, Caching] Remove output_past
everywhere and replace by use_cache
argument ( #3734 )
...
* remove output_past from pt
* make style
* add optional input length for gpt2
* add use cache to prepare input
* save memory in gpt2
* correct gpt2 test inputs
* make past input optional for gpt2
* finish use_cache for all models
* make style
* delete modeling_gpt2 change in test file
* correct docstring
* correct is true statements for gpt2
2020-04-14 14:40:28 -04:00
Sam Shleifer
715aa5b135
[Bart] Replace config.output_past with use_cache kwarg ( #3632 )
2020-04-07 19:08:26 -04:00
Sam Shleifer
e344e3d402
[examples] SummarizationDataset cleanup ( #3451 )
2020-04-07 19:05:58 -04:00
Patrick von Platen
ae6834e028
[Examples] Clean summarization and translation example testing files for T5 and Bart ( #3514 )
...
* fix conflicts
* add model size argument to summarization
* correct wrong import
* fix isort
* correct imports
* other isort make style
* make style
2020-03-31 17:54:13 +02:00
Sam Shleifer
8deff3acf2
[bart-tiny-random] Put a 5MB model on S3 to allow faster exampl… ( #3488 )
2020-03-30 12:28:27 -04:00
Sam Shleifer
33ef7002e1
[Docs] examples/summarization/bart: Simplify CNN/DM preprocessi… ( #3516 )
2020-03-29 13:25:42 -04:00
Patrick von Platen
17dceae7a1
Fix circle ci flaky fail of wmt example ( #3485 )
...
* force bleu
* fix wrong file name
* rename file
* different filenames for each example test
* test files should clean up after themselves
* test files should clean up after themselves
* do not force bleu
* correct typo
* fix isort
2020-03-27 13:01:28 -04:00
Andre Carrera
3d76df3a12
BART for summarization training with CNN/DM using pytorch-lightning
2020-03-24 21:00:24 -04:00
Patrick von Platen
95e00d0808
Clean special token init in modeling_....py ( #3264 )
...
* make style
* fix conflicts
2020-03-20 21:41:04 +01:00
Patrick von Platen
e8f44af5bf
[generate] do_sample default back to False ( #3298 )
...
* change do_samples back
* None better default as boolean
* adapt do_sample to True in test example
* make style
2020-03-17 10:52:37 -04:00
Patrick von Platen
4f75d380a4
make style
2020-03-13 16:35:52 +01:00
Patrick von Platen
c2ee3840ae
update file to new starting token logic
2020-03-13 16:34:44 +01:00
Sam Shleifer
2e81b9d8d7
Bart: update example for #3140 compatibility ( #3233 )
...
* Update bart example docs
2020-03-12 10:36:37 -04:00
Patrick von Platen
5b3000d933
renamed min_len to min_length
2020-03-11 11:06:56 +01:00
Sam Shleifer
3aca02efb3
Bart example: model.to(device) ( #3194 )
2020-03-09 15:09:35 -04:00
Sam Shleifer
857e0a0d3b
Rename BartForMaskedLM -> BartForConditionalGeneration ( #3114 )
...
* improved documentation
2020-03-05 17:41:18 -05:00
Sam Shleifer
5b396457e5
Summarization Examples: add Bart CNN Evaluation ( #3082 )
...
* Rename and improve example
* Add test
* slightly faster test
* style
* This breaks remy prolly
* shorter test string
* no slow
* newdir structure
* New tree
* Style
* shorter
* docs
* clean
* Attempt future import
* more import hax
2020-03-03 15:29:59 -05:00