transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-14 10:08:29 +06:00

Author	SHA1	Message	Date
Sam Shleifer	6078b12098	[s2s] distill: --normalize_hidden --supervise_forward (#6834)	2020-09-04 14:05:56 -04:00
Sam Shleifer	e95d262f25	[s2s] support early stopping based on loss, rather than rouge (#6927)	2020-09-03 17:31:35 -04:00
Sam Shleifer	39ed68d597	[s2s] allow task_specific_params=summarization_xsum (#6923)	2020-09-03 11:11:40 -04:00
Sam Shleifer	5a318f075a	[s2s]: script to convert pl checkpoints to hf checkpoints (#6911) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-09-03 09:47:00 -04:00
Sam Shleifer	b9772897ec	[s2s] command line args for faster val steps (#6833)	2020-08-31 16:16:10 -04:00
Sam Shleifer	5ab21b072f	[s2s] Test hub configs in self-scheduled CI (#6809)	2020-08-28 17:05:52 -04:00
Sam Shleifer	9336086ab5	prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654) * broken test * batch parity * tests pass * boom boom * boom boom * split out bart tokenizer tests * fix tests * boom boom * Fixed dataset bug * Fix marian * Undo extra * Get marian working * Fix t5 tok tests * Test passing * Cleanup * better assert msg * require torch * Fix mbart tests * undo extra decoder_attn_mask change * Fix import * pegasus tokenizer can ignore src_lang kwargs * unused kwarg test cov * boom boom * add todo for pegasus issue * cover one word translation edge case * Cleanup * doc	2020-08-28 11:15:17 -04:00
Sam Shleifer	fb78a90d6a	PL: --adafactor option (#6776)	2020-08-27 22:19:46 -04:00
Sam Shleifer	4bd7be9a42	s2s distillation uses AutoModelForSeqToSeqLM (#6761)	2020-08-26 23:25:11 -04:00
Sam Shleifer	61518e2df3	[s2s] run_eval.py QOL improvements and cleanup(#6746)	2020-08-26 18:59:20 -04:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Sam Shleifer	f94a52cd79	[s2s] add BartTranslationDistiller for distilling mBART (#6363)	2020-08-12 11:41:04 -04:00
Stas Bekman	87b359439f	[test] replace capsys with the more refined CaptureStderr/CaptureStdout (#6422) * replace capsys with the more refined CaptureStderr/CaptureStdout * Update examples/seq2seq/test_seq2seq_examples.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-12 07:54:28 -04:00
Stas Bekman	0830e79512	the test now works again (#6371)	2020-08-10 02:55:52 -04:00
Stas Bekman	175cd45e13	fix the shuffle agrument usage and the default (#6307)	2020-08-06 20:32:28 -04:00
Sam Shleifer	2804fff839	[s2s]Use prepare_translation_batch for Marian finetuning (#6293) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-06 14:58:38 -04:00
Stas Bekman	376c02e9a9	[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232) * support --lr_scheduler with multiple possibilities * correct the error message * add a note about supported schedulers * cleanup * cleanup2 * needs the argument default * style * add another assert in the test * implement requested changes * cleanups * fix relative import * cleanup	2020-08-05 09:01:17 -04:00
Sylvain Gugger	91cb95461e	Switch from return_tuple to return_dict (#6138) * Switch from return_tuple to return_dict * Fix test * [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614) * Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests * AutoModels Tiny tweaks * Style * Final changes before merge * Re-order for simpler review * Final fixes * Addressing @sgugger's comments * Test MultipleChoice * Rework TF trainer (#6038) * Fully rework training/prediction loops * fix method name * Fix variable name * Fix property name * Fix scope * Fix method name * Fix tuple index * Fix tuple index * Fix indentation * Fix variable name * fix eval before log * Add drop remainder for test dataset * Fix step number + fix logging datetime * fix eval loss value * use global step instead of step + fix logging at step 0 * Fix logging datetime * Fix global_step usage * Fix breaking loop + logging datetime * Fix step in prediction loop * Fix step breaking * Fix train/test loops * Force TF at least 2.2 for the trainer * Use assert_cardinality to facilitate the dataset size computation * Log steps per epoch * Make tfds compliant with TPU * Make tfds compliant with TPU * Use TF dataset enumerate instead of the Python one * revert previous commit * Fix data_dir * Apply style * rebase on master * Address Sylvain's comments * Address Sylvain's and Lysandre comments * Trigger CI * Remove unused import * Switch from return_tuple to return_dict * Fix test * Add recent model Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Plu <plu.julien@gmail.com>	2020-07-30 09:17:00 -04:00
Stas Bekman	3212b8850d	[s2s] add support for overriding config params (#6149)	2020-07-30 01:09:46 -04:00
Sam Shleifer	3c7fbf35a6	MBART: support summarization tasks where max_src_len > max_tgt_len (#6003) * MBART: support summarization tasks * fix test * Style * add tokenizer test	2020-07-28 08:18:11 -04:00
Sam Shleifer	c69ea5efc4	[CI] Don't test apex (#6021)	2020-07-24 15:34:16 -04:00
Sam Shleifer	5b193b39b0	[examples/seq2seq]: add --label_smoothing option (#5919)	2020-07-21 16:51:39 -04:00
Sam Shleifer	f1a4e06f1f	[Fix] seq2seq pack_dataset.py actually packs (#5913) Huge MT speedup!	2020-07-20 15:18:26 -04:00
Sam Shleifer	09a2f40684	Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792) Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>	2020-07-18 13:57:33 -04:00
Nathan Raw	529850ae7b	Lightning Updates for v0.8.5 (#5798) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-07-17 22:43:06 -04:00
Sam Shleifer	283500ff9f	[seq2seq] pack_dataset.py rewrites dataset in max_tokens format (#5819)	2020-07-16 14:06:49 -04:00
Sam Shleifer	353b8f1e7a	Add mbart-large-cc25, support translation finetuning (#5129) improve unittests for finetuning, especially w.r.t testing frozen parameters fix freeze_embeds for T5 add streamlit setup.cfg	2020-07-07 13:23:01 -04:00
Sam Shleifer	13deb95a40	Move tests/utils.py -> transformers/testing_utils.py (#5350)	2020-07-01 10:31:17 -04:00
Sam Shleifer	393b8dc09a	examples/seq2seq/run_eval.py fixes and docs (#5322)	2020-06-26 19:20:43 -04:00
Sam Shleifer	5543b30aa6	[pl_examples] default warmup steps=0 (#5316)	2020-06-26 15:03:41 -04:00
Sam Shleifer	40457bcebb	examples/seq2seq supports translation (#5202)	2020-06-24 23:58:11 -04:00

31 Commits