transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Stas Bekman	35cb101eae	DataParallel fixes (#5733 ) * DataParallel fixes: 1. switched to a more precise check: `if self.args.n_gpu > 1:` became `if isinstance(model, nn.DataParallel):` 2. fix tests - require the same fixup under DataParallel as the training module * another fix	2020-07-20 09:29:12 -04:00
Pradhy729	290b6e18ac	Trainer support for iterabledataset (#5834 ) * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Cleaner if nesting. * Added test for trainer and iterable dataset * Formatting for test * Fixed import when torch is available only. * Added require torch decorator to helper class * Moved dataset class inside unittest * Removed nested if and changed model in test * Checking torch availability for IterableDataset	2020-07-20 09:07:37 -04:00
Julien Chaumond	82dd96cae7	[model_cards] Dataset ids are case-sensitive cc @lhoestq @thomwolf Also cc'ing model author @nreimers => Model pages now properly link to the dataset pages (and in the future, eval results, etc.)	2020-07-20 12:47:28 +02:00
Manuel Romero	b01a8844a9	Create README.md (#5813 )	2020-07-20 04:06:42 -04:00
Alan deLevie	223bad242d	fix typo in (#5893 )	2020-07-20 03:53:03 -04:00
Alan deLevie	d441f8d29d	fix typo in training_args_tf.py (#5894 )	2020-07-20 03:48:22 -04:00
Sam Shleifer	09a2f40684	Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792 ) Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>	2020-07-18 13:57:33 -04:00
Teven	4b506a37e3	Xlnet outputs (#5883 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overridden and always True.	2020-07-18 17:33:13 +02:00
Teven	a55809241f	Revert "Xlnet outputs (#5881 )" (#5882 ) This reverts commit `13be487212`.	2020-07-18 17:15:40 +02:00
Teven	13be487212	Xlnet outputs (#5881 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overridden and always True.	2020-07-18 16:53:29 +02:00
Sebastian	eae6d8d14f	Update tokenizers to 0.8.1.rc to fix Mac OS X issues (#5867 )	2020-07-18 08:20:11 -04:00
Sam Shleifer	dad5e12e54	[seq2seq] distillation.py accepts trainer arguments (#5865 )	2020-07-18 07:43:57 -04:00
Sam Shleifer	ba2400189b	[seq2seq] MAX_LEN env var for MT commands (#5837 )	2020-07-17 22:51:31 -04:00
Nathan Raw	529850ae7b	Lightning Updates for v0.8.5 (#5798 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-07-17 22:43:06 -04:00
Teven	615be03f9d	Revert "XLNet `use_cache` refactor (#5770 )" (#5854 ) This reverts commit `0b2da0e592`.	2020-07-17 20:33:44 +02:00
Teven	0b2da0e592	XLNet `use_cache` refactor (#5770 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overridden and always True.	2020-07-17 20:24:16 +02:00
Jannes	9750e1300c	Create README.md (#5847 )	2020-07-17 14:03:53 -04:00
Julien Chaumond	1bca4fbd39	[model_card] Fix metadata	2020-07-17 13:55:37 -04:00
Gianpaolo Di Pietro	a9d56a675a	Added model card for neuraly/bert-base-italian-cased-sentiment (#5845 ) * Added model card for neuraly/bert-base-italian-cased-sentiment * Update model_cards/neuraly/bert-base-italian-cased-sentiment/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Gianpy15 <g.dipietro@neuraly.ai> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-17 13:50:49 -04:00
Patrick von Platen	12f14710ce	[Model card] Bert2Bert Add Rouge2 results	2020-07-17 18:22:05 +02:00
Patrick von Platen	9d37c56bab	[Reformer] - Cache hidden states and buckets to speed up inference (#5578 ) * fix merge rebase * add intermediate reformer code * save intermediate caching results * save intermediate * save intermediate results * save intermediate * upload next step * fix generate tests * make tests work * add named tuple output * Apply suggestions from code review * fix use_cache for False case * fix tensor to gpu * fix tensor to gpu * refactor * refactor and make style	2020-07-17 16:17:42 +02:00
Patrick von Platen	0b6c255a95	[Model card] Bert2Bert (#5841 ) * Create README.md * Update README.md * Update README.md * Update README.md	2020-07-17 11:41:56 +02:00
Sam Shleifer	3d9556a72b	[cleanups] make Marian save as Marian (#5830 )	2020-07-17 02:54:25 -04:00
Sam Shleifer	e238e3d55a	[seq2seq] Don't copy self.source in sortishsampler (#5818 )	2020-07-17 01:53:25 -04:00
Bayartsogt Yadamsuren	2e4624b415	language tag addition on albert-mongolian (#5828 ) * language tag addition on albert-mongolian * Update model_cards/bayartsogt/albert-mongolian/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-17 01:40:38 -04:00
Manuel Romero	d088d744ad	Create README.md (#5821 )	2020-07-16 15:18:31 -04:00
Nick Doiron	233072fc1e	dv-wave (#5823 )	2020-07-16 15:13:51 -04:00
Sam Shleifer	283500ff9f	[seq2seq] pack_dataset.py rewrites dataset in max_tokens format (#5819 )	2020-07-16 14:06:49 -04:00
Manuel Romero	c45d7a707d	Update README.md (#5812 ) Fix missing "-" in meta data	2020-07-16 10:25:50 -04:00
Patrick von Platen	057411c56a	fix longformer slow down (#5811 )	2020-07-16 16:19:37 +02:00
Patrick von Platen	89a78be51f	fix benchmark for longformer (#5808 )	2020-07-16 15:15:10 +02:00
Patrick von Platen	aefc0c0429	fix benchmark non standard model (#5801 )	2020-07-16 12:13:10 +02:00
Martin Müller	8ce610bc96	Update README.md (#5789 )	2020-07-16 05:26:17 -04:00
Julien Chaumond	6b6d035d8f	[model_card] illuin/lepetit	2020-07-16 03:50:47 -04:00
HuYong	d1f74b9aff	ADD ERNIE model (#5763 ) * ERNIE model card * Update Readme.md * Update Readme.md * Update Readme.md * Rename Readme.md to README.md * Update README.md * Update Readme.md * Update README.md * Rename Readme.md to README.md * Update Readme.md * Update Readme.md * Rename Readme.md to README.md * Update and rename Readme.md to README.md Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-07-16 11:03:05 +08:00
Clement	3b924fabee	Create distilbert squad tags	2020-07-15 17:59:06 -04:00
Clement	067814102c	fix readme	2020-07-15 17:50:46 -04:00
Clement	d179fd69ca	test readme change	2020-07-15 17:48:22 -04:00
Manuel Romero	63761614eb	Update README.md (#5776 ) Add cherry picked example for the widget Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-15 16:19:21 -04:00
Manuel Romero	221e23c6c1	Create README.md (#5781 ) * Create README.md * Update model_cards/mrm8488/RoBasquERTa/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-15 16:17:25 -04:00
Manuel Romero	d4cda29af1	Create README.md (#5782 ) * Create README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-15 16:17:19 -04:00
Julien Chaumond	62ec28ce4f	[model_cards] Fix pierreguillou/gpt2-small-portuguese	2020-07-15 22:14:52 +02:00
Pierre Guillou	a946724bbf	metadata (#5758 ) * metadata * Update model_cards/pierreguillou/gpt2-small-portuguese/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-15 16:13:28 -04:00
Julien Chaumond	015dc51fe3	[model_card] bert-portuguese: add language meta cc @rodrigonogueira4 @abiocapsouza @robertoalotufo Also cc @piegu Obrigado :)	2020-07-15 21:25:52 +02:00
Sam Shleifer	1a647abf0b	[fix] check code quality (#5772 )	2020-07-15 14:59:38 -04:00
Julien Chaumond	b23d3a5ad4	[model_cards] Switch all languages codes to ISO-639-{1,2,3}	2020-07-15 18:59:20 +02:00
Funtowicz Morgan	d533c7e9b9	[fix] T5 ONNX test: model.to(torch_device) (#5769 ) Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-07-15 10:11:22 -04:00
Sam Shleifer	d0486c8bc2	[cleanup] T5 test, warnings (#5761 )	2020-07-15 08:23:22 -04:00
Patrick von Platen	ec0a945cf9	[AutoModels] Fix config params handling of all PT and TF AutoModels (#5665 ) * fix auto model causal lm * leverage given functionality * apply unused kwargs to all auto models	2020-07-15 09:51:14 +02:00
Julien Chaumond	8ab565a4be	[model_card] Fix syntax	2020-07-14 22:27:07 +02:00

4569 Commits