transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 03:01:07 +06:00

Author	SHA1	Message	Date
Jannes	783a0c7ee9	Create README.md (#5872 )	2020-07-21 13:20:21 -04:00
Jannes	e7844d60c2	Create README.md (#5871 )	2020-07-21 13:19:48 -04:00
tuner007	b1ee69763c	Create README.md (#5864 )	2020-07-21 13:15:07 -04:00
Manuel Romero	5f809e4976	Update README.md (#5857 ) Add nlp dataset used	2020-07-21 13:14:27 -04:00
Manuel Romero	4215f59c99	Update README.md (#5856 ) Add dataset used as it is now part of nlp package	2020-07-21 13:11:08 -04:00
Ali Hamdi Ali Fadel	1d72460d55	Add ComVE model cards (#5884 ) * Add ComVE model cards * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-21 12:54:29 -04:00
Aditya Soni	ccbf74a685	typos in seq2seq/readme (#5937 )	2020-07-21 09:44:59 -04:00
BatJedi	d32279438a	Created model card for my extreme summarization model (#5839 ) * Created model card for my extreme summarization model * Update model_cards/yuvraj/xSumm/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-21 03:54:57 -04:00
BatJedi	abf5c56e9d	Created model card for my summarization model (#5838 ) * Created model card for my summarization model * Update model_cards/yuvraj/summarizer-cnndm/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-21 03:54:14 -04:00
Manuel Romero	d73baeebc5	Create README.md (#5921 ) - Maybe the result of this query answers the question You did some days ago @julien-c ;-)	2020-07-21 03:52:52 -04:00
Manuel Romero	50acfc8717	Create README.md (#5924 )	2020-07-21 03:41:37 -04:00
Manuel Romero	7249533404	Create README.md (#5920 )	2020-07-21 03:31:42 -04:00
Sylvain Gugger	4781afd045	Clarify arg class (#5916 )	2020-07-20 19:47:06 -04:00
Qingqing Cao	8e0bcb56ec	DataParallel fix: multi gpu evaluation (#5926 ) The DataParallel training was fixed in https://github.com/huggingface/transformers/pull/5733, this commit also fixes the evaluation. It's more convenient when the user enables both `do_train` and `do_eval`.	2020-07-20 17:54:08 -04:00
Sylvain Gugger	a20969170b	Add AlbertForPretraining to doc (#5914 )	2020-07-20 17:53:21 -04:00
Sam Shleifer	f1a4e06f1f	[Fix] seq2seq pack_dataset.py actually packs (#5913 ) Huge MT speedup!	2020-07-20 15:18:26 -04:00
Sylvain Gugger	32883b310b	Improve doc of use_cache (#5912 ) * Improve doc of use_cache * Update src/transformers/configuration_xlnet.py Co-authored-by: Teven <teven.lescao@gmail.com> Co-authored-by: Teven <teven.lescao@gmail.com>	2020-07-20 11:50:41 -04:00
Clement	9ccb45a263	Update gpt2-README.md	2020-07-20 11:40:33 -04:00
Clement	f19751117d	Create gpt2-medium-README.md	2020-07-20 10:47:42 -04:00
Clement	511523672b	Create gpt2-large-README.md	2020-07-20 10:47:27 -04:00
Clement	182c611934	Update gpt2-README.md	2020-07-20 10:47:11 -04:00
Clement	a9ae27cd0f	add link to write with transformers to model card	2020-07-20 10:46:10 -04:00
Sam Shleifer	01c40db4f8	[cleanup] squad processor (#5868 )	2020-07-20 10:44:10 -04:00
Stas Bekman	35cb101eae	DataParallel fixes (#5733 ) * DataParallel fixes: 1. switched to a more precise check - if self.args.n_gpu > 1: + if isinstance(model, nn.DataParallel): 2. fix tests - require the same fixup under DataParallel as the training module * another fix	2020-07-20 09:29:12 -04:00
Pradhy729	290b6e18ac	Trainer support for iterabledataset (#5834 ) * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Don't pass sampler for iterable dataset * Added check for test and eval dataloaders. * Formatting * Cleaner if nesting. * Added test for trainer and iterable dataset * Formatting for test * Fixed import when torch is available only. * Added require torch decorator to helper class * Moved dataset class inside unittest * Removed nested if and changed model in test * Checking torch availability for IterableDataset	2020-07-20 09:07:37 -04:00
Julien Chaumond	82dd96cae7	[model_cards] Dataset ids are case-sensitive cc @lhoestq @thomwolf Also cc'ing model author @nreimers => Model pages now properly link to the dataset pages (and in the future, eval results, etc.)	2020-07-20 12:47:28 +02:00
Manuel Romero	b01a8844a9	Create README.md (#5813 )	2020-07-20 04:06:42 -04:00
Alan deLevie	223bad242d	fix typo in (#5893 )	2020-07-20 03:53:03 -04:00
Alan deLevie	d441f8d29d	fix typo in training_args_tf.py (#5894 )	2020-07-20 03:48:22 -04:00
Sam Shleifer	09a2f40684	Seq2SeqDataset uses linecache to save memory by @Pradhy729 (#5792 ) Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>	2020-07-18 13:57:33 -04:00
Teven	4b506a37e3	Xlnet outputs (#5883 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overriden and always True.	2020-07-18 17:33:13 +02:00
Teven	a55809241f	Revert "Xlnet outputs (#5881 )" (#5882 ) This reverts commit `13be487212`.	2020-07-18 17:15:40 +02:00
Teven	13be487212	Xlnet outputs (#5881 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overriden and always True.	2020-07-18 16:53:29 +02:00
Sebastian	eae6d8d14f	Update tokenizers to 0.8.1.rc to fix Mac OS X issues (#5867 )	2020-07-18 08:20:11 -04:00
Sam Shleifer	dad5e12e54	[seq2seq] distillation.py accepts trainer arguments (#5865 )	2020-07-18 07:43:57 -04:00
Sam Shleifer	ba2400189b	[seq2seq] MAX_LEN env var for MT commands (#5837 )	2020-07-17 22:51:31 -04:00
Nathan Raw	529850ae7b	Lightning Updates for v0.8.5 (#5798 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-07-17 22:43:06 -04:00
Teven	615be03f9d	Revert "XLNet `use_cache` refactor (#5770 )" (#5854 ) This reverts commit `0b2da0e592`.	2020-07-17 20:33:44 +02:00
Teven	0b2da0e592	XLNet `use_cache` refactor (#5770 ) Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overriden and always True.	2020-07-17 20:24:16 +02:00
Jannes	9750e1300c	Create README.md (#5847 )	2020-07-17 14:03:53 -04:00
Julien Chaumond	1bca4fbd39	[model_card] Fix metadata	2020-07-17 13:55:37 -04:00
Gianpaolo Di Pietro	a9d56a675a	Added model card for neuraly/bert-base-italian-cased-sentiment (#5845 ) * Added model card for neuraly/bert-base-italian-cased-sentiment * Update model_cards/neuraly/bert-base-italian-cased-sentiment/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Gianpy15 <g.dipietro@neuraly.ai> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-17 13:50:49 -04:00
Patrick von Platen	12f14710ce	[Model card] Bert2Bert Add Rouge2 results	2020-07-17 18:22:05 +02:00
Patrick von Platen	9d37c56bab	[Reformer] - Cache hidden states and buckets to speed up inference (#5578 ) * fix merge rebase * add intermediate reformer code * save intermediate caching results * save intermediate * save intermediate results * save intermediate * upload next step * fix generate tests * make tests work * add named tuple output * Apply suggestions from code review * fix use_cache for False case * fix tensor to gpu * fix tensor to gpu * refactor * refactor and make style	2020-07-17 16:17:42 +02:00
Patrick von Platen	0b6c255a95	[Model card] Bert2Bert (#5841 ) * Create README.md * Update README.md * Update README.md * Update README.md	2020-07-17 11:41:56 +02:00
Sam Shleifer	3d9556a72b	[cleanups] make Marian save as Marian (#5830 )	2020-07-17 02:54:25 -04:00
Sam Shleifer	e238e3d55a	[seq2seq] Don't copy self.source in sortishsampler (#5818 )	2020-07-17 01:53:25 -04:00
Bayartsogt Yadamsuren	2e4624b415	language tag addition on albert-mongolian (#5828 ) * language tag addition on albert-mongolian * Update model_cards/bayartsogt/albert-mongolian/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-17 01:40:38 -04:00
Manuel Romero	d088d744ad	Create README.md (#5821 )	2020-07-16 15:18:31 -04:00
Nick Doiron	233072fc1e	dv-wave (#5823 )	2020-07-16 15:13:51 -04:00

1 2 3 4 5 ...

4592 Commits