transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Julien Plu	de29ff9bd2	Fix open (#9368 )	2021-01-04 10:22:15 -05:00
Stas Bekman	d018afced0	[trainer] parametrize default output_dir (#9352 ) This PR: * fixes trainer to have the logger agree with the actual default `output_dir`, but setting it one place and passing it as an argument to both places @sgugger	2021-01-04 10:14:32 -05:00
Julien Plu	d735b074d7	Fix Flaubert (#9292 )	2021-01-04 16:06:28 +01:00
dependabot[bot]	5dd389d1c7	Bump notebook from 6.1.4 to 6.1.5 in /examples/research_projects/lxmert (#9402 ) Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5. - [Release notes](https://github.com/jupyter/jupyterhub/releases) - [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md) - [Commits](https://github.com/jupyter/jupyterhub/commits) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-01-04 10:02:07 -05:00
Sylvain Gugger	23a71449c0	Put back LXMert example (#9401 )	2021-01-04 09:59:07 -05:00
Julien Plu	6c03d4ac70	Fix CTRL (#9291 )	2021-01-04 09:56:51 -05:00
Charles	c581d8af7a	Add utility function for retrieving locally cached models (#8836 ) * add get_cached_models function * add List type to import * fix code quality * Update src/transformers/file_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/file_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/file_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/file_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/file_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-01-04 09:53:54 -05:00
Sam Shleifer	8eb7f26d5d	simplify marian distillation script (#9394 )	2021-01-04 11:21:24 +05:30
Yoshitomo Matsubara	d944966b19	Fix typos in README and bugs in RAG example code for end-to-end evaluation and finetuning (#9355 ) * fix a bug in eval_batch_retrieval * should return parser as well as other staticmethod * remove duplicate argument * these kwargs are no longer accepted (cause TypeError in self.generator.generate of modeling_rag.py) * fixed file paths in README * moved an arg to add_ray_specific_args	2021-01-03 16:00:30 +01:00
Chris Kennedy	c4fd609afb	file_utils.py: TF examples outputs.last_hidden_states -> state (#9382 )	2021-01-02 17:58:16 +01:00
Patrick von Platen	b01f451ca3	[Docs] `past_key_values` return a tuple of tuple as a default (#9381 ) * push * make style	2021-01-02 15:55:07 +01:00
Derrick Blakely	5f7a07c0c8	use return dict for rag encoder (#9363 )	2021-01-02 12:39:14 +01:00
Stas Bekman	ae333d04b2	torch.cuda.is_available() is redundant as apex handles that internally (#9350 )	2020-12-30 10:09:51 +01:00
Stas Bekman	8217d4e37f	[prophetnet] wrong import (#9349 ) ``` python -c "from apex.normalization import FusedProphetNetLayerNorm" Traceback (most recent call last): File "<string>", line 1, in <module> ImportError: cannot import name 'FusedProphetNetLayerNorm' from 'apex.normalization' (/home/stas/anaconda3/envs/main-38/lib/python3.8/site-packages/apex/normalization/__init__.py) ``` It looks like this code has never been tested, so it silently fails inside try/except. Discovered this by accident in https://github.com/huggingface/transformers/issues/9338#issuecomment-752217708	2020-12-29 22:32:07 +01:00
Patrick von Platen	912f6881d2	add import math (#9346 )	2020-12-29 19:35:06 +01:00
Patrick von Platen	785e52cd30	improve templates (#9342 )	2020-12-29 16:48:44 +01:00
Julien Plu	64103fb6be	Fix TransfoXL (#9302 )	2020-12-28 20:52:18 +01:00
Julien Plu	d97d06d05f	Fix TF T5 (#9301 ) * Fix T5 * Fix test * Fix test	2020-12-28 20:51:40 +01:00
Patrick von Platen	83fdd252f6	[Seq2Seq Templates] Correct some TF-serving errors and add gradient checkpointing to PT by default. (#9334 ) * correct tests * correct shape and get_tf_activation * more correction tf * add gradient checkpointing to templates * correct typo	2020-12-28 17:51:04 +01:00
Patrick von Platen	8e74eca7f2	push (#9320 )	2020-12-27 21:57:50 +01:00
Patrick von Platen	61443cd7d9	[GPT2] Correct gradient checkpointing (#9308 ) * correct gpt2 * fix gpt2 * fix use_cache ordering * correct past tolerance * fix for all cases * style	2020-12-25 23:28:12 +01:00
Vasudev Gupta	21fc676645	add translation example (#9303 ) * Created using Colaboratory * mbart-training examples add * link add * Update description Co-authored-by: Suraj Patil <surajp815@gmail.com>	2020-12-25 14:47:49 +05:30
Patrick von Platen	52b3a05e83	[Bart doc] Fix outdated statement (#9299 ) * fix bart doc * fix docs	2020-12-24 14:47:53 +01:00
Bram Vanroy	7777db159f	Update tokenization_utils_base.py (#9293 ) Missing "s" typo	2020-12-24 14:43:14 +01:00
Daniele Sartiano	71963a6633	fix typo in modeling_encoder_decoder.py (#9297 ) * Update modeling_encoder_decoder.py Fixed typo. * typo Co-authored-by: Suraj Patil <surajp815@gmail.com>	2020-12-24 14:38:08 +01:00
Ratthachat (Jung)	f3a3b91d6f	Proposed Fix : [RagSequenceForGeneration] generate "without" input_ids (#9220 ) * Create modeling_tf_dpr.py * Add TFDPR * Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot last commit accidentally deleted these 4 lines, so I recover them back * Add TFDPR * Add TFDPR * clean up some comments, add TF input-style doc string * Add TFDPR * Make return_dict=False as default * Fix return_dict bug (in .from_pretrained) * Add get_input_embeddings() * Create test_modeling_tf_dpr.py The current version is already passed all 27 tests! Please see the test run at : https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing * fix quality * delete init weights * run fix copies * fix repo consis * del config_class, load_tf_weights They shoud be 'pytorch only' * add config_class back after removing it, test failed ... so totally only removing "use_tf_weights = None" on Lysandre suggestion * newline after .. note:: * import tf, np (Necessary for ModelIntegrationTest) * slow_test from_pretrained with from_pt=True At the moment we don't have TF weights (since we don't have official official TF model) Previously, I did not run slow test, so I missed this bug * Add simple TFDPRModelIntegrationTest Note that this is just a test that TF and Pytorch gives approx. the same output. However, I could not test with the official DPR repo's output yet * upload correct tf model * remove position_ids as missing keys * fix RagSeq generate with context_input_ids fix RagSeq generate with context_input_ids * apply style * delete unused lines * Add test_rag_sequence_generate_batch_from_context_input_ids * Readability improved * stylying * Stylize * typos * add check_model_generate_from_context_input_ids * make style * Apply suggestions from code review * make style2 Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: patrickvonplaten <patrick@huggingface.co>	2020-12-24 13:38:00 +01:00
Suraj Patil	2a18b70998	enable cache by default (#9296 )	2020-12-24 17:47:36 +05:30
Jungwhan	6189ae9960	Fix typo in file_utils.py (#9289 )	2020-12-24 13:48:33 +05:30
Jethro Kuan	222dbdb203	allow integer device for BatchEncoding (#9271 ) Fixes #9244 Co-authored-by: Jethro Kuan <jethro.kuan@bytedance.com>	2020-12-24 09:01:56 +01:00
Patrick von Platen	6c091abef2	[Templates] Adapt Bert (#9284 ) * adapt templates * adapt config * add test as well * fix output type * fix cache false naming * finish tests * last fix	2020-12-24 01:44:33 +01:00
Suraj Patil	88ef8893cd	Add caching mechanism to BERT, RoBERTa (#9183 ) * add past_key_values * add use_cache option * make mask before cutting ids * adjust position_ids according to past_key_values * flatten past_key_values * fix positional embeds * fix _reorder_cache * set use_cache to false when not decoder, fix attention mask init * add test for caching * add past_key_values for Roberta * fix position embeds * add caching test for roberta * add doc * make style * doc, fix attention mask, test * small fixes * adress patrick's comments * input_ids shouldn't start with pad token * use_cache only when decoder * make consistent with bert * make copies consistent * add use_cache to encoder * add past_key_values to tapas attention * apply suggestions from code review * make coppies consistent * add attn mask in tests * remove copied from longformer * apply suggestions from code review * fix bart test * nit * simplify model outputs * fix doc * fix output ordering	2020-12-23 23:01:32 +05:30
Sylvain Gugger	a1cb6e9866	Adapt to new name of `label_smoothing_factor` training arg (#9282 )	2020-12-23 11:05:21 -05:00
Connor Brinton	bcc87c639f	Minor documentation revisions from copyediting (#9266 ) * typo: Revise "checkout" to "check out" * typo: Change "seemlessly" to "seamlessly" * typo: Close parentheses in "Using the tokenizer" * typo: Add closing parenthesis to supported models aside * docs: Treat ``position_ids`` as plural Alternatively, the word "argument" could be added to make the subject singular. * docs: Remove comma, making subordinate clause * docs: Remove comma separating verb and direct object * docs: Fix typo ("next" -> "text") * docs: Reverse phrase order to simplify sentence * docs: "quicktour" -> "quick tour" * docs: "to throw" -> "from throwing" * docs: Remove disruptive newline in padding/truncation section * docs: "show exemplary" -> "show examples of" * docs: "much harder as" -> "much harder than" * docs: Fix typo "seach" -> "search" * docs: Fix subject-verb disagreement in WordPiece description * docs: Fix style in preprocessing.rst	2020-12-23 10:15:49 -05:00
Patrick von Platen	d5db6c37d4	[Seq2Seq Templates] Fix check_repo.py templates file (#9277 ) * add enc dec pt model to check repo * fix indent	2020-12-23 11:40:20 +01:00
Xu Song	4bafc43b0e	Fix param error (#9273 ) TypeError: forward() got an unexpected keyword argument 'token_type_ids'	2020-12-23 11:34:57 +01:00
Xu Song	58e8a7611f	Fix gpt2 document (#9272 )	2020-12-23 11:34:15 +01:00
Patrick von Platen	cbe63949d7	Model Templates for Seq2Seq (#9251 ) * adapt cookie cutter * fix copy past statement * delete copy statements for now * remove unused import from template * make doc rst * correct config docstring * correct training * correct inputs processing tf enc dec * make style * adapt templates * clean tabs * correct tensor -> Tensor naming * correct indent * correct templates * fix the test * break lines to avoid > 119 * Apply suggestions from code review	2020-12-22 23:41:20 +01:00
Sylvain Gugger	e6c1f1cad8	Revert renaming in finetune_trainer (#9262 )	2020-12-22 15:42:34 -05:00
Sylvain Gugger	ab17758874	Add speed metrics to all example scripts + template (#9260 )	2020-12-22 14:02:26 -05:00
Julien Chaumond	5b5f7dd09c	[hf_api] Fix incorrect typing	2020-12-22 19:52:47 +01:00
Julien Plu	1558d191e6	Fix TF BART for saved model creation (#9252 ) * Fix TF BART for saved model creation * Apply style * Update src/transformers/models/bart/modeling_tf_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/bart/modeling_tf_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Rework the fix * Fix condition * Apply style * Fix condition * Fix shape_list * Apply Patrick's solution * Apply Patrick's solution * Rebase * make tests pass Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2020-12-22 18:07:04 +01:00
Manuel Romero	37d6fb5d04	Fix link to bertabs/README.md (#9255 )	2020-12-22 11:41:23 -05:00
Manuel Romero	189c1b91a6	Fix link to old language modeling script (#9254 )	2020-12-22 11:40:47 -05:00
Sylvain Gugger	490b39e614	Seq2seq trainer (#9241 ) * Add label smoothing in Trainer * Add options for scheduler and Adafactor in Trainer * Put Seq2SeqTrainer in the main lib * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Address review comments and adapt scripts * Documentation * Move test not using script to tests folder Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-12-22 11:33:44 -05:00
Sylvain Gugger	1fc7119181	Fix script that check objects are documented (#9259 )	2020-12-22 11:12:58 -05:00
Patrick von Platen	e9d77ccd5a	[EncoderDecoder] Make tests more aggressive (#9256 ) * add tests * make style and fix bart bug * fix bart past key value edge case * correct tf bart test * fix gpt2 tf * fix t5 test	2020-12-22 17:00:04 +01:00
Sylvain Gugger	ec07da65e2	Update the README of the text classification example (#9237 ) * Update the README of the text classification example * Update examples/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Adapt comment from review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-12-21 15:23:40 -05:00
Teven	4eef5889ac	Adding performer fine-tuning research exampke (#9239 ) * added run_mlm_performer.py research example * make styke * make styke * Added a README !	2020-12-21 21:19:41 +01:00
Patrick von Platen	9a12b9696f	[MPNet] Add slow to fast tokenizer converter (#9233 ) * add converter * delet unnecessary comments	2020-12-21 15:41:34 +01:00
Suraj Patil	f4432b7e01	add base model classes to bart subclassed models (#9230 ) * add base model classes to bart subclassed models * add doc	2020-12-21 19:56:46 +05:30

1 2 3 4 5 ...

6216 Commits