transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Julien Chaumond	015f7812ed	[ci skip] Pin isort	2020-05-14 10:12:18 -04:00
Lysandre Debut	ef46ccb05c	TPU needs a rendezvous (#4339)	2020-05-14 08:59:52 -04:00
Viktor Alm	94cb73c2d2	Add image and metadata (#4345) Unfortunately i accidentally orphaned my other PR	2020-05-13 20:05:15 -04:00
Manuel Romero	a0eebdc404	Add link to W&B to see whole training logs (#4348)	2020-05-13 20:04:57 -04:00
Lysandre	7cb203fae4	Release: v2.9.1	2020-05-13 17:38:50 -04:00
Sam Shleifer	9a687ebb77	[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)	2020-05-13 17:29:41 -04:00
Patrick von Platen	839bfaedb2	[Docs, Notebook] Include generation pipeline (#4295) * add first text for generation * add generation pipeline to usage * Created using Colaboratory * correct docstring * finish	2020-05-13 14:24:08 -04:00
Elyes Manai	2d184cb553	wrong variable name used (#4328)	2020-05-13 10:22:03 -04:00
Julien Plu	ca13618681	Question Answering for TF trainer (#4320) * Add QA trainer example for TF * Make data_dir optional * Fix parameter logic * Fix feature convert * Update the READMEs to add the question-answering task * Apply style * Change 'sequence-classification' to 'text-classification' and prefix with 'eval' all the metric names * Apply style * Apply style	2020-05-13 09:22:31 -04:00
Denis	1e51bb717c	Fix for #3865. PretrainedTokenizer mapped " do not" into " don't" when .decode(...) is called. Removed the " do not" --> " don't" mapping from clean_up_tokenization(...). (#4024)	2020-05-13 14:32:57 +02:00
Julien Chaumond	241759101e	(v2) Improvements to the wandb integration (#4324) * Improvements to the wandb integration * small reorg + no global necessary * feat(trainer): log epoch and final metrics * Simplify logging a bit * Fixup * Fix crash when just running eval Co-authored-by: Chris Van Pelt <vanpelt@gmail.com> Co-authored-by: Boris Dayma <boris.dayma@gmail.com>	2020-05-12 21:52:01 -04:00
Funtowicz Morgan	7d7fe4997f	Allow BatchEncoding to be initialized empty. (#4316) * Allow BatchEncoding to be initialized empty. This is required by recent changes introduced in TF 2.2. * Attempt to unpin Tensorflow to 2.2 with the previous commit.	2020-05-12 15:02:46 -04:00
Savaş Yıldırım	0a97f6312a	Update README.md (#4313)	2020-05-12 15:01:45 -04:00
Savaş Yıldırım	15a121fec5	Update README.md (#4315)	2020-05-12 15:01:34 -04:00
Stefan Schweter	15d45211f7	[model_cards]: 🇹🇷 Add new ELECTRA small and base models for Turkish (#4318)	2020-05-12 15:01:17 -04:00
Viktor Alm	8a017cbb5a	Add modelcard with acknowledgements (#4321)	2020-05-12 15:00:56 -04:00
Julien Chaumond	4bf5042240	Fix BART tests on GPU (#4298)	2020-05-12 09:11:50 -04:00
Viktor Alm	e4512aab3b	Add MultipleChoice to TFTrainer [WIP] (#4270) * catch gpu len 1 set to gpu0 * Add mpc to trainer * Add MPC for TF * fix TF automodel for MPC and add Albert * Apply style * Fix import * Note to self: double check * Make shape None, None for datasetgenerator output shapes * Add from_pt bool which doesnt seem to work * Original checkpoint dir * Fix docstrings for automodel * Update readme and apply style * Colab should probably not be from users * Colabs should probably not be from users * Add colab * Update README.md * Update README.md * Cleanup __intit__ * Cleanup flake8 trailing comma * Update src/transformers/training_args_tf.py * Update src/transformers/modeling_tf_auto.py Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-12 08:48:48 -04:00
Levent Serinol	65be574aec	fixed missing torch module import (#4305) fixed missing torch module import in example usage code	2020-05-12 08:34:17 -04:00
Jangwon Park	31e67dd19f	Remove hard-coded pad token id in distilbert and albert (#3965)	2020-05-12 08:32:44 -04:00
Lysandre Debut	30e343862f	pin TF to 2.1 (#4297) * pin TF to 2.1 * Pin flake8 as well	2020-05-11 21:03:30 -04:00
Julien Chaumond	56e8ef632f	[ci] Restrict GPU tests to actual code commits	2020-05-11 20:40:41 -04:00
Julien Chaumond	ba6f6e44a8	[ci] Re-enable torch GPU tests	2020-05-12 00:05:36 +00:00
Lysandre Debut	9524956819	Documentation specification (#4294)	2020-05-11 16:43:57 -04:00
Bram Vanroy	61d22f9cc7	Simplify cache vars and allow for TRANSFORMERS_CACHE env (#4226) * simplify cache vars and allow for TRANSFORMERS_CACHE env As it currently stands, "TRANSFORMERS_CACHE" is not an accepted variable. It seems that the these variables were not updated when moving from version pytorch_transformers to transformers. In addition, the fallback procedure could be improved. and simplified. Pathlib seems redundant here. * Update file_utils.py	2020-05-11 15:24:02 -04:00
Lysandre Debut	cd40cb8879	Fix special token doc (#4292)	2020-05-11 15:05:36 -04:00
Tianlei Wu	82601f4c1a	Allow gpt2 to be exported to valid ONNX (#4244) * allow gpt2 to be exported to valid ONNX model * cast size from int to float explictly	2020-05-11 14:55:55 -04:00
Guo, Quan	39994051e4	Add migrating from `pytorch-transformers` (#4273) "Migrating from pytorch-transformers to transformers" is missing in the main document. It is available in the main `readme` thought. Just move it to the document.	2020-05-11 13:35:13 -04:00
Lysandre Debut	051dcb2a07	CamemBERT does not make use of Token Type IDs (#4289)	2020-05-11 13:31:03 -04:00
fgaim	41e8291217	Add ALBERT to the Tensorflow to Pytorch model conversion cli (#3933) * Add ALBERT to convert command of transformers-cli * Document ALBERT tf to pytorch model conversion	2020-05-11 13:10:00 -04:00
Stefan Schweter	3f42eb979f	Documentation: fix links to NER examples (#4279) * docs: fix link to token classification (NER) example * examples: fix links to NER scripts	2020-05-11 12:48:21 -04:00
Funtowicz Morgan	8fdb7997c6	Align sentiment-analysis' tokenizer (currently uncased) to the model (uncased). (#4264)	2020-05-11 12:45:53 -04:00
Sam Shleifer	4658896ee1	[Marian] Fix typo in docstring (#4284)	2020-05-11 11:47:51 -04:00
Levent Serinol	bf64b8cf09	Model card for bert-turkish-question-answering question-answering model (#4281) * Create README.md * Update model_cards/lserinol/bert-turkish-question-answering/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-11 11:32:25 -04:00
Julien Plu	94b57bf796	[TF 2.2 compat] use tf.VariableAggregation.ONLY_FIRST_REPLICA (#4283) * Fix the issue to properly run the accumulator with TF 2.2 * Apply style * Fix training_args_tf for TF 2.2 * Fix the TF training args when only one GPU is available * Remove the fixed version of TF in setup.py	2020-05-11 11:28:37 -04:00
Savaş Yıldırım	cffbb3d8ed	Update README.md (#4276)	2020-05-11 11:24:41 -04:00
Julien Plu	5f50d619dd	Fix XTREME link + add number of eval documents + fix usage code (#4280)	2020-05-11 11:24:10 -04:00
theblackcat102	7751be7cee	fix reformer apex scaling issue (#4242)	2020-05-11 16:53:42 +02:00
Patrick von Platen	ac7d5f67a2	[Reformer] Add Enwiki8 Reformer Model - Adapt convert script (#4282) * adapt convert script * update convert script * finish * fix marian pretrained docs	2020-05-11 16:38:07 +02:00
Patrick von Platen	336116d960	Reformer enwik8 - Model card (#4286)	2020-05-11 16:22:08 +02:00
flozi00	b290c32e16	[docs] fix typo (#4249)	2020-05-10 14:07:08 -04:00
Sam Shleifer	3487be75ef	[Marian] documentation and AutoModel support (#4152) - MarianSentencepieceTokenizer - > MarianTokenizer - Start using unk token. - add docs page - add better generation params to MarianConfig - more conversion utilities	2020-05-10 13:54:57 -04:00
Girishkumar	9d2f467bfb	[README] Corrected some grammatical mistakes (#4199)	2020-05-10 09:02:36 -04:00
Julien Chaumond	7b75aa9fa5	[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) * [TPU] Doc, fix xla_spawn.py, only preprocess dataset once * Update examples/README.md * [xla_spawn] Add `_mp_fn` to other Trainer scripts * [TPU] Fix: eval dataloader was None	2020-05-08 14:10:05 -04:00
Julien Chaumond	274d850d34	Fix #4098	2020-05-08 12:39:46 -04:00
Lorenzo De Mattei	26dad0a9fa	example updated to use generation pipeline (#4230) * example updated to use generation pipeline * Update model_cards/LorenzoDeMattei/GePpeTto/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-08 09:45:10 -04:00
rmroczkowski	9ebb5b2a54	Model card for allegro/herbert-klej-cased-tokenizer-v1 (#4184)	2020-05-08 09:42:43 -04:00
rmroczkowski	9e54efd004	Model card for allegro/herbert-klej-cased-v1 (#4183)	2020-05-08 09:42:28 -04:00
Manuel Romero	a8b798e6c4	Model card for spanish electra small (#4196)	2020-05-08 09:30:15 -04:00
Savaş Yıldırım	242005d762	Create README.md (#4132) * Create README.md * Adding code fence around code block	2020-05-08 09:27:29 -04:00

19383 Commits