transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 17:22:25 +06:00

Author	SHA1	Message	Date
Manuel Romero	5940c73bbb	Create README.md (#4179 ) model card for my De Novo Drug discovery model using MLM	2020-05-08 09:25:36 -04:00
Patrick von Platen	cf08830c28	[Pipeline, Generation] tf generation pipeline bug (#4217 ) * fix PR * move tests to correct place	2020-05-08 08:30:05 -04:00
Jared T Nielsen	8bf7312654	Add AlbertForPreTraining and TFAlbertForPreTraining models. (#4057 ) * Add AlbertForPreTraining and TFAlbertForPreTraining models. * PyTorch conversion * TensorFlow conversion * style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-05-07 19:44:51 -04:00
Julien Chaumond	c99fe0386b	[doc] Fix broken links + remove crazy big notebook	2020-05-07 18:44:18 -04:00
Savaş Yıldırım	66113bd626	Create README.md (#4202 )	2020-05-07 18:31:22 -04:00
Julien Chaumond	6669915b65	[examples] Add column for pytorch-lightning support	2020-05-07 15:26:58 -04:00
Julien Chaumond	612fa1b10b	Examples readme.md (#4215 ) * README * Update README.md	2020-05-07 15:00:06 -04:00
Lysandre	2e57824374	Pin isort and tf <= 2.1.0	2020-05-07 14:42:00 -04:00
Lysandre	e7cfc1a313	Release: v2.9.0	2020-05-07 14:15:20 -04:00
Julien Chaumond	0ae96ff8a7	BIG Reorganize examples (#4213 ) * Created using Colaboratory * [examples] reorganize files * remove run_tpu_glue.py as superseded by TPU support in Trainer * Bugfix: int, not tuple * move files around	2020-05-07 13:48:44 -04:00
Julien Chaumond	cafa6a9e29	[Trainer] Ability to specify optimizer/scheduler at init cc @patrickvonplaten @thomwolf	2020-05-07 11:25:26 -04:00
Bram Vanroy	e4fd5e3999	Use with_extension to change the extension (#4203 ) As per https://github.com/huggingface/transformers/pull/3934#discussion_r421307659	2020-05-07 11:14:56 -04:00
Lysandre Debut	ebf80e2e70	Tpu trainer (#4146 ) * wip * wip * a last wip * Better logging when using TPUs * Correct argument name * Tests * fix * Metrics in evaluation * Update src/transformers/training_args.py * [tpu] Use launcher script instead * [tpu] lots of tweaks * Fix formatting Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-07 10:34:04 -04:00
Funtowicz Morgan	026097b9ee	Ensure fast tokenizer can construct tensor without pad token if only one sample is provided. (#4201 )	2020-05-07 10:02:53 -04:00
Funtowicz Morgan	0a6cbea0a5	Rewritten batch support in pipelines. (#4154 ) * Rewritten batch support in pipelines. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fix imports sorting 🔧 Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Set pad_to_max_length=True by default on Pipeline. * Set pad_to_max_length=False for generation pipelines. Most of generation models doesn't have padding token. * Address @joeddav review comment: Uniformized args. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> Address @joeddav review comment: Uniformized *args (second). Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-05-07 09:52:40 -04:00
Patrick von Platen	99d1a69444	fix examples (#4192 )	2020-05-07 10:54:48 +02:00
Patrick von Platen	74ffc9ea6b	[Reformer] Fix example and error message (#4191 ) * fix example reformer * fix error message and example docstring * improved error message	2020-05-07 10:50:11 +02:00
Patrick von Platen	96c78396ce	fix docstring reformer (#4190 )	2020-05-07 10:28:31 +02:00
Patrick von Platen	dca34695d0	Reformer (#3351 ) * first copy & past commit from Bert and morgans LSH code * add easy way to compare to trax original code * translate most of function * make trax lsh self attention deterministic with numpy seed + copy paste code * add same config * add same config * make layer init work * implemented hash_vectors function for lsh attention * continue reformer translation * hf LSHSelfAttentionLayer gives same output as trax layer * refactor code * refactor code * refactor code * refactor * refactor + add reformer config * delete bogus file * split reformer attention layer into two layers * save intermediate step * save intermediate step * make test work * add complete reformer block layer * finish reformer layer * implement causal and self mask * clean reformer test and refactor code * fix merge conflicts * fix merge conflicts * update init * fix device for GPU * fix chunk length init for tests * include morgans optimization * improve memory a bit * improve comment * factorize num_buckets * better testing parameters * make whole model work * make lm model work * add t5 copy paste tokenizer * add chunking feed forward * clean config * add improved assert statements * make tokenizer work * improve test * correct typo * extend config * add complexer test * add new axial position embeddings * add local block attention layer * clean tests * refactor * better testing * save intermediate progress * clean test file * make shorter input length work for model * allow variable input length * refactor * make forward pass for pretrained model work * add generation possibility * finish dropout and init * make style * refactor * add first version of RevNet Layers * make forward pass work and add convert file * make uploaded model forward pass work * make uploaded model forward pass work * refactor code * add namedtuples and cache buckets * correct head masks * refactor * made reformer more flexible * make style * remove set max length * add attention masks * fix up tests * fix lsh attention mask * make random seed optional for the moment * improve memory in reformer * add tests * make style * make sure masks work correctly * detach gradients * save intermediate * correct backprob through gather * make style * change back num hashes * rename to labels * fix rotation shape * fix detach * update * fix trainer * fix backward dropout * make reformer more flexible * fix conflict * fix * fix * add tests for fixed seed in reformer layer * fix trainer typo * fix typo in activations * add fp16 tests * add fp16 training * support fp16 * correct gradient bug in reformer * add fast gelu * re-add dropout for embedding dropout * better naming * better naming * renaming * finalize test branch * finalize tests * add more tests * finish tests * fix * fix type trainer * fix fp16 tests * fix tests * fix tests * fix tests * fix issue with dropout * fix dropout seeds * correct random seed on gpu * finalize random seed for dropout * finalize random seed for dropout * remove duplicate line * correct half precision bug * make style * refactor * refactor * docstring * remove sinusoidal position encodings for reformer * move chunking to modeling_utils * make style * clean config * make style * fix tests * fix auto tests * pretrained models * fix docstring * update conversion file * Update pretrained_models.rst * fix rst * fix rst * update copyright * fix test path * fix test path * fix small issue in test * include reformer in generation tests * add docs for axial position encoding * finish docs * Update convert_reformer_trax_checkpoint_to_pytorch.py * remove isort * include sams comments * remove wrong comment in utils * correct typos * fix typo * Update reformer.rst * applied morgans optimization * make style * make gpu compatible * remove bogus file * big test refactor * add example for chunking * fix typo * add to README	2020-05-07 10:17:01 +02:00
Clement	877fc56410	change order pytorch/tf in readme (#4167 )	2020-05-06 16:31:07 -04:00
Julien Plu	aad50151f3	TF version of the trainer (#4017 ) * First commit to add a TF version of the trainer. * Make the TF trainer closer to what looks the PT trainer * Refactoring common code between the PT and TF trainer into an util file. * Some bugfix + better similarity with the PT trainer * Add missing class in transformers init * Bugfix over prediction + use classification report instead of simple metrics * Fix name error * Fix optimization tests + style * Apply style * Several bugfix for multi-gpu training * Apply style * Apply style * Add glue example for the TF trainer * Several bugix + address the reviews * Fix on the TF training args file * Add a debug mode * Bugfix in utils_ner.py when segment_ids is None * Apply style * Apply style * Add TPU strategy * Fix selection strategy	2020-05-06 12:56:52 -04:00
Simone Primarosa	25296b12aa	Fix overwrite_cache behaviour for pytorch lightning examples (#4093 )	2020-05-06 12:24:49 -04:00
kumapo	9972562d33	Include ElectraPreTrainedModel into __init__ (#4173 )	2020-05-06 12:00:23 -04:00
martindh	ff8ed52dd8	Camembert-large-fquad model card (#4143 ) Description for the model card describing the camembert-large-fquad model.	2020-05-06 10:41:07 -04:00
Julien Plu	4c3be2e718	Add model card for the NER model (#4162 )	2020-05-06 10:40:55 -04:00
Manuel Romero	17ae0363db	Fix markdown to show the results table properly (#4119 )	2020-05-06 10:38:29 -04:00
Patrick von Platen	a638e986f4	fix hard wired pad token id (#4138 )	2020-05-06 00:42:34 +02:00
Julien Chaumond	fd2174664c	[Trainer] W&B: Enable model watch See https://github.com/huggingface/transformers/pull/3916	2020-05-05 10:59:23 -04:00
Lysandre Debut	79b1c6966b	Pytorch 1.5.0 (#3973 ) * Standard deviation can no longer be set to 0 * Remove torch pinned version * 9th instead of 10th, silly me	2020-05-05 10:23:01 -04:00
Boris Dayma	818463ee8e	Trainer: add logging through Weights & Biases (#3916 ) * feat: add logging through Weights & Biases * feat(wandb): make logging compatible with all scripts * style(trainer.py): fix formatting * [Trainer] Tweak wandb integration Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-04 22:42:27 -04:00
jaymody	858b1d1e5a	allow an already created tensorboard SummaryWriter be passed to Trainer	2020-05-04 19:58:24 -04:00
Patrick von Platen	8e67573a64	[EncoderDecoder Tests] Improve tests (#4046 ) * Hoist bert model tester for patric * indent * make tests work * Update tests/test_modeling_bert.py Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: sshleifer <sshleifer@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-04 02:18:36 +02:00
Lorenzo Ampil	6af3306a1d	Add decoder specific error message for T5Stack.forward (#4128 )	2020-05-03 12:40:08 +02:00
Zhiyu Lin	1cdd2ad2af	Fix #2941 (#4109 ) * Fix of issue #2941 Reshaped score array to avoid `numpy` ValueError. * Update src/transformers/pipelines.py * Update src/transformers/pipelines.py Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-02 11:20:30 -04:00
Manuel Romero	5f4f6b65b3	distilroberta-base-finetuned-sentiment (#4115 ) * Create model card Create Model card for distilroberta-base-finetuned-sentiment * Update model_cards/mrm8488/distilroberta-base-finetuned-sentiment/README.md * Update model_cards/mrm8488/distilroberta-base-finetuned-sentiment/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-02 11:19:31 -04:00
Suraj Parmar	7da051f135	model card for surajp/albert-base-sanskrit (#4114 ) * Create README.md * Update model_cards/surajp/albert-base-sanskrit/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-02 11:15:39 -04:00
Zhen Wang	14911e2e12	Create README.md (#4112 )	2020-05-02 10:52:12 -04:00
HUSEIN ZOLKEPLI	9e97c87539	Added huseinzol05/gpt2-345M-bahasa-cased (#4102 )	2020-05-02 10:51:15 -04:00
William Falcon	4c5bd92183	Update run_pl_glue.py (#4117 )	2020-05-02 10:38:30 -04:00
William Falcon	5282b31df4	Update run_pl_ner.py (#4118 )	2020-05-02 10:38:21 -04:00
Stefan Schweter	1e616c0af3	NER: parse args from .args file or JSON (#4110 ) * ner: parse args from .args file or JSON * examples: mention json-based configuration file support for run_ner script	2020-05-02 10:29:17 -04:00
Patrick von Platen	abb1fa3f37	Update README.md	2020-05-02 10:32:00 +02:00
Patrick von Platen	0ccbfd2868	Update Reformer ReadME	2020-05-02 10:31:00 +02:00
Patrick von Platen	2d8340a91f	[Reformer] Move model card to google model (#4113 ) * correct model card * remove model card from patrick von platen	2020-05-02 10:25:22 +02:00
Julien Chaumond	d713cfc5eb	GePpeTto 🇮🇹: Fixpath to model card	2020-05-01 11:48:58 -04:00
Lorenzo De Mattei	f3d44301cc	GePpeTto model 🇮🇹 (#4099 ) * Create GePpeTto.md * Update model_cards/LorenzoDeMattei/GePpeTto.md * Update model_cards/LorenzoDeMattei/GePpeTto.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-05-01 11:46:42 -04:00
Julien Chaumond	27d55125e6	Configs: saner num_labels in configs. (#3967 )	2020-05-01 11:28:55 -04:00
Stefan Schweter	e80be7f1d0	docs: add xlm-roberta section to multi-lingual section (#4101 )	2020-05-01 11:06:58 -04:00
Sam Shleifer	18db92dd9a	[testing] add timeout_decorator (#3543 )	2020-05-01 09:05:47 -04:00
Julien Chaumond	b8686174be	Merge pull request #3934 from huggingface/examples_args_from_files [qol] example scripts: parse args from .args file or JSON	2020-04-30 22:40:13 -04:00

... 309 310 311 312 313 ...

19383 Commits