transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 01:02:25 +06:00

Author	SHA1	Message	Date
Suchin	529534dc2f	BioMed Roberta-Base (AllenAI) (#3643) * added model card * updated README * updated README * updated README * added evals * removed pico eval * Tweaks Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-04-06 16:12:09 -04:00
Lysandre Debut	261c4ff4e2	Update notebooks (#3620) * Update notebooks * From local to global link * from local links to actual global links	2020-04-06 14:32:39 -04:00
Julien Chaumond	39a34cc375	[model_cards] ELECTRA (w/ examples of usage) Co-Authored-By: Kevin Clark <clarkkev@users.noreply.github.com> Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2020-04-06 11:43:33 -04:00
LysandreJik	ea6dba2787	Re-pin isort	2020-04-06 10:09:54 -04:00
LysandreJik	11c3257a18	unpin isort for pypi	2020-04-06 10:06:41 -04:00
LysandreJik	36bffc81b3	Release: v2.8.0	2020-04-06 10:03:53 -04:00
Patrick von Platen	2ee410560e	[Generate, Test] Split generate test function into beam search, no beam search (#3601) * split beam search and no beam search test * fix test * clean generate tests	2020-04-06 10:37:05 +02:00
Patrick von Platen	1789c7daf1	fix argument order (#3637)	2020-04-05 12:33:41 +02:00
Patrick von Platen	b809d2f073	Fix TF T5 docstring (#3636)	2020-04-05 12:23:09 +02:00
Timo Moeller	4ab8ab4f50	Adjust model card to reflect changes to vocabulary (cherry picked from commit `8e25c4bf28`)	2020-04-04 15:27:41 -04:00
ktrapeznikov	ac40eed1a5	Create README.md adding readme for ktrapeznikov/albert-xlarge-v2-squad-v2	2020-04-04 15:18:54 -04:00
ktrapeznikov	fd9995ebc5	Create README.md	2020-04-04 15:18:31 -04:00
Julien Chaumond	5d912e7ed4	Tweak typing for #3566	2020-04-04 15:04:03 -04:00
Julien Chaumond	94eb68d742	weigths*weights	2020-04-04 15:03:26 -04:00
Manuel Romero	243e687be6	Create model card	2020-04-04 08:20:34 -04:00
Julien Chaumond	3e4b4dd190	[model_cards] Link to ExBERT visualisation Hat/tip @bhoov @HendrikStrobelt @sebastianGehrmann Also cc @srush and @thomwolf	2020-04-03 20:03:29 -04:00
Max Ryabinin	c6acd246ec	Speed up GELU computation with torch.jit (#2988) * Compile gelu_new with torchscript * Compile _gelu_python with torchscript * Wrap gelu_new with torch.jit for torch>=1.4	2020-04-03 15:20:21 -04:00
Lysandre Debut	d5d7d88612	ELECTRA (#3257 ) * Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs	2020-04-03 14:10:54 -04:00
Yohei Tamura	8594dd80dd	BertJapaneseTokenizer accept options for mecab (#3566) * BertJapaneseTokenizer accept options for mecab * black * fix mecab_option to Option[str]	2020-04-03 11:12:19 -04:00
HUSEIN ZOLKEPLI	216e167ce6	Added albert-base-bahasa-cased README and fixed tiny-bert-bahasa-cased README (#3613) * add bert bahasa readme * update readme * update readme * added xlnet * added tiny-bert and fix xlnet readme * added albert base	2020-04-03 09:28:43 -04:00
ahotrod	1ac6a246d8	Update README.md (#3604) Update AutoModel & AutoTokernizer loading.	2020-04-03 09:28:25 -04:00
ahotrod	e91692f4a3	Update README.md (#3603)	2020-04-03 09:27:57 -04:00
HenrykBorzymowski	8e287d507d	corrected mistake in polish model cards (#3611) * added model_cards for polish squad models * corrected mistake in polish design cards Co-authored-by: Henryk Borzymowski <henryk.borzymowski@pwc.com>	2020-04-03 09:07:15 -04:00
redewiedergabe	81484b447b	Create README.md (#3568) * Create README.md * added meta block (language: german) * Added additional information about test data	2020-04-02 21:48:31 -04:00
ahotrod	9f6349aba9	Create README.md	2020-04-02 21:43:12 -04:00
Henryk Borzymowski	ddb1ce7418	added model_cards for polish squad models	2020-04-02 21:40:16 -04:00
Patrick von Platen	f68d22850c	delete bogus print statement (#3595)	2020-04-02 21:49:34 +02:00
Nicolas	c50aa67bff	Resizing embedding matrix before sending it to the optimizer. (#3532) * Resizing embedding matrix after sending it to the optimizer prevents from updating the newly resized matrix. * Remove space for style matter	2020-04-02 15:00:05 -04:00
Mark Kockerbeck	1b10159950	Adding should_continue check for retraining (#3509)	2020-04-02 14:07:08 -04:00
Patrick von Platen	390c128592	[Encoder-Decoder] Force models outputs to always have batch_size as their first dim (#3536) * solve conflicts * improve comments	2020-04-02 15:18:33 +02:00
Patrick von Platen	ab5d06a094	[T5, examples] replace heavy t5 models with tiny random models (#3556) * replace heavy t5 models with tiny random models as was done by sshleifer * fix isort	2020-04-02 12:34:05 +02:00
Patrick von Platen	a4ee4da18a	[T5, TF 2.2] change tf t5 argument naming (#3547) * change tf t5 argument naming for TF 2.2 * correct bug in testing	2020-04-01 22:04:20 +02:00
Patrick von Platen	06dd597552	fix bug in warnings T5 pipelines (#3545)	2020-04-01 21:59:12 +02:00
Anirudh Srinivasan	9de9ceb6c5	Correct output shape for Bert NSP models in docs (#3482)	2020-04-01 15:04:38 -04:00
Patrick von Platen	b815edf69f	[T5, Testst] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550) * add some t5 integration tests * finish summarization and translation integration tests for T5 - results loook good * add tf test * fix == vs is bug * fix tf beam search error and make tf t5 tests pass	2020-04-01 18:01:33 +02:00
HUSEIN ZOLKEPLI	8538ce9044	Add tiny-bert-bahasa-cased model card (#3567) * add bert bahasa readme * update readme * update readme * added xlnet * added tiny-bert and fix xlnet readme	2020-04-01 07:15:00 -04:00
Manuel Romero	c1a6252be1	Create model card (#3557) Create model card for: distilbert-multi-finetuned-for-xqua-on-tydiqa	2020-04-01 07:14:23 -04:00
Julien Chaumond	50e15c825c	Tokenizers: Start cleaning examples a little (#3455) * Start cleaning examples * Fixup	2020-04-01 07:13:40 -04:00
Patrick von Platen	b38d552a92	[Generate] Add bad words list argument to the generate function (#3367) * add bad words list * make style * add bad_words_tokens * make style * better naming * make style * fix typo	2020-03-31 18:42:31 +02:00
Patrick von Platen	ae6834e028	[Examples] Clean summarization and translation example testing files for T5 and Bart (#3514) * fix conflicts * add model size argument to summarization * correct wrong import * fix isort * correct imports * other isort make style * make style	2020-03-31 17:54:13 +02:00
Manuel Romero	0373b60c4c	Update README.md (#3552) - Show that the last uploaded version was trained on more data (custom_license files)	2020-03-31 10:40:34 -04:00
Patrick von Platen	83d1fbcff6	[Docs] Add usage examples for translation and summarization (#3538)	2020-03-31 09:36:03 -04:00
Patrick von Platen	55bcae7f25	remove useless and confusing lm_labels line (#3531)	2020-03-31 09:32:25 -04:00
Patrick von Platen	42e1e3c67f	Update usage doc regarding generate fn (#3504)	2020-03-31 09:31:46 -04:00
Patrick von Platen	57b0fab692	Add better explanation to check `docs` locally. (#3459)	2020-03-31 09:30:17 -04:00
Manuel Romero	a8d4dff0a1	Update README.md (#3470) Fix typo	2020-03-31 08:01:09 -04:00
Manuel Romero	4a5663568f	Create card for the model: GPT-2-finetuned-covid-bio-medrxiv (#3453)	2020-03-31 08:01:03 -04:00
Branden Chan	bbedb59675	Create README.md (#3393) * Create README.md * Update README.md	2020-03-31 08:00:35 -04:00
Manuel Romero	c2cf192943	Add link to 16 POS tags model (#3465)	2020-03-31 08:00:00 -04:00
Gabriele Sarti	c82ef72158	Added CovidBERT-NLI model card (#3477)	2020-03-31 07:59:49 -04:00

19383 Commits