transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Stas Bekman	f0435f5a61	these should run fine on multi-gpu (#8582)	2020-11-17 14:00:41 -05:00
Sylvain Gugger	36a19915ea	Fix model templates (#8595) * First fixes * Fix imports and add init * Fix typo * Move init to final dest * Fix tokenization import * More fixes * Styling	2020-11-17 10:35:38 -05:00
Julien Chaumond	042a6aa777	Tokenizers: ability to load from model subfolder (#8586) * <small>tiny typo</small> * Tokenizers: ability to load from model subfolder * use subfolder for local files as well * Uniformize model shortcut name => model id * from s3 => from huggingface.co Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>	2020-11-17 08:58:45 -05:00
Sylvain Gugger	48395d6b8e	Fix init for MT5 (#8591)	2020-11-17 08:52:13 -05:00
sgugger	a6cf9ca00b	Add __init__ to the models folder	2020-11-17 07:39:37 -05:00
Patrick von Platen	5104223552	[MT5] More docs (#8589) * add docs * make style	2020-11-17 12:47:57 +01:00
Patrick von Platen	86822a358b	T5 & mT5 (#8552) * add mt5 and t5v1_1 model * fix tests * correct some imports * add tf model * finish tf t5 * improve examples * fix copies * clean doc	2020-11-17 12:23:09 +01:00
fajri91	9e01f988dd	model_card for indolem/indobert-base-uncased (#8579)	2020-11-17 03:36:50 -05:00
Sylvain Gugger	c89bdfbe72	Reorganize repo (#8580) * Put models in subfolders * Styling * Fix imports in tests * More fixes in test imports * Sneaky hidden imports * Fix imports in doc files * More sneaky imports * Finish fixing tests * Fix examples * Fix path for copies * More fixes for examples * Fix dummy files * More fixes for example * More model import fixes * Is this why you're unhappy GitHub? * Fix imports in conver command	2020-11-16 21:43:42 -05:00
Julien Plu	901507335f	Fix mixed precision issue for GPT2 (#8572) * Fix mixed precision issue for GPT2 * Forgot one cast * oops * Forgotten casts	2020-11-16 14:44:19 -05:00
Sylvain Gugger	1073a2bde5	Switch `return_dict` to `True` by default. (#8530) * Use the CI to identify failing tests * Remove from all examples and tests * More default switch * Fixes * More test fixes * More fixes * Last fixes hopefully * Use the CI to identify failing tests * Remove from all examples and tests * More default switch * Fixes * More test fixes * More fixes * Last fixes hopefully * Run on the real suite * Fix slow tests	2020-11-16 11:43:00 -05:00
Sylvain Gugger	0d0a0785fd	Update version to v4.0.0-dev (#8568)	2020-11-16 10:21:19 -05:00
LSinev	afb50c663a	Fix GPT2DoubleHeadsModel to work with model.generate() (#6601) * Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used and for GPT2LMHeadModel too * Update tests to check token_type_ids usage in GPT2 models	2020-11-16 14:35:44 +01:00
Yusuke Mori	04d8136bde	Adding the prepare_seq2seq_batch function to ProphetNet (#8515) * Simply insert T5Tokenizer's prepare_seq2seq_batch * Update/Add some 'import' * fix RunTimeError caused by '.view' * Moves .view related error avoidance from seq2seq_trainer to inside prophetnet * Update test_tokenization_prophetnet.py * Format the test code with black * Re-format the test code * Update test_tokenization_prophetnet.py * Add importing require_torch in the test code * Add importing BatchEncoding in the test code * Re-format the test code on Colab	2020-11-16 14:18:25 +01:00
Stas Bekman	931b10978e	[doc] typo fix (#8535) * [doc] typo fix @sgugger * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-11-16 08:05:30 -05:00
Branden Chan	6db21a06ae	Clearer Model Versioning Example (#8562)	2020-11-16 06:59:10 -05:00
Mehrdad Farahani	daaa68451e	Readme for Wiki Summary [Persian] bert2bert (#8558)	2020-11-16 05:04:46 -05:00
Mehrdad Farahani	06d468d3f0	Readme for News Headline Generation (bert2bert) (#8557)	2020-11-16 05:04:38 -05:00
zhezhaoa	9b7fb8a368	Create README.md for Chinese RoBERTa Miniatures (#8550) * Create README.md * Update model_cards/uer/chinese_roberta_L-2_H-128/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-11-16 05:01:28 -05:00
Thomas Wolf	f4e04cd2c6	[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests pipelines - Removing sentencepiece as a required dependency (#8073) * Fixing roberta for slow-fast tests * WIP getting equivalence on pipelines * slow-to-fast equivalence - working on question-answering pipeline * optional FAISS tests * Pipeline Q&A * Move pipeline tests to their own test job again * update tokenizer to add sequence id methods * update to tokenizers 0.9.4 * set sentencepiecce as optional * clean up squad * clean up pipelines to use sequence_ids * style/quality * wording * Switch to use_fast = True by default * update tests for use_fast at True by default * fix rag tokenizer test * removing protobuf from required dependencies * fix NER test for use_fast = True by default * fixing example tests (Q&A examples use slow tokenizers for now) * protobuf in main deps extras["sentencepiece"] and example deps * fix protobug install test * try to fix seq2seq by switching to slow tokenizers for now * Update src/transformers/tokenization_utils_base.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/tokenization_utils_base.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-11-15 22:50:59 +01:00
Julien Plu	24184e73c4	Rework some TF tests (#8492) * Update some tests * Small update * Apply style * Use max_position_embeddings * Create a fake attribute * Create a fake attribute * Update wrong name * Wrong TransfoXL model file * Keep the common tests agnostic	2020-11-13 17:07:17 -05:00
Patrick von Platen	f6cdafdec7	fix load weights (#8528) * fix load weights * delete line	2020-11-13 20:31:40 +01:00
Joe Davison	f6f4da8dd4	Add bart-large-mnli model card (#8527)	2020-11-13 14:07:25 -05:00
Julien Chaumond	725269746b	Model sharing doc: more tweaks (#8520) * More doc tweaks * Update model_sharing.rst * make style * missing newline * Add email tip Co-authored-by: Pierric Cistac <pierric@huggingface.co>	2020-11-13 12:10:26 -05:00
LysandreJik	9d519dabb7	Fix paths in github YAML	2020-11-13 12:04:17 -05:00
Lysandre Debut	826f04576f	Model templates encoder only (#8509) * Model templates * TensorFlow * Remove pooler * CI * Tokenizer + Refactoring * Encoder-Decoder * Let's go testing * Encoder-Decoder in TF * Let's go testing in TF * Documentation * README * Fixes * Better names * Style * Update docs * Choose to skip either TF or PT * Code quality fixes * Add to testing suite * Update file path * Cookiecutter path * Update `transformers` path * Handle rebasing * Remove seq2seq from model templates * Remove s2s config * Apply Sylvain and Patrick comments * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Last fixes from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-11-13 11:59:30 -05:00
Patrick von Platen	42e2d02e44	[T5] Bug correction & Refactor (#8518) * fix bug * T5 refactor * refactor tf * apply sylvains suggestions	2020-11-13 16:57:31 +01:00
Sylvain Gugger	42f63e3871	Merge remote-tracking branch 'origin/master'	2020-11-13 10:30:04 -05:00
Sylvain Gugger	bb03a14edd	Update doc for v3.5.1	2020-11-13 10:29:58 -05:00
Branden Chan	4df6b59318	Update deepset/roberta-base-squad2 model card (#8522) * Update README.md * Update README.md	2020-11-13 09:58:27 -05:00
Sylvain Gugger	0c9bae0934	Remove typo	2020-11-12 22:39:57 -05:00
Julien Plu	5d80539488	Add pretraining loss computation for TF Bert pretraining (#8470) * Add pretraining loss computation for TF Bert pretraining * Fix labels creation * Fix T5 model * restore T5 kwargs * try a generic fix for pretraining models * Apply style * Overide the prepare method for the BERT tests	2020-11-12 14:08:26 -05:00
Julien Plu	91a67b7506	Use LF instead of os.linesep (#8491)	2020-11-12 13:52:40 -05:00
Julien Plu	27b3ff316a	Try to understand and apply Sylvain's comments (#8458)	2020-11-12 13:43:00 -05:00
Forrest Iandola	0fa0349883	fix SqueezeBertForMaskedLM (#8479)	2020-11-12 12:19:37 -05:00
Sylvain Gugger	7933054638	Model sharing doc (#8498) * Model sharing doc * Style	2020-11-12 11:53:23 -05:00
Chengxi Guo	d65e0bfea3	Fix doc bug (#8500) * fix doc bug Signed-off-by: mymusise <mymusise1@gmail.com> * fix example bug Signed-off-by: mymusise <mymusise1@gmail.com>	2020-11-12 11:47:23 -05:00
zeyuyun1	924c624a46	quick fix on concatenating text to support more datasets (#8474)	2020-11-12 09:47:08 -05:00
Antonio Lanza	17b1fd804f	Fix typo in roberta-base-squad2-v2 model card (#8489)	2020-11-12 05:29:37 -05:00
Julien Chaumond	c6c08ebf61	[model_cards] other chars than [\w\-_] not allowed anymore in model names cc @Pierrci	2020-11-12 10:45:29 +01:00
Funtowicz Morgan	121c24efa4	Update deploy-docs dependencies on CI to enable Flax (#8475) * Update deploy-docs dependencies on CI to enable Flax Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added pair of "" Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-11-11 18:31:41 -05:00
Sumithra Bhakthavatsalam	81ebd70671	[s2s] distill t5-large -> t5-small (#8376) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-11-11 17:58:45 -05:00
Funtowicz Morgan	a5b682329c	Flax/Jax documentation (#8331) * First addition of Flax/Jax documentation Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * make style * Ensure input order match between Bert & Roberta Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Install dependencies "all" when building doc Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * wraps build_doc deps with "" Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing @sgugger comments. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Use list to highlight JAX features. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Make style. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Let's not look to much into the future for now. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-11-11 14:53:36 -05:00
Lysandre	c7b6bbec5c	Skip test until investigation	2020-11-11 12:59:40 -05:00
Beomsoo Kim	aa2a2c6579	Replaced some iadd operations on lists with proper list methods. (#8433)	2020-11-11 12:29:57 -05:00
Ratthachat (Jung)	026a2ff225	Add TFDPR (#8203) * Create modeling_tf_dpr.py * Add TFDPR * Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot last commit accidentally deleted these 4 lines, so I recover them back * Add TFDPR * Add TFDPR * clean up some comments, add TF input-style doc string * Add TFDPR * Make return_dict=False as default * Fix return_dict bug (in .from_pretrained) * Add get_input_embeddings() * Create test_modeling_tf_dpr.py The current version is already passed all 27 tests! Please see the test run at : https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing * fix quality * delete init weights * run fix copies * fix repo consis * del config_class, load_tf_weights They shoud be 'pytorch only' * add config_class back after removing it, test failed ... so totally only removing "use_tf_weights = None" on Lysandre suggestion * newline after .. note:: * import tf, np (Necessary for ModelIntegrationTest) * slow_test from_pretrained with from_pt=True At the moment we don't have TF weights (since we don't have official official TF model) Previously, I did not run slow test, so I missed this bug * Add simple TFDPRModelIntegrationTest Note that this is just a test that TF and Pytorch gives approx. the same output. However, I could not test with the official DPR repo's output yet * upload correct tf model * remove position_ids as missing keys Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: patrickvonplaten <patrick@huggingface.co>	2020-11-11 12:28:09 -05:00
sarnoult	a38d1c7c31	Example NER script predicts on tokenized dataset (#8468) The new run_ner.py script tries to run prediction on the input test set `datasets["test"]`, but it should be the tokenized set `tokenized_datasets["test"]`	2020-11-11 10:28:23 -05:00
Julien Plu	069b63844c	Fix next sentence output (#8466)	2020-11-11 15:41:39 +01:00
Julien Plu	da842e4e72	Add next sentence prediction loss computation (#8462) * Add next sentence prediction loss computation * Apply style * Fix tests * Add forgotten import * Add forgotten import * Use a new parameter * Remove kwargs and use positional arguments	2020-11-11 15:02:06 +01:00
Julien Plu	23290836c3	Fix TF Longformer (#8460)	2020-11-11 12:54:15 +01:00


5885 Commits