* fix AutoModel.from_pretrained(..., torch_dtype=...)
* fix to_diff_dict
* add better test
* torch is not always available when a model has self.torch_dtype
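For context, a minimal sketch of the `from_pretrained(..., torch_dtype=...)` usage the fixes above target; the checkpoint name is just an example:

```python
import torch
from transformers import AutoModel

# Load the weights directly in half precision instead of casting after the fact.
model = AutoModel.from_pretrained("bert-base-uncased", torch_dtype=torch.float16)
print(next(model.parameters()).dtype)  # torch.float16
```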
* make Flax GPT2 work with cross attention
* Remove encoder->decoder projection layer
* A draft (incomplete) for FlaxEncoderDecoderModel
* Add the method from_encoder_decoder_pretrained + the docstrings
* Fix the mistakes of using EncoderDecoderModel
* Fix style
* Add FlaxEncoderDecoderModel to the library
* Fix cyclic imports
* Add FlaxEncoderDecoderModel to modeling_flax_auto.py
* Remove question comments
* add tests for FlaxEncoderDecoderModel
* add flax_encoder_decoder to the lists of ignored entries in check_repo.py
* fix missing required positional arguments
* Remove **kwargs when creating FlaxEncoderDecoderModel in from_encoder_decoder_pretrained()
Also fix generation eos/pad tokens issue
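A minimal sketch of the `from_encoder_decoder_pretrained` API added above, assuming a bert2gpt2 pairing as in the tests:

```python
from transformers import FlaxEncoderDecoderModel

# Warm-start an encoder-decoder model from two separate checkpoints;
# the cross-attention weights are newly initialized.
model = FlaxEncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-cased", "gpt2"
)
```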
* Fix: Use sequences from the generated_output
* Change a check from assert to raise ValueError
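The assert-to-ValueError change follows the usual pattern; a hypothetical illustration (the helper name and message are made up):

```python
def require_decoder(config):
    # Hypothetical helper. Before: `assert config.is_decoder, "..."`.
    # An explicit ValueError is not stripped when running under `python -O`.
    if not getattr(config, "is_decoder", False):
        raise ValueError("This model has to be used as a decoder.")
```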
* Fix examples and token ids issues
* Fix missing all_cross_attentions when outputting tuple in modeling_gpt2
* Remove the changes in configuration docstrings.
* allow for bert2gpt2
* make fix-copies
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Change remaining examples to bert2gpt2
* Change the test to Bert2GPT2
* Fix examples
* Fix import
* Fix unpack bug
* Rename to FlaxEncoderDecoderModelTest and change the test to bert2gpt2
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Fix: NotImplentedError -> NotImplementedError
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* up
* finalize
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* add test
* add change in PreTrainedTokenizerBase
* change LUKE
* deactivate
* add the possibility to add additional special tokens for M2M100
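A sketch of the M2M100 capability described above, assuming the standard `additional_special_tokens` argument; the token strings are made up:

```python
from transformers import M2M100Tokenizer

tokenizer = M2M100Tokenizer.from_pretrained(
    "facebook/m2m100_418M",
    additional_special_tokens=["<extra_tok_1>", "<extra_tok_2>"],
)
# The new tokens are treated as atomic units and never split.
print(tokenizer.additional_special_tokens)
```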
* format
* add special test for canine
* proposed changes for mbart
* proposed changes for mbart50
* proposed changes for byt5
* proposed changes for canine
* proposed changes for t5
* test fast and slow
* remove comment
* remove comment
* add fast version for all tests
* replace break by continue
* add more comments
* add check to avoid duplicates
* remove comment
* format
* proposed change for wav2vec2
* reverse changes mbart
* uncomment
* format
* Barrier -> barrier
* added logger for metrics
* removed stream handler in trainer
* moved handler
* removed streamhandler from trainer
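The metrics-logging commits above follow the standard Python logging pattern; a minimal illustrative sketch (the function name and formatting are not the library's exact code):

```python
import logging

logger = logging.getLogger(__name__)

def log_metrics(split, metrics):
    # Emit metrics through the module logger rather than a StreamHandler
    # that the trainer attaches (and now no longer attaches) itself.
    logger.info(f"***** {split} metrics *****")
    for key in sorted(metrics):
        logger.info(f"  {key} = {metrics[key]}")
```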
* updated test image and instance type; added datasets version to the test requirements
* Update tests/sagemaker/scripts/pytorch/requirements.txt
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Fill mask pipelines test updates.
* Put the model in eval mode.
* Adding slow test with actual values.
* Making all tests pass (skipping quite a bit).
* Doc styling.
* Better doc cleanup.
* Making an explicit test with no pad token tokenizer.
* Typo.
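For reference, a minimal fill-mask pipeline call like the ones exercised by the updated tests; the checkpoint is an example, and the mask token varies per model:

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="distilroberta-base")
# RoBERTa-style models use "<mask>"; BERT-style models use "[MASK]".
print(fill_mask("Paris is the <mask> of France."))
```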
* Fix inconsistency of the last element in hidden_states between PyTorch/Flax GPT2(Neo) (#13102)
* Fix missing elements in outputs tuple
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* Fix local variable 'all_hidden_states' referenced before assignment
* Fix by returning tuple containing None values
* Fix quality
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
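A sketch (in plain Python, not the library's exact code) of the fixed collection pattern: record the input to each layer, then append the final hidden state so the last tuple element matches the PyTorch output, and keep `None` placeholders so the returned tuple's shape stays stable:

```python
from typing import Optional, Tuple

def run_layers(hidden, layers, output_hidden_states: bool = False):
    all_hidden_states: Optional[Tuple] = () if output_hidden_states else None
    for layer in layers:
        if output_hidden_states:
            all_hidden_states = all_hidden_states + (hidden,)
        hidden = layer(hidden)
    if output_hidden_states:
        # The missing step: include the final layer's output as the last element.
        all_hidden_states = all_hidden_states + (hidden,)
    # Return None (not an omitted slot) for disabled outputs.
    return (hidden, all_hidden_states)
```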
* Create py.typed
This creates a [py.typed file as per PEP 561](https://www.python.org/dev/peps/pep-0561/#packaging-type-information) that should be distributed to mark that the package includes (inline) type annotations.
* Update setup.py
Include py.typed as package data
* Update setup.py
Call `setup(...)` with `zip_safe=False`.
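Together, the two setup.py commits amount to something like this sketch (package layout simplified):

```python
from setuptools import find_packages, setup

setup(
    name="transformers",
    packages=find_packages("src"),
    package_dir={"": "src"},
    # Ship the PEP 561 marker so type checkers pick up the inline annotations.
    package_data={"transformers": ["py.typed"]},
    # Annotations can't be read from a zipped egg, so disable zip installs.
    zip_safe=False,
)
```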
* conditionally declare `TOKENIZER_MAPPING_NAMES` within an `if TYPE_CHECKING` block so that type checkers don't need to evaluate the RHS of the assignment.
This improves the performance of the Pylance/Pyright type checkers.
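The pattern is roughly the following sketch (the entries shown are illustrative, not the real mapping):

```python
from collections import OrderedDict
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Type checkers see an empty mapping with the right type and never have
    # to evaluate the large literal in the else-branch.
    TOKENIZER_MAPPING_NAMES: OrderedDict = OrderedDict()
else:
    TOKENIZER_MAPPING_NAMES = OrderedDict(
        [("bert", ("BertTokenizer", "BertTokenizerFast"))]  # ...many more entries
    )
```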
* Update src/transformers/models/auto/tokenization_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* adding missing import
* format
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Change FlaxBartForConditionalGeneration.decode() argument: deterministic -> train
* Also change the parameter name to `train` for Flax Marian and mBART
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
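After the rename, a decode call looks roughly like this sketch (checkpoint and inputs are illustrative):

```python
import jax.numpy as jnp
from transformers import FlaxBartForConditionalGeneration

model = FlaxBartForConditionalGeneration.from_pretrained("facebook/bart-base")
input_ids = jnp.ones((1, 8), dtype="i4")
encoder_outputs = model.encode(input_ids=input_ids)
decoder_input_ids = jnp.ones((1, 1), dtype="i4")
# `train=False` disables dropout (previously spelled `deterministic=True`).
outputs = model.decode(decoder_input_ids, encoder_outputs, train=False)
```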