* refactor GPT config to allow dynamic properties
* make attribute_map a class attribute
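Roughly, the idea behind `attribute_map` (a hedged sketch of the mechanism, not the exact `PretrainedConfig` code): a class-level dict maps common attribute names to model-specific ones, and attribute reads/writes are redirected through it, so e.g. `config.hidden_size` resolves to `config.n_embd`.

```python
class SketchConfig:
    # hypothetical example: common attribute name -> model-specific attribute name
    attribute_map = {"hidden_size": "n_embd", "max_position_embeddings": "n_positions"}

    def __init__(self, n_embd: int = 768, n_positions: int = 1024):
        self.n_embd = n_embd
        self.n_positions = n_positions

    def __setattr__(self, key, value):
        # writes to a common name are redirected to the model-specific attribute
        if key in super().__getattribute__("attribute_map"):
            key = super().__getattribute__("attribute_map")[key]
        super().__setattr__(key, value)

    def __getattribute__(self, key):
        # reads of a common name resolve to the model-specific attribute
        if key != "attribute_map" and key in super().__getattribute__("attribute_map"):
            key = super().__getattribute__("attribute_map")[key]
        return super().__getattribute__(key)


cfg = SketchConfig()
assert cfg.hidden_size == 768   # resolves to cfg.n_embd
cfg.hidden_size = 1024          # also writes to cfg.n_embd
assert cfg.n_embd == 1024
```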
* remove old code
* update unit test to test config: Add test for common properties setter
* update unit test to test config: Add test for common properties passed as parameters to __init__
* update to black code format
* Allow setters to be undefined for certain config classes
* update config classes to implement attribute_map
* bugfix LXMERT config: id2label was not defined when num_labels was set
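A sketch of the invariant this fix is about, assuming the common `num_labels` property pattern (the class below is illustrative, not the exact LXMERT change): setting `num_labels` should always (re)build `id2label`/`label2id`.

```python
class ExampleConfig:
    """Hypothetical config illustrating how num_labels keeps id2label/label2id defined."""

    def __init__(self, num_labels: int = 2):
        self.num_labels = num_labels  # goes through the setter below

    @property
    def num_labels(self) -> int:
        return len(self.id2label)

    @num_labels.setter
    def num_labels(self, num_labels: int):
        # (re)build the label maps so id2label is always defined once num_labels is set
        if not hasattr(self, "id2label") or len(self.id2label) != num_labels:
            self.id2label = {i: f"LABEL_{i}" for i in range(num_labels)}
            self.label2id = {label: i for i, label in self.id2label.items()}


cfg = ExampleConfig(num_labels=3)
assert cfg.id2label == {0: "LABEL_0", 1: "LABEL_1", 2: "LABEL_2"}
```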
* update broken configs - add attribute_maps
* update bart config
* update black codestyle
* update documentation on common config attributes
* update GPTJ config to new attribute map
* update docs on common attributes
* gptj config: add max_position_embeddings
* gptj config: format with black
* update speech to text 2 config
* format doc file to max_len 119
* update config template
* [docs] Update perplexity.rst to use negative log likelihood
The model's `forward` returns the negative log-likelihood. The document defines and computes perplexity correctly, but the description and variable names are inconsistent, which might cause confusion.
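A minimal sketch of the renamed quantity (model and text are just examples): `forward(..., labels=...)` returns the mean negative log-likelihood per predicted token, and perplexity is its exponential.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

input_ids = tokenizer("My favourite food is pizza.", return_tensors="pt").input_ids

with torch.no_grad():
    # outputs.loss is the mean negative log-likelihood (cross-entropy) over the predicted tokens
    neg_log_likelihood = model(input_ids, labels=input_ids).loss

ppl = torch.exp(neg_log_likelihood)  # perplexity = exp(mean NLL)
print(ppl)
```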
* [docs] restyle perplexity.rst
* correct order of overflowing_tokens for slow tokenizers (fixes issue #13148)
* Python 3.9 requires sentencepiece 0.1.94 or above
* fixed slicing of ids in truncate_sequences()
* Update setup.py
* Correct order of overflowing tokens for pair of sentences
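A hedged illustration of the intended behavior (checkpoint and text are just examples): with a slow tokenizer, the returned `overflowing_tokens` should simply be the truncated tail of the sequence, in the original reading order.

```python
from transformers import BertTokenizer  # a slow (Python) tokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

text = "This is a fairly long sentence that will not fit into the maximum length."
ids = tokenizer.encode(text, add_special_tokens=False)

enc = tokenizer.encode_plus(
    text,
    max_length=8,
    truncation=True,  # longest_first; a single sequence behaves like only_first
    return_overflowing_tokens=True,
    add_special_tokens=False,
)

# After the fix, the truncated tokens keep their original order: they are just the tail.
assert enc["overflowing_tokens"] == ids[8:]

# Per the commits below, requesting overflowing tokens for a *pair* of sequences with the
# longest_first strategy is not supported and now raises an error instead.
```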
* code reformatted
* Update tokenization_utils_base.py
* reformatting file
* added test to check single_input
* missing function restored
* test to check pair_input overflowing tokens order
* test to check pair_input overflowing tokens order
* test to check pair_input overflowing tokens order
* added an error message for pairs of sequences with the longest_first strategy
* test for pair_input modified
* variable name corrected
* fixed a typo in error message
* requested changes implemented
* required test added
* Corrected the message to match test message
* added error message for LukeTokenizer
* lost test recovered
* docstring for truncate_sequences and prepare_for_model updated
* docstring for luke tokenizer updated
* updated ENCODE_PLUS_ADDITIONAL_KWARGS_DOCSTRING
* aligned text and fixed punctuation
* improved style and quality of code
* fixed error_msg in truncate_sequences
* replaced the encode_plus method with the regular __call__ method
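For reference, both entry points produce the same encoding; a minimal illustration (the checkpoint name is just an example):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# encode_plus(...) and calling the tokenizer directly give the same result;
# the __call__ form is the recommended public API.
enc_old = tokenizer.encode_plus("Hello world", truncation=True, max_length=8)
enc_new = tokenizer("Hello world", truncation=True, max_length=8)
assert enc_old["input_ids"] == enc_new["input_ids"]
```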
* clean up
* rephrased the docstring
* Update clip loss calculation
Hello, I'm the author of the blog you took the snippet from. I think this way of calculating the loss may be slightly more accurate.
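For reference, the symmetric contrastive loss is roughly the following (a sketch of the idea, not necessarily the exact code that landed):

```python
import torch
from torch import nn


def contrastive_loss(logits: torch.Tensor) -> torch.Tensor:
    # cross-entropy against the diagonal: the i-th text matches the i-th image
    return nn.functional.cross_entropy(logits, torch.arange(len(logits), device=logits.device))


def clip_loss(similarity: torch.Tensor) -> torch.Tensor:
    # average the text->image and image->text directions of the same similarity matrix
    caption_loss = contrastive_loss(similarity)
    image_loss = contrastive_loss(similarity.t())
    return (caption_loss + image_loss) / 2.0
```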
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* add test in trainer and test tokenizer saving with trainer
* quality
* reverse trainer changes
* replace test in test_trainer by a test for all the tokenizers
* format
* add can_save_slow_tokenizer attribute to all tokenizers
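A hedged sketch of the pattern (class and file names are illustrative, not verbatim library code): a fast tokenizer reports whether its slow counterpart can be rebuilt, and `save_vocabulary` raises a `ValueError` otherwise.

```python
import os
from shutil import copyfile
from typing import Optional, Tuple


class ExampleSentencePieceTokenizerFast:
    """Hypothetical fast tokenizer whose slow counterpart needs the original sentencepiece file."""

    def __init__(self, vocab_file: Optional[str] = None):
        self.vocab_file = vocab_file

    @property
    def can_save_slow_tokenizer(self) -> bool:
        # the slow tokenizer can only be rebuilt if the original sentencepiece file is still available
        return os.path.isfile(self.vocab_file) if self.vocab_file else False

    def save_vocabulary(self, save_directory: str, filename_prefix: Optional[str] = None) -> Tuple[str]:
        if not self.can_save_slow_tokenizer:
            raise ValueError(
                "Your fast tokenizer does not have the necessary information to save the vocabulary "
                "for a slow tokenizer."
            )
        out_file = os.path.join(save_directory, (filename_prefix + "-" if filename_prefix else "") + "spiece.model")
        copyfile(self.vocab_file, out_file)
        return (out_file,)
```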
* fix Herbert
* format
* Change comment in error
* add comments and a new assert
* Update src/transformers/models/albert/tokenization_albert_fast.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* change ValueError barthez
* change ValueError BigBird
* change ValueError Camembert
* change ValueError Mbart50
* change ValueError Pegasus
* change ValueError ReFormer
* change ValueError T5
* change ValueError RoBERTa
* XLNET fast
* Update tests/test_tokenization_common.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* change `assert` into `self.assertIn`
* format
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix_torch_device_generate_test
* remove @
* up
* correct some bugs
* correct model
* finish speech2text extension
* up
* up
* up
* up
* Update utils/custom_init_isort.py
* up
* up
* update with tokenizer
* correct old tok
* correct old tok
* fix bug
* up
* up
* add more tests
* up
* fix docs
* up
* fix some more tests
* add better config
* correct some more things
"
* fix tests
* improve docs
* Apply suggestions from code review
* Apply suggestions from code review
* final fixes
* finalize
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* apply suggestions from Lysandre and Sylvain
* apply Nico's suggestions
* upload everything
* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: your_github_username <your_github_email>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* added token_type_ids buffer to fix issue #5664
* Handle the case where the position_ids buffer is not registered
* added token_type_ids buffer to fix issue #5664
* modified to support device conversion when the model is traced
* registered a buffer for position_ids to address issues similar to issue #5664
* added comment
* added the flag to prevent from adding the buffer into the state_dict
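A sketch of the pattern these commits describe, using BERT-style embeddings as an illustration (not verbatim library code): register `position_ids` and `token_type_ids` as non-persistent buffers so they follow the module across devices and through tracing without ending up in the `state_dict`, and fall back when the buffer was never registered.

```python
import torch
from torch import nn


class ExampleEmbeddings(nn.Module):
    def __init__(self, max_position_embeddings: int = 512, type_vocab_size: int = 2, hidden_size: int = 768):
        super().__init__()
        self.token_type_embeddings = nn.Embedding(type_vocab_size, hidden_size)
        # persistent=False keeps the buffers out of the state_dict while still moving them with .to(device)
        self.register_buffer("position_ids", torch.arange(max_position_embeddings).expand((1, -1)), persistent=False)
        self.register_buffer("token_type_ids", torch.zeros(self.position_ids.size(), dtype=torch.long), persistent=False)

    def forward(self, input_ids: torch.Tensor, token_type_ids: torch.Tensor = None) -> torch.Tensor:
        seq_length = input_ids.size(1)
        if token_type_ids is None:
            if hasattr(self, "token_type_ids"):
                # the buffered ids are already on the right device, also when the model is traced
                token_type_ids = self.token_type_ids[:, :seq_length].expand(input_ids.size(0), seq_length)
            else:
                # handle the case where the buffer was never registered (e.g. older classes/checkpoints)
                token_type_ids = torch.zeros((input_ids.size(0), seq_length), dtype=torch.long, device=input_ids.device)
        return self.token_type_embeddings(token_type_ids)
```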
* Add the audio classification pipeline
* Remove autoconfig exception
* Mark ffmpeg test as slow
* Rearrange pipeline tests
* Add small test
* Replace asserts with ValueError
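Typical usage of the new pipeline (the checkpoint name is just an example; any audio-classification model should work):

```python
from transformers import pipeline

# "superb/wav2vec2-base-superb-ks" is an example keyword-spotting checkpoint
classifier = pipeline("audio-classification", model="superb/wav2vec2-base-superb-ks")

# accepts a path/URL to an audio file (decoded with ffmpeg) or a raw numpy waveform
predictions = classifier("sample.wav")
print(predictions)  # e.g. [{"score": 0.98, "label": "yes"}, ...]
```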
* Add option to add flax
* Add flax template for __init__.py
* Add flax template for .rst
* Copy TF modeling template
* Add a missing line in modeling_tf_... template
* Update first half of modeling_flax_..
* Update encoder flax template
* Copy test_modeling_tf... as test_modeling_flax...
* Replace some TF to Flax in test_modeling_flax_...
* Replace tf to np
some functions might not work, e.g. _assert_tensors_equal
* Replace remaining tf to np (might not work)
* Fix cookiecutter
* Add Flax in to_replace_... template
* Update transformers-cli add-new-model
* Save generate_flax in configuration.json
This will be read by transformers-cli
* Fix to_replace_... and cli
* Fix replace cli
* Fix cookiecutter name
* Move docstring earlier to avoid a "not defined" error
* Fix a missing Module
* Add encoder-decoder flax template from bart
* Fix flax test
* Make style
* Fix endif
* Fix replace all "utf-8 -> unp-8"
* Update comment
* Fix flax template (add missing ..._DOCSTRING)
* Use flax_bart imports in template (was t5)
* Fix unp
* Update templates/adding_a_new_model/tests
* Revert "Fix unp"
This reverts commit dc9002a41d.
* Remove one "Copied from" line to suppress a CI error
* Use generate_tensorflow_pytorch_and_flax
* Add a missing part
* fix typo
* fix flax config
* add examples for flax
* small rename
* correct modeling imports
* correct auto loading
* corrects some flax tests
* correct small typo
* correct as type
* finish modif
* correct more templates
* final fixes
* add file testers
* up
* make sure tests match template regex
* correct pytorch
* correct tf
* correct more tf
* correct imports
* minor error
* minor error
* correct init
* more fixes
* correct more flax tests
* correct flax test
* more fixes
* correct docs
* update
* fix
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>