transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Ryokan RI	4d1ce39683	Debug LukeForMaskedLM (#17499 ) * add a test for a word only input * make LukeForMaskedLM work without entity inputs * update test * add LukeForMaskedLM to MODEL_FOR_MASKED_LM_MAPPING_NAMES * restore pyproject.toml * empty line at the end of pyproject.toml	2022-06-01 10:03:06 -04:00
Sylvain Gugger	4390151ba2	Fix MP and CPU offload tests for Funnel and GPT-Neo (#17503 )	2022-06-01 09:59:40 -04:00
Sylvain Gugger	6813439fdc	Exclude Databricks from notebook env (#17496 )	2022-06-01 09:00:11 -04:00
Will Frey	3042ea4f6f	Fix `tokenizer` type annotation in `pipeline(...)` (#17500 ) I think you mean to accept either an instance of `PreTrainedTokenizer` or `PreTrainedTokenizerFast` inside of the `pipeline(...)` factory function, if the `tokenizer` argument isn't a `str`.	2022-06-01 08:43:28 -04:00
amyeroberts	bdc01711d6	Refactor classes to inherit from nn.Module instead of nn.Sequential (#17493 ) * Adapt Maskformer, VAN, ResNet and RegNet modules to inherit from nn.Module	2022-06-01 13:36:19 +01:00
nilboy	b1160c0b56	Fix wav2vec2 export onnx model with attention_mask error (#16004 ) * Fix wav2vec2 export onnx model with attention_mask error * fix repository_consistency	2022-06-01 13:30:58 +02:00
Xing Han Lu	d91da4c6df	Add warning when using older version of torch for ViltFeatureExtractor (#16756 ) * Update feature_extraction_vilt.py * apply black * Update imports * Change warning to logging * Use logger instead of logging.logging * make fixup * Move error message * Update src/transformers/models/vilt/feature_extraction_vilt.py Co-authored-by: Xing Han Lu <xhlperso@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-06-01 07:15:38 -04:00
Kyeongpil Kang	24092b1464	Fix typo of variable names for key and query projection layer (#17155 ) self.pos_proj and self.pos_q_proj should be changed to self.pos_key_proj and self.pos_query_proj as same as PyTorch implements.	2022-06-01 11:38:44 +01:00
Jimin Park	811da2b8c2	Fixed wrong error message for missing weight file (#17216 )	2022-06-01 06:24:20 -04:00
Ruihua Fang	4f38808e9e	Add OnnxConfig for SqueezeBert iss17314 (#17315 ) * add onnx config for SqueezeBert * add test for onnx config for SqueezeBert * add automatically updated doc for onnx config for SqueezeBert * Update src/transformers/onnx/features.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update src/transformers/models/squeezebert/configuration_squeezebert.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2022-06-01 06:16:15 -04:00
Patrick von Platen	ba286fe7d5	[GPT2Tokenizer] Fix GPT2 with bos token (#17498 )	2022-05-31 20:06:48 +02:00
Arthur	7822a9b7a7	Opt in flax and tf (#17388 ) * initial commit * add init file * update globakl init * update index and dummy objects * style * update modelling auto * fix initi typo in src/transformers * fix typo in modeling tf auto, opt was in wrong mapping name * fixed a slow test : saved_model * style * fix positionnal embedding if no position id is provided * update tf test * update test flax requirements * fixed serialization * update * update tf name to allow smooth convertion * update flax tests * style * fix test typo * fix tf typo test * add xla for generate support in causal LM * fixed bug * cleaned tf tests * style * removed from PT for slow tests * fix typp * opt test as slow * trying to fix GPT2 undefined * correct documentation and add to test doc * update tf doc * fix doc * fake commit * Apply suggestions from code review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * update test based on review * merged main layer for functionning test * fixup + quality * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * update long comment * make fix copies Co-authored-by: Arthur <arthur@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-31 18:41:22 +02:00
Patrick von Platen	f394a2a50d	[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) (#17457 ) * [Json dump] Make json prettier * correct more tokenizeirs * more patterns * add aggressive test * the aggressive test was actually useful :-) * more tests * Apply suggestions from code review	2022-05-31 17:07:30 +02:00
Vít Novotný	6ee1474b67	Accumulate tokens into batches in `PreTrainedTokenizerBase.add_tokens()` (#17119 ) * Accumulate tokens into batches in PreTrainedTokenizerBase.add_tokens() For tokenizers with a small number of special tokens or special tokens with consecutive token IDs, this reduces the time complexity of creating the trie from quadratic to linear, see also #16936. * Extend explanation of batching added tokens	2022-05-31 16:36:45 +02:00
Patrick von Platen	52e7c92920	Add HF.co for PRs / Issues regarding specific model checkpoints (#17485 ) * Add HF.co for PRs / Issues regarding specific model checkpoints * Update .github/ISSUE_TEMPLATE/config.yml Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co>	2022-05-31 15:58:39 +02:00
Martina Fumanelli	dfc38463b8	Setup for Italian translation and add quicktour.mdx translation (#17472 ) * Setup for Italian translation and add first document - Add 'it' folder for files translated into Italian - Add _config.py and _toctree.yml files - Add translation of quicktour.mdx * Fix style issue of italian documentation files * Add 'it' to the languages section in the .github/workflows * Remove - installation from _toctree for Italian * Translation for index file - Add index to _toctree.yml - Add translation of index.mdx * Fix typo in docs/source/it/index.mdx * Translate code comments in docs/source/it/_config.py Co-authored-by: Martina Fumanelli <martinafumanelli@Martinas-MBP.homenet.telecomitalia.it>	2022-05-31 09:57:43 -04:00
Yih-Dar	8f8b3cbce4	Fix checkpoint name (#17484 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:40:48 +02:00
Yih-Dar	400b30936a	Docker image build in parallel (#17434 ) * docker image build in parallel Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:39:03 +02:00
Ritik Nandwal	5af38953bb	Added XLM onnx config (#17030 ) * Add onnx configuration for xlm * Add supported features for xlm * Add xlm to models exportable with onnx * Add xlm architecture to test file * Modify docs * Make code quality fixes	2022-05-31 09:26:06 -04:00
Sylvain Gugger	567d9c061d	Disk offload fix (#17428 ) * Fix offload to disk for big models * Add test * Fix test for other models	2022-05-31 09:16:18 -04:00
Joao Gante	975dd2bbbc	TF: GPT-2 generation supports left-padding (#17426 ) * TF GPT-2 now properly works with left padding * throw a warning when eos token == pad token and there is no attention mask	2022-05-31 14:06:44 +01:00
Yih-Dar	c1a138613d	Fix ViTMAEModelTester (#17470 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:01:54 +02:00
Patrick von Platen	b0e0ac8a67	[Generate] Fix output scores greedy search (#17442 )	2022-05-31 14:59:49 +02:00
Omar U. Espejel	2ef09ecfb8	Fix nits (#17349 )	2022-05-31 08:41:54 -04:00
Michael Benayoun	28d0048218	Fx support for multiple model architectures (#17393 ) * Support for Bart and LayoutLM, and partial support for XLNet * Support for mbart * A lot of new models supported * Support for other models * LayoutLM fix * Use strings instead of classes	2022-05-31 10:02:55 +02:00
Ivan Gonzalez	04681c1d81	typo IBERT in __repr__ quant_mode (#17398 ) fix #17397	2022-05-31 03:48:10 -04:00
Michele Conti	13fd67346a	Fix typo (remove parenthesis) (#17415 )	2022-05-31 03:21:32 -04:00
Sourab Mangrulkar	d156898f3b	Improve notrainer examples (#17449 ) * improve no-trainer examples * Trigger CI * adding comment to clarify tracker init on main process * Trigger CI * Trigger CI * Trigger CI	2022-05-28 00:06:31 +05:30
Patrick von Platen	7999ec125f	[OPT] Fix bos token id default (#17441 )	2022-05-26 18:24:12 +02:00
Sylvain Gugger	98f6e1ee87	Fix model parallelism test (#17439 )	2022-05-26 09:57:12 -04:00
Sylvain Gugger	7535d92e71	Pin protobouf that breaks TensorBoard in PyTorch (#17440 )	2022-05-26 09:56:55 -04:00
Yhary Arias	2295bcaea8	Spanish translation of the file preprocessing.mdx (#16299 ) * Spanish translation of the file training.mdx * Settings - Spanish translation of the file training.mdx * Latest changes to the Spanish translation of the training.mdx file * Delete Hugging.mdx * Last changes to the training fil Espanish version * Latest modifications * Latest changes, document ready for PR * Nits * Spanish translation of the preprocessing file * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Update docs/source_es/preprocessing.mdx * Nits and add preprocessing to _toctree.yml Co-authored-by: Yhary Arias <yharystefa@gmail.com> Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-05-26 07:28:14 -04:00
Juanjo do Olmo	8f46ac9849	Spanish translation of the files sagemaker.mdx and image_classification.mdx (#17262 ) * Duplication of the source eng file * Spanish translation of the file multilingual.mdx * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Fix nits and finish translation * Spanish translation of sagemaker.mdx * Was deleted in main * Security saving * Complete translation of image_classification.mdx * Nits * nits * Update docs/source/es/image_classification.mdx * Add files to _toctree.yml * Fix toctree and add tasks folder Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-05-25 19:10:16 -04:00
Joaq	5e7f085fcc	Added es version of bertology.mdx doc (#17255 ) * added bertology es doc * toctree fix * Update docs/source/es/bertology.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source/es/bertology.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source/es/bertology.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * change position of bertology in _toctree.yml Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-05-25 18:46:53 -04:00
Jonatas Grosman	70484a8d74	Adding the Portuguese version of the tasks/sequence_classification.mdx documentation (#17352 ) * add sequence_classification pt doc structure * add Portuguese tasks/sequence_classification.mdx	2022-05-25 16:21:27 -04:00
Patrick von Platen	a9eca74372	Wav2vec2 finetuning shared file system (#17423 ) * fix_torch_device_generate_test * remove @ * [Fix shared file system] Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2022-05-25 22:04:43 +02:00
Leandro von Werra	740a1574f1	fix link in performance docs (#17419 )	2022-05-25 20:54:43 +02:00
lewtun	284fc6c0bb	Add link to Hub PR docs in model cards (#17421 )	2022-05-25 20:38:56 +02:00
Cookie_thief	35e2d13f3c	Upd AutoTokenizer.from_pretrained doc examples (#17416 )	2022-05-25 11:35:50 -04:00
Animesh Jain	897a8dd89f	Support compilation via Torchdynamo, AOT Autograd, NVFuser (#17308 ) * Support compilation via Torchdynamo, AOT Autograd, NVFuser * Address comments * Lint * Stas comments - missing quality test * Lintere * Quality test * Doc lint * Reset CUDA peak mem * Add CustomTrainer * require a single gpu Co-authored-by: Stas Bekman <stas@stason.org>	2022-05-25 11:16:09 -04:00
Sylvain Gugger	31484afbed	Add test for new model parallelism features (#17401 )	2022-05-25 10:51:27 -04:00
Sylvain Gugger	56b35ce3eb	Make check_init script more robust and clean inits (#17408 )	2022-05-25 07:23:56 -04:00
Sylvain Gugger	bd908e9bb1	Fix README localizer script (#17407 )	2022-05-25 07:23:40 -04:00
Yih-Dar	4d727bd2df	Fix expected value for OPT test `test_inference_no_head` (#17395 ) * Fix expected value * 5e-5 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-25 11:19:06 +02:00
dependabot[bot]	1ef9a1ed4a	Bump tensorflow in /examples/research_projects/decision_transformer (#17400 ) Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.8.0 to 2.8.1. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v2.8.0...v2.8.1) --- updated-dependencies: - dependency-name: tensorflow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-05-24 19:36:55 -04:00
Jason Phang	71e602725b	[WIP] Adding GPT-NeoX-20B (#16659 ) * initial * first try * working 20B * 20B tokenizers * Docs * Import fixes for missing classes * Update docs, fixup * black formatting * isort * flake * dummy objects * documentation * Documentation yml * more docs * tweaks for tests * tokenization auto * fix neox tests * test * test * einsum * address PR feedback * Documentation * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/gpt_neox/__init__.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/gpt_neox/configuration_gpt_neox.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Remove undefined LaTeX syntax * Update to full url to avoid confusion about if that's supposed to refer to the Hub * fix auto * move tests * documentation fix * more doc fixes * test refactor * fix import * fix import * fix import * fix import * fix import * style fixes * More modeling fixes Co-authored-by: Jason Phang <zp489@gr057.hpc.nyu.edu> Co-authored-by: Stella Biderman <stellabiderman@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-24 09:31:10 -04:00
NielsRogge	374a2f693f	Clean up CLIP tests (#17380 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-24 14:51:26 +02:00
Nicolas Patry	d980929803	Enabling `imageGPT` auto feature extractor. (#16871 ) * Enablign `imageGPT` auto feature extractor. Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> * Small updates. * Update after rebase to use `input_ids` instead of `pixel_values`. Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-24 12:30:46 +02:00
NielsRogge	31ee80d556	Add LayoutLMv3 (#17060 ) * Make forward pass work * More improvements * Remove unused imports * Remove timm dependency * Improve loss calculation of token classifier * Fix most tests * Add docs * Add model integration test * Make all tests pass * Add LayoutLMv3FeatureExtractor * Improve integration test + make fixup * Add example script * Fix style * Add LayoutLMv3Processor * Fix style * Add option to add visual labels * Make more tokenizer tests pass * Fix more tests * Make more tests pass * Fix bug and improve docs * Fix import of processors * Improve docstrings * Fix toctree and improve docs * Fix auto tokenizer * Move tests to model folder * Move tests to model folder * change default behavior add_prefix_space * add prefix space for fast * add_prefix_spcae set to True for Fast * no space before `unique_no_split` token * add test to hightligh special treatment of added tokens * fix `test_batch_encode_dynamic_overflowing` by building a long enough example * fix `test_full_tokenizer` with add_prefix_token * Fix tokenizer integration test * Make the code more readable * Add tests for LayoutLMv3Processor * Fix style * Add model to README and update init * Apply suggestions from code review * Replace asserts by value errors * Add suggestion by @ducviet00 * Add model to doc tests * Simplify script * Improve README * a step ahead to fix * Update pair_input_test * Make all tokenizer tests pass - phew * Make style * Add LayoutLMv3 to CI job * Fix auto mapping * Fix CI job name * Make all processor tests pass * Make tests of LayoutLMv2 and LayoutXLM consistent * Add copied from statements to fast tokenizer * Add copied from statements to slow tokenizer * Remove add_visual_labels attribute * Fix tests * Add link to notebooks * Improve docs of LayoutLMv3Processor * Fix reference to section Co-authored-by: SaulLu <lucilesaul.com@gmail.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-24 09:53:45 +02:00
Sylvain Gugger	13541b4aa2	Add support for `device_map="auto"` to OPT (#17382 )	2022-05-23 15:25:51 -04:00

1 2 3 4 5 ...

9904 Commits