transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 10:12:23 +06:00

Author	SHA1	Message	Date
Steven Liu	0a75717602	Fix task guide formatting (#21409) fix formatting	2023-02-02 10:06:26 -08:00
Yih-Dar	a6d8a149a8	Fix some pipeline tests (#21401) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-02 19:03:31 +01:00
Yih-Dar	145bf41c13	Allow to add more information in `is_flaky` (#21426) * Allow to add more information * fix style --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-02 17:41:22 +01:00
Younes Belkada	8298e4ec02	[`bnb`] Fine-tuning HF 8-bit models (#21290) * force `memory_efficient_backward=True` * enhancements - trainer support - add new flag * some changes - internal changes in `Trainer` - small refactor * make quality * Fixes - add new testing util - add new test - change test in Trainer * fix CI test * educate users on how to ft 8bit models * more checks * fix `logger` error * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * adapt from review * fix * add comment * use return instead --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-02 16:39:23 +01:00
Clémentine Fourrier	67a3920d85	Fix Graphormer test suite (#21419) * [FIX] path for Graphormer checkpoint * [FIX] Test suite for graphormer * [FIX] Update graphormer default num_classes	2023-02-02 16:29:13 +01:00
Joel Lamy-Poirier	e006ab51ac	Add the GeLU activation from pytorch with the tanh approximation (#21345) * gelu_python_tanh * rename * Version check, add test * Pr comment	2023-02-02 09:33:04 -05:00
Matt	53d374f1b9	Add distinct section names for PyTorch and TF (#21422) * Add distinct section names for PyTorch and TF * Remove extra space	2023-02-02 14:29:58 +00:00
Shikhar Tuli	0ae8dc0adf	Fix image_processor_class bug (#21410) Co-authored-by: Shreshth Tuli <shreshthtuli@gmail.com>	2023-02-02 09:20:52 -05:00
Yih-Dar	db572b3854	Use torch `1.13.1` in push/schedule CI (#21421) Use torch 1.13.1 in push/scheduled CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-02-02 14:58:52 +01:00
Joao Gante	92ce53aab8	Generate: decoder-only models can generate with `inputs_embeds` (#21405)	2023-02-01 21:50:38 +00:00
amyeroberts	e5db7051a8	Add TF image classification example script (#19956) * TF image classification script * Update requirements * Fix up * Add tests * Update test fetcher Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix directory path * Adding `zero-shot-object-detection` pipeline doctest. (#20274) * Adding `zero-shot-object-detection` pipeline doctest. * Remove nested_simplify. * Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs * Trigger CI * Data collator returns np * Update feature extractor -> image processor * Bug fixes - updates to reflect changes in API * Update flags to match PT & run faster * Update instructions - Maria's comment * Update examples/tensorflow/image-classification/README.md * Remove slow decorator --------- Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: bofeng huang <bofenghuang7@gmail.com> Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>	2023-02-01 19:09:36 +00:00
Jinen Setpal	3fadb4b211	Added DagshubCallback (#21404) * integrated logger * bugifx * added data * bugfix * model + state artifacts should log * fixed paths * i lied, trying again * updated function call * typo this is painful :( what a stupid error * typo this is painful :( what a stupid error * pivoted to adding a directory * silly path bug * multiple experiments * migrated to getattr * syntax fix * syntax fix * fixed repo pointer * fixed path error * added dataset if dataloader is present, uploaded artifacts * variable in scope * removed unnecessary line * updated error type Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * trimmed unused variables, imports * style formatting * removed type conversion reliance Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * reverted accidental line deletion --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-01 13:51:46 -05:00
Sylvain Gugger	8d580779a3	Skip batches fast with accelerate (#21390) * Skip batches fast with Accelerate * remove debug statement * Hack seed reload at the right time * Reorganize RNG sync * Fix accelerate version comp	2023-02-01 10:22:05 -05:00
raghavanone	77db257e2a	Fix the issue of using only inputs_embeds in convbert model (#21398) * Fix the input embeds issue with tests * Fix black and isort issue * Clean up tests * Add slow tag to the test introduced * Incorporate PR feedbacks	2023-02-01 09:47:25 -05:00
Maria Khalusova	65b5035a1d	Moved LiLT under multimodal models in TOC (#21393) moved LiLT under multimodal models	2023-02-01 08:03:00 -05:00
Patrick von Platen	90cddfa824	Add variant to transformers (#21332) * Bump onnx in /examples/research_projects/decision_transformer Bumps [onnx](https://github.com/onnx/onnx) from 1.11.0 to 1.13.0. - [Release notes](https://github.com/onnx/onnx/releases) - [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md) - [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0) --- updated-dependencies: - dependency-name: onnx dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> * adapt * finish * Update examples/research_projects/decision_transformer/requirements.txt * up * add tests * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * fix test --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-02-01 09:21:52 +01:00
Yih-Dar	bc44e947f3	Update `Graphormer` and fix its `torchscript` test failures (#21380) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-31 17:32:25 +01:00
Joao Gante	19d67bfecb	Generate: fix TF XLA tests on models with `max_position_embeddings` or `max_target_positions` (#21389)	2023-01-31 15:49:34 +00:00
Yih-Dar	6342427353	Remove more unused attributes in config classes (#21327) * remove unused classifier_dropout * remove unused dropout * remove unused pooler_fn * remove unnecessary is_encoder_decoder * remove unnecessary drop_rate * remove unused classifier_dropout * remove unused classifier_dropout * remove unused dropout * remove unused dropout * remove unused summary_* attributes * remove unused tie_word_embeddings * remove unused summary_* attributes * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-31 16:35:38 +01:00
raghavanone	da2a4d95a2	Add support of backward_prefetch and forward_prefetch (#21237) * Add support of backward_prefetch and forward_prefetch * Fix format issue * Fix isort issue * Fix doc style issue * Update src/transformers/trainer.py Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> * Update src/transformers/training_args.py Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> * Update src/transformers/training_args.py Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> * Update src/transformers/training_args.py Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> * Fix black issue * Fix doc-style issue * Make additional fsdp parameters into fsdp config * Fix black issue * Remove unused imports * Fix doc style issues * Incorporate PR feedbacks * Remove unused imports * Fix tests * Fix tests * Fix tests * Fix tests * Fix tests * Update src/transformers/training_args.py Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com> * Fix tests * Incorporate PR feedbacks * Incorporate PR feedbacks * Fix black issues --------- Co-authored-by: Sourab Mangrulkar <13534540+pacman100@users.noreply.github.com>	2023-01-31 09:51:35 -05:00
Quentin Lhoest	074d6b75fd	Simplify column_names in run_clm/mlm (#21382) * simplify column_names in run_clm * simplify column_names in run_mlm * minor	2023-01-31 15:23:47 +01:00
NielsRogge	c21298a69b	[Docs] Minor fixes (#21383) * Improve docs * Add DETA resources --------- Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2023-01-31 15:13:12 +01:00
regisss	d31497b196	Do not log the generation config for each prediction step in TrainerSeq2Seq (#21385) Do not log the generation config for each iteration	2023-01-31 09:05:22 -05:00
Yih-Dar	98d40fed3a	Cleanup the usage of `layer_norm_eps` in some models (#21336) * fix * fix * make style * For CLIP * For OwlViT * For XCLIP * For CLIPSeg * For GroupViT * fix docstrings * fix docstrings * For AltCLIP * For ChineseCLIP * For Blip * For GiT * make style * update * update * update * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-31 13:54:16 +01:00
Joao Gante	623346ab18	Template for framework-agnostic tests (#21348)	2023-01-31 11:33:18 +00:00
NielsRogge	5451f8896c	Add DETA (#20983) * First draft * Add initial draft of conversion script * Convert all weights * Fix config * Add image processor * Fix DetaImageProcessor * Run make fix copies * Remove timm dependency * Fix dummy objects * Improve loss function * Remove conv_encoder attribute * Update conversion scripts * Improve postprocessing + docs * Fix copied from statements * Add tests * Improve postprocessing * Improve postprocessing * Update READMEs * More improvements * Fix rebase * Add is_torchvision_available * Add torchvision dependency * Fix typo and README * Fix bug * Add copied from * Fix style * Apply suggestions * Fix thanks to @ydshieh * Fix another dependency check * Simplify image processor * Add scipy * Improve code * Add threshold argument * Fix bug * Set default threshold * Improve integration test * Add another integration test * Update setup.py * Address review * Improve deformable attention function * Improve copied from * Use relative imports * Address review * Replace assertions * Address review * Update dummies * Remove dummies * Address comments, update READMEs * Remove custom kernel code * Add image processor tests * Add requires_backends * Add minor comment * Update scripts * Update organization name * Fix defaults, add doc tests * Add id2label for object 365 * Fix tests * Update task guide	2023-01-31 10:43:10 +01:00
Stas Bekman	98d88b23f5	[`run_(clm|mlm).py` examples] add streaming dataset support (#21343) * [run_clm example] add streaming dataset support * unrefactor kwargs * fix * fix * require datasets>=2.0.0 * port to mlm	2023-01-30 14:01:35 -08:00
BFSS	95be242adc	translate index to zh(#20095) (#21351) translate index to zh Co-authored-by: bfss <bfss@bfss.com>	2023-01-30 16:50:57 -05:00
Adit Krishnan	914e5009fa	Adding resource section to GPT-J docs (#21270) * Added resource section to GPT-J docs * Added most of the links found * Addressing review comments * Fixing formatting * Update docs/source/en/model_doc/gptj.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Fixing one of the labels --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-01-30 16:48:04 -05:00
Clémentine Fourrier	14d989a91d	Fixes path for Graphormer checkpoint (#21367) [FIX] path for Graphormer checkpoint	2023-01-30 21:48:04 +01:00
Joao Gante	42b60f8b02	Generate: Relaxed `max_length` and `max_new_tokens` coexistence (#21347) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-30 17:53:54 +00:00
Sylvain Gugger	6eb3c66a96	Add cPython files in build (#21372)	2023-01-30 11:19:30 -05:00
amyeroberts	59611a0f3a	Fix DETR tests after #21144 (#21365) * Fix annotation check * Fix annotation check * Update type annotations	2023-01-30 15:55:00 +00:00
Yichao 'Peak' Ji	7a2e13204f	Remove duplicate declarations in dummy inputs for TFLongformer (#21352) Remove duplicate declarations	2023-01-30 10:03:19 -05:00
简律纯	96addecff8	Corrected (#21350)	2023-01-30 09:38:15 -05:00
Wang, Yi	f3a7befffa	fix the issue that the output dict of jit model could not get [0] (#21354)	2023-01-30 09:23:55 -05:00
Yih-Dar	c749bd405e	Pipeline testing - using tiny models on Hub (#20426) * rework pipeline tests * run pipeline tests * fix * fix * fix * revert the changes in get_test_pipeline() parameter list * fix expected error message * skip a test * clean up --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-30 10:39:43 +01:00
Yih-Dar	a582cfce3c	Fix `GitModelIntegrationTest.test_batched_generation` device issue (#21362) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-30 10:37:56 +01:00
Maria Khalusova	73a2ff6974	Automated compatible models list for task guides (#21338) * initial commit. added tip placeholders and a script * removed unused imports, fixed paths * fixed generated links * make style * split language modeling doc into two: causal language modeling and masked language modeling * added check_task_guides.py to make fix-copies * review feedback addressed	2023-01-27 13:19:28 -05:00
Lucain	8f3b4a1d5b	Little cleanup: let huggingface_hub manage token retrieval (#21333) * Let huggingface_hub manage token retrieval * flake8 * code quality * adapt in every PushToHubMixin children * add explicit return type	2023-01-27 12:09:49 -05:00
Arthur	0dff407d71	[Whisper] another patch (#21324) * another patch * fix timestamp test modeling * let it be negative when the token is None	2023-01-27 16:35:16 +01:00
Yih-Dar	e5eb3e22ea	Fix `RobertaPreLayerNorm` doctest (#21337) * add mask="<mask>" * update * update * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-27 16:20:25 +01:00
dependabot[bot]	36b668fa06	Bump onnx from 1.11.0 to 1.13.0 in /examples/research_projects/decision_transformer (#21331) Bump onnx in /examples/research_projects/decision_transformer Bumps [onnx](https://github.com/onnx/onnx) from 1.11.0 to 1.13.0. - [Release notes](https://github.com/onnx/onnx/releases) - [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md) - [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0) --- updated-dependencies: - dependency-name: onnx dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-01-27 10:13:13 -05:00
Michael Benayoun	938f437c53	Fix M2M100 positional embedding creation for ONNX (#21328) * Fix M2M100 positional embedding creation for ONNX * Restore READMEs * Trigger CI	2023-01-27 10:43:19 +01:00
altryne	7d2a5fa749	Update Hebrew language code to he per IANA registry (#21310) Here's my original PR into whisper that changes the same: https://github.com/openai/whisper/pull/401 Per [IANA registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry), `iw` was deprecated as the code for Hebrew in 1989 and the preferred code is `he` The correct subtag: ``` %% Type: language Subtag: he Description: Hebrew Added: 2005-10-16 Suppress-Script: Hebr %% ``` And the deprecation ``` %% Type: language Subtag: iw Description: Hebrew Added: 2005-10-16 Deprecated: 1989-01-01 Preferred-Value: he Suppress-Script: Hebr %% ```	2023-01-26 13:34:39 -05:00
Younes Belkada	b225ee6ea0	[Doctest] Fix `Perceiver` doctest (#21318) fix `Perceiver` doctest	2023-01-26 17:16:37 +01:00
Joao Gante	2b8feffad5	Generate: better `compute_transition_scores` examples (#21323)	2023-01-26 16:06:05 +00:00
Yih-Dar	449df41f01	Fix `TFEncoderDecoder` tests (#21301) remove max_length=None Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 16:56:42 +01:00
Yih-Dar	857bad6e53	check paths in `utils/documentation_tests.txt` (#21315) * check paths in utils/documentation_tests.txt * check paths in utils/documentation_tests.txt Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 15:33:47 +01:00
Nicolas Patry	fd0ef8b66d	Small QoL for qa. (#21316)	2023-01-26 14:50:09 +01:00

11929 Commits