transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-16 11:08:23 +06:00

Author	SHA1	Message	Date
Ahmed Moubtahij	39b2ff69d6	Token healing (#30081 ) * token healing impl + trie with extensions * make fixup * prefix-robust space tokenization * examples readme and requirements * make fixup * allow input prompt and model * redundant defaults * Specialized Trie * make fixup * updated tests with new inherited Tree * input ids to auto device_map * rm unused import * Update src/transformers/generation/utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * naming convention * Revert "naming convention" This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0. * naming convention * last -hopefully- changes --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-06-03 10:53:15 +02:00
Arthur	96eb06286b	Diff converter v2 (#30868 ) * current working example! * commit regex and result file * update * nit * push the conversion file * oups * roadmap and nits * attempt diffs for 3 files * persimmon * nit * add diff file that is the same as the modeling_llama.py * fix rope nits * updates * updates with converted versions * give some breathing space to the code * delete * update * update * push the actual result * update regex patterns * update regex patterns * fix some issues * fix some issues * fix some issues * updates * updates * updates * updates * updates * revert changes done to llama * updates * update gemma * updates * oups * current state * current state * update * ouiiii * nit * clear diffs * nit * fixup * update * doc 🚀 * 🔥 * for now use gemma * deal with comments * style * handle funtions * deal with assigns * todos * process inheritage * keep decorators? * 🤗 * deal with duplicates * fixup * correctly remove duplicate code * run ruff post script * ruff deals pretty well with imports, let's leave it to him * ah maybe not lol * for now remove all imports from child. * nit * conversion of llama * okay * convert starcoder2 * synch with main * update llama diff * updates * https://docs.astral.sh/ruff/rules/redefined-while-unused/ fixes the imports, bit needs later version of ruff * updates * okay actual state * non zero exit * update! * revert unrelated * remove other diff files * updates * cleanup * update * less diff! * stash * current updates * updates * No need for call * finished fining deps * update * current changes * current state * current state * new status * nit * finally * fixes * nits * order is now expected * use logger info instead of prints * fixup * up * nit * update * nits * update * correct merge * update * update * update * add warning * update caution message * update * better merging strategy * copy class statements :wink * fixups * nits * update * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * nits * smaller header * do cleanup some stuff * even simpler header? * fixup * updates * ruff * update examples * nit * TODO * state * OUUUUUUF * current state * nits * final state * add a readme * fixup * remove diff llama * fix * nit * dummy noy funny * ruff format tests src utils --check * everless diffs * less diffs and fix test * fixes * naming nit? * update converter and add supper example * nits * updated for function signatures * update * update * add converted dummies * autoformat * single target assign fix * fixup * fix some imports * fixes * don't push them * `# noqa: F841` --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-31 18:37:43 +02:00
Pavel Iakubovskii	cdc813113a	Instance segmentation examples (#31084 ) * Initial setup * Metrics * Overfit on two batches * Train 40 epochs * Memory leak debugging * Trainer fine-tuning * Draft * Fixup * Trained end-to-end * Add requirements * Rewrite evaluator * nits * Add readme * Add instance-segmentation to the table * Support void masks * Remove sh * Update docs * Add pytorch test * Add accelerate test * Update examples/pytorch/instance-segmentation/README.md * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py * Fix consistency oneformer * Fix imports * Fix imports sort * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com> * Add resources to docs * Update examples/pytorch/instance-segmentation/README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update examples/pytorch/instance-segmentation/README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Remove explicit model_type argument * Fix tests * Update readme * Note about other models --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-31 16:56:17 +01:00
dependabot[bot]	e05baad861	Bump requests from 2.31.0 to 2.32.2 in /examples/research_projects/visual_bert (#30983 ) Bump requests in /examples/research_projects/visual_bert Bumps [requests](https://github.com/psf/requests) from 2.31.0 to 2.32.2. - [Release notes](https://github.com/psf/requests/releases) - [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md) - [Commits](https://github.com/psf/requests/compare/v2.31.0...v2.32.2) --- updated-dependencies: - dependency-name: requests dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-23 12:38:00 +01:00
Pavel Iakubovskii	15585b81a5	Update object detection with latest resize and pad strategies (#30955 ) * Update with new resizing and pad strategy * Return pixel mask param * Update inference in guide * Fix empty compose * Update guide	2024-05-23 00:13:56 +01:00
dependabot[bot]	edb14eba64	Bump requests from 2.31.0 to 2.32.2 in /examples/research_projects/lxmert (#30956 ) --- updated-dependencies: - dependency-name: requests dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-22 11:27:41 +01:00
Arthur	673440d073	update ruff version (#30932 ) * update ruff version * fix research projects * Empty * Fix errors --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2024-05-22 06:40:15 +02:00
Lu Teng	5bf9caa06d	Fix inhomogeneous shape error in example (#30434 ) Fix inhomogeneous shape error in example.	2024-05-21 18:14:11 +01:00
dependabot[bot]	d502bd6475	Bump requests from 2.31.0 to 2.32.0 in /examples/research_projects/decision_transformer (#30925 ) --- updated-dependencies: - dependency-name: requests dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-21 09:41:29 +01:00
Arthur Zucker	b6eb708bf1	v4.42.dev.0	2024-05-17 17:30:41 +02:00
Sanchit Gandhi	57edd84bdb	[whisper] fix multilingual fine-tuning (#30865 ) * [whisper] fix multilingual fine-tuning * config ids as well	2024-05-17 15:12:44 +01:00
amyeroberts	57c965a8f1	Remove deprecated logic and warnings (#30743 ) * Remove deprecated logic and warnings * Add back some code that seems to be important... * Let's just add all he nllb stuff back; removing it is a bit more involved * Remove kwargs * Remove more kwargs	2024-05-17 12:15:59 +01:00
Aaron Jimenez	8ce4fefc52	[docs] Update link in es/pipeline_webserver.md (#30745 ) * update link * run make style	2024-05-10 09:29:26 -07:00
Pavel Iakubovskii	998dbe068b	Add examples for detection models finetuning (#30422 ) * Training script for object detection * Evaluation script for object detection * Training script for object detection with eval loop outside trainer * Trainer DETR finetuning * No trainer DETR finetuning * Eval script * Refine object detection example with trainer * Remove commented code and enable telemetry * No trainer example * Add requirements for object detection examples * Add test for trainer example * Readme draft * Fix uploading to HUB * Readme improvements * Update eval script * Adding tests for object-detection examples * Add object-detection example * Add object-detection resources to docs * Update README with custom dataset instructions * Update year * Replace valid with validation * Update instructions for custom dataset * Remove eval script * Remove use_auth_token * Add copied from and telemetry * Fixup * Update readme * Fix id2label * Fix links in docs * Update examples/pytorch/object-detection/run_object_detection.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update examples/pytorch/object-detection/run_object_detection.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Move description to the top * Fix Trainer example * Update no trainer example * Update albumentations version --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2024-05-08 11:42:07 +01:00
dependabot[bot]	4a17200891	Bump tqdm from 4.48.2 to 4.66.3 in /examples/research_projects/lxmert (#30644 ) Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.48.2 to 4.66.3. - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](https://github.com/tqdm/tqdm/compare/v4.48.2...v4.66.3) --- updated-dependencies: - dependency-name: tqdm dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-07 12:45:29 +01:00
dependabot[bot]	3733391c53	Bump tqdm from 4.48.2 to 4.66.3 in /examples/research_projects/visual_bert (#30645 ) Bump tqdm in /examples/research_projects/visual_bert Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.48.2 to 4.66.3. - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](https://github.com/tqdm/tqdm/compare/v4.48.2...v4.66.3) --- updated-dependencies: - dependency-name: tqdm dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-07 11:57:30 +01:00
dependabot[bot]	4051d362cb	Bump tqdm from 4.63.0 to 4.66.3 in /examples/research_projects/decision_transformer (#30646 ) Bump tqdm in /examples/research_projects/decision_transformer Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.63.0 to 4.66.3. - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](https://github.com/tqdm/tqdm/compare/v4.63.0...v4.66.3) --- updated-dependencies: - dependency-name: tqdm dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-07 11:57:10 +01:00
dependabot[bot]	ce47582d81	Bump werkzeug from 3.0.1 to 3.0.3 in /examples/research_projects/decision_transformer (#30679 ) Bump werkzeug in /examples/research_projects/decision_transformer Bumps [werkzeug](https://github.com/pallets/werkzeug) from 3.0.1 to 3.0.3. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/werkzeug/compare/3.0.1...3.0.3) --- updated-dependencies: - dependency-name: werkzeug dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-07 09:39:35 +01:00
dependabot[bot]	a898fb95bd	Bump jinja2 from 3.1.3 to 3.1.4 in /examples/research_projects/decision_transformer (#30680 ) Bump jinja2 in /examples/research_projects/decision_transformer Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/3.1.3...3.1.4) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-07 09:28:56 +01:00
Bai Li	12c5544dca	Fix memory leak with CTC training script on Chinese languages (#30358 ) * Fix memory leak with CTC training script on Chinese languages * Fix lint	2024-05-02 09:33:36 +01:00
dependabot[bot]	c681b58b06	Bump torch from 1.9.0+cpu to 1.13.1 in /examples/flax/vision (#21168 ) Bumps [torch](https://github.com/pytorch/pytorch) from 1.9.0+cpu to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/commits/v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-05-01 20:14:57 +01:00
dependabot[bot]	3a36597a5f	Bump pillow from 10.0.1 to 10.2.0 in /examples/research_projects/decision_transformer (#28655 ) Bump pillow in /examples/research_projects/decision_transformer Bumps [pillow](https://github.com/python-pillow/Pillow) from 10.0.1 to 10.2.0. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/10.0.1...10.2.0) --- updated-dependencies: - dependency-name: pillow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 19:58:34 +01:00
dependabot[bot]	4f3c7af489	Bump torch from 1.9.0+cpu to 1.13.1 in /examples/research_projects/jax-projects/hybrid_clip (#21167 ) Bump torch in /examples/research_projects/jax-projects/hybrid_clip Bumps [torch](https://github.com/pytorch/pytorch) from 1.9.0+cpu to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/commits/v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 18:37:55 +01:00
dependabot[bot]	6f465d45d9	Bump torch from 1.11.0 to 1.13.1 in /examples/research_projects/decision_transformer (#21171 ) Bump torch in /examples/research_projects/decision_transformer Bumps [torch](https://github.com/pytorch/pytorch) from 1.11.0 to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v1.11.0...v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 18:16:25 +01:00
dependabot[bot]	6e0cba3cec	Bump torch from 1.6.0 to 1.13.1 in /examples/research_projects/visual_bert (#21172 ) Bump torch in /examples/research_projects/visual_bert Bumps [torch](https://github.com/pytorch/pytorch) from 1.6.0 to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v1.6.0...v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:40:54 +01:00
dependabot[bot]	ce66c0e989	Bump torch from 1.11.0 to 1.13.1 in /examples/research_projects/codeparrot (#21170 ) Bump torch in /examples/research_projects/codeparrot Bumps [torch](https://github.com/pytorch/pytorch) from 1.11.0 to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v1.11.0...v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:40:19 +01:00
dependabot[bot]	7a29c577e8	Bump torch from 1.6.0 to 1.13.1 in /examples/research_projects/lxmert (#21174 ) Bumps [torch](https://github.com/pytorch/pytorch) from 1.6.0 to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v1.6.0...v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:39:55 +01:00
dependabot[bot]	b33f01fe6b	Bump pyarrow from 1.0.1 to 15.0.0 in /examples/research_projects/lxmert (#30584 ) Bumps [pyarrow](https://github.com/apache/arrow) from 1.0.1 to 15.0.0. - [Commits](https://github.com/apache/arrow/compare/apache-arrow-1.0.1...go/v15.0.0) --- updated-dependencies: - dependency-name: pyarrow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:38:07 +01:00
dependabot[bot]	0ec3003ae9	Bump pyarrow from 1.0.1 to 15.0.0 in /examples/research_projects/visual_bert (#30583 ) Bump pyarrow in /examples/research_projects/visual_bert Bumps [pyarrow](https://github.com/apache/arrow) from 1.0.1 to 15.0.0. - [Commits](https://github.com/apache/arrow/compare/apache-arrow-1.0.1...go/v15.0.0) --- updated-dependencies: - dependency-name: pyarrow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:37:54 +01:00
dependabot[bot]	aefbdfe8cf	Bump pyarrow from 7.0.0 to 15.0.0 in /examples/research_projects/decision_transformer (#30582 ) Bump pyarrow in /examples/research_projects/decision_transformer Bumps [pyarrow](https://github.com/apache/arrow) from 7.0.0 to 15.0.0. - [Commits](https://github.com/apache/arrow/compare/go/v7.0.0...go/v15.0.0) --- updated-dependencies: - dependency-name: pyarrow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:37:40 +01:00
dependabot[bot]	7164171212	Bump gitpython from 3.1.32 to 3.1.41 in /examples/research_projects/distillation (#30586 ) Bump gitpython in /examples/research_projects/distillation Bumps [gitpython](https://github.com/gitpython-developers/GitPython) from 3.1.32 to 3.1.41. - [Release notes](https://github.com/gitpython-developers/GitPython/releases) - [Changelog](https://github.com/gitpython-developers/GitPython/blob/main/CHANGES) - [Commits](https://github.com/gitpython-developers/GitPython/compare/3.1.32...3.1.41) --- updated-dependencies: - dependency-name: gitpython dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:36:57 +01:00
dependabot[bot]	ff8f624542	Bump grpcio from 1.44.0 to 1.53.2 in /examples/research_projects/decision_transformer (#30585 ) Bump grpcio in /examples/research_projects/decision_transformer Bumps [grpcio](https://github.com/grpc/grpc) from 1.44.0 to 1.53.2. - [Release notes](https://github.com/grpc/grpc/releases) - [Changelog](https://github.com/grpc/grpc/blob/master/doc/grpc_release_schedule.md) - [Commits](https://github.com/grpc/grpc/compare/v1.44.0...v1.53.2) --- updated-dependencies: - dependency-name: grpcio dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:35:52 +01:00
dependabot[bot]	b71f512823	Bump gitpython from 3.1.32 to 3.1.41 in /examples/research_projects/decision_transformer (#30587 ) Bump gitpython in /examples/research_projects/decision_transformer Bumps [gitpython](https://github.com/gitpython-developers/GitPython) from 3.1.32 to 3.1.41. - [Release notes](https://github.com/gitpython-developers/GitPython/releases) - [Changelog](https://github.com/gitpython-developers/GitPython/blob/main/CHANGES) - [Commits](https://github.com/gitpython-developers/GitPython/compare/3.1.32...3.1.41) --- updated-dependencies: - dependency-name: gitpython dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-01 16:30:24 +01:00
amyeroberts	bbaa8ceff6	Fix canonical model --model_type in examples (#30480 ) Fix --model_type in examples	2024-05-01 15:47:05 +01:00
Matt	1e05671d21	Fix QA example (#30580 ) * Handle cases when CLS token is absent * Use BOS token as a fallback	2024-05-01 08:43:02 +01:00
Anton Vlasjuk	9112520b15	Fix seq2seq collator padding (#30556 ) * fix seq2seq data collator to respect the given padding strategy further added tests for the seq2seq data collator in the style of the `data_collator_for_token_classification` (pt, tf, np) * formatting and change bool equals "==" to "is" * add missed return types in tests * update numpy test as it can handle unequal shapes, not like pt or tf	2024-04-30 18:32:30 +01:00
Sanchit Gandhi	38b53da38a	[examples] update whisper fine-tuning (#29938 ) * [examples] update whisper fine-tuning * deprecate forced/suppress tokens * item assignment * update readme * final fix	2024-04-26 17:06:03 +01:00
Arthur	661190b44d	[`research_project`] Most of the security issues come from this requirement.txt (#29977 ) update most of decision transformers research project	2024-04-24 17:56:45 +02:00
Zach Mueller	60d5f8f9f0	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190 ) * Alias * Note alias * Tests and src * Rest * Clean * Change typing? * Fix tests * Deprecation versions	2024-04-18 12:49:43 -04:00
Lysandre	ce8e64fbe2	Dev version	2024-04-18 15:53:25 +02:00
Nicolas Patry	8e5f76f511	Upgrading to tokenizers 0.19.0 (#30289 ) * [DO NOT MERGE] Testing tokenizers 0.19.0rc0 * Accounting for the breaking change. * Ruff. * Upgrading to tokenizers `0.19` (new release with preprend_scheme fixed and new surface for BPE tiktoken bug).	2024-04-17 17:17:50 +02:00
Yih-Dar	05dab4e5ba	Fix test `ExamplesTests::test_run_translation` (#30281 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-04-17 11:46:33 +02:00
Sai-Suraj-27	b3595cf02b	fix: Replaced deprecated `typing.Text` with `str` (#30230 ) typing.Text is deprecated. Use str instead	2024-04-15 12:18:37 +01:00
JINO ROHIT	f010786218	Set pad_token in run_glue_no_trainer.py #28534 (#30234 )	2024-04-15 11:39:10 +01:00
Pavel Iakubovskii	56d001b26f	Fix and simplify semantic-segmentation example (#30145 ) * Remove unused augmentation * Fix pad_if_smaller() and remove unused augmentation * Add indentation * Fix requirements * Update dataset use instructions * Replace transforms with albumentations * Replace identity transform with None * Fixing formatting * Fixed comment place	2024-04-10 09:10:52 +01:00
NielsRogge	e9c23fa056	[Trainer] Undo #29896 (#30129 ) * Undo * Use tokenizer * Undo data collator	2024-04-09 12:55:42 +02:00
JINO ROHIT	f5658732d5	fixing issue 30034 - adding data format for run_ner.py (#30088 )	2024-04-08 12:49:59 +01:00
Haz Sameen Shahgir	5e673ed2dc	updated examples/pytorch/language-modeling scripts and requirements.txt to require datasets>=2.14.0 (#30120 ) updated requirements.txt and require_version() calls in examples/pytorch/language-modeling to require datasets>=2.14.0	2024-04-08 12:41:28 +01:00
NielsRogge	1ab7136488	[Trainer] Allow passing image processor (#29896 ) * Add image processor to trainer * Replace tokenizer=image_processor everywhere	2024-04-05 10:10:44 +02:00
Mario Šaško	fce52cefa7	Fix `remove_columns` in `text-classification` example (#29351 )	2024-04-02 19:15:27 +02:00
Jacky Lee	156d30da94	Add warning message for `run_qa.py` (#29867 ) * improve: error message for best model metric * update: raise warning instead of error	2024-03-30 17:02:31 +01:00
Lysandre Debut	39114c0383	Remove static pretrained maps from the library's internals (#29112 ) * [test_all] Remove static pretrained maps from the library's internals * Deprecate archive maps instead of removing them * Revert init changes * [test_all] Deprecate instead of removing * [test_all] PVT v2 support * [test_all] Tests should all pass * [test_all] Style * Address review comments * Update src/transformers/models/deprecated/_archive_maps.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/deprecated/_archive_maps.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * [test_all] trigger tests * [test_all] LLAVA * [test_all] Bad rebase --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-03-25 10:33:38 +01:00
Jacky Lee	ef6e371dba	Add support for `torch_dtype` in the run_mlm example (#29776 ) feat: add support for torch_dtype Co-authored-by: Jacky Lee <jackylee328@gmail.com>	2024-03-21 15:09:35 +00:00
Arthur Zucker	1248f09252	v4.40.0.dev.0	2024-03-20 23:31:47 +09:00
Quentin Lhoest	f02aea2737	Rename `glue` to `nyu-mll/glue` (#29679 ) * Update run_glue.py * Update run_glue.py * Update run_glue_no_trainer.py	2024-03-15 16:35:02 +01:00
guangwu	03847ef451	fix: typos (#29653 ) Signed-off-by: guoguangwu <guoguangwug@gmail.com>	2024-03-15 15:02:50 +00:00
Joao Gante	d47966536c	Examples: check `max_position_embeddings` in the translation example (#29600 ) check max_position_embeddings	2024-03-12 18:58:12 +00:00
Hilco van der Wilk	b6404866cd	Update legacy Repository usage in various example files (#29085 ) * Update legacy Repository usage in `examples/pytorch/text-classification/run_glue_no_trainer.py` Marked for deprecation here https://huggingface.co/docs/huggingface_hub/guides/upload#legacy-upload-files-with-git-lfs * Fix import order * Replace all example usage of deprecated Repository * Fix remaining repo call and rename args variable * Revert removing creation of gitignore files and don't change research examples	2024-03-12 13:20:49 +00:00
Matt	81ec8028f9	Stop passing None to compile() in TF examples (#29597 ) * Fix examples to stop passing None to compile(), rework example invocation for run_text_classification.py * Add Amy's fix	2024-03-12 12:22:29 +00:00
Yitong Huang	873d9bb3cc	Make torch xla available on GPU (#29334 ) * add USE_TORCH_XLA env * rename torch_tpu to torch_xla * better is_torch_xla_available; fix some fsdp and performance issues * fix format * fix bug when pjrt_device is cpu * fix bug * fix the deprecation handling --------- Co-authored-by: anw90 <ang868@gmail.com> Co-authored-by: wangang.wa <wangang.wa@alibaba-inc.com>	2024-03-11 14:07:16 +00:00
Tanay Mehta	6d67837f06	Add Fill-in-the-middle training objective example - PyTorch (#27464 ) * add: initial script to train clm fim * fix: if training model from scratch, new tokens will be added and embeddings resized * fix: fixed attention_mask errors when generating FIM data * fix: file formatted using black * add: run_fim_no_trainer.py and fixed some comments in run_fim.py * add: added fim examples to the README.md and ran code fixup * fix: little bug in both fim training scripts * fix: remove comment from notebook and added a note on fim related params * fix: minor typo in README * add: suggested minor changes to README and run_fim.py * add: gradient_accumulation_steps and gradient_checkpointing args * add: improved model embedding resizing * add: pad_to_multiple_of and attn_implementation params * add: requested minor changes * add: deepspeed zero compatibility * add: resize embeddings layer with zero3 support for fim model initialization	2024-03-11 12:14:02 +00:00
Arthur Zucker	1a77f07f65	v4.39.dev.0	2024-02-21 15:23:22 +09:00
Arthur	b2724d7b4c	change version (#29097 ) * change version * nuke * this doesn't make sense * update some requirements.py * revert + no main * nits * change cache number * more pin * revert --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-19 22:08:44 +08:00
Jay Zhou	79132d4cfe	Fix a typo in `examples/pytorch/text-classification/run_classification.py` (#29072 )	2024-02-19 13:01:15 +00:00
Lysandre Debut	f497f564bb	Update all references to canonical models (#29001 ) * Script & Manual edition * Update	2024-02-16 08:16:58 +01:00
Klaus Hipp	fe3df9d5b3	[Docs] Add language identifiers to fenced code blocks (#28955 ) Add language identifiers to code blocks	2024-02-12 10:48:31 -08:00
Alexey Fadeev	792819f6cf	Updated requirements for image-classification samples: datasets>=2.14.0 (#28974 ) Updated datasets requirements. Need a package version >= 2.14.0	2024-02-12 14:57:25 +00:00
Yih-Dar	308d2b9004	Update the cache number (#28905 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-07 16:37:09 +01:00
Klaus Hipp	1c31b7aa3b	[Docs] Add missing language options and fix broken links (#28852 ) * Add missing entries to the language selector * Add links to the Colab and AWS Studio notebooks for ONNX * Use anchor links in CONTRIBUTING.md * Fix broken hyperlinks due to spaces * Fix links to OpenAI research articles * Remove confusing footnote symbols from author names, as they are also considered invalid markup	2024-02-06 12:01:01 -08:00
dependabot[bot]	e83227d76e	Bump cryptography from 41.0.2 to 42.0.0 in /examples/research_projects/decision_transformer (#28879 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.2 to 42.0.0. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/41.0.2...42.0.0) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-06 03:53:08 +01:00
dependabot[bot]	ca8944c4e3	Bump dash from 2.3.0 to 2.15.0 in /examples/research_projects/decision_transformer (#28845 ) Bump dash in /examples/research_projects/decision_transformer Bumps [dash](https://github.com/plotly/dash) from 2.3.0 to 2.15.0. - [Release notes](https://github.com/plotly/dash/releases) - [Changelog](https://github.com/plotly/dash/blob/dev/CHANGELOG.md) - [Commits](https://github.com/plotly/dash/compare/v2.3.0...v2.15.0) --- updated-dependencies: - dependency-name: dash dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-02-05 03:12:30 +01:00
Klaus Hipp	721ee783ca	[Docs] Fix spelling and grammar mistakes (#28825 ) * Fix typos and grammar mistakes in docs and examples * Fix typos in docstrings and comments * Fix spelling of `tokenizer` in model tests * Remove erroneous spaces in decorators * Remove extra spaces in Markdown link texts	2024-02-02 08:45:00 +01:00
zspo	d98591a12b	[docs] fix some bugs about parameter description (#28806 ) Co-authored-by: p_spozzhang <p_spozzhang@tencent.com>	2024-02-01 16:59:29 +00:00
Matt	74c9cfeaa7	Pin Torch to <2.2.0 (#28785 ) * Pin torch to <2.2.0 * Pin torchvision and torchaudio as well * Playing around with versions to see if this helps * twiddle something to restart the CI * twiddle it back * Try changing the natten version * make fixup * Revert "Try changing the natten version" This reverts commit `de0d6592c3`. * make fixup * fix fix fix * fix fix fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-30 23:01:12 +01:00
Matt	415e9a0980	Add tf_keras imports to prepare for Keras 3 (#28588 ) * Port core files + ESM (because ESM code is odd) * Search-replace in modelling code * Fix up transfo_xl as well * Fix other core files + tests (still need to add correct import to tests) * Fix cookiecutter * make fixup, fix imports in some more core files * Auto-add imports to tests * Cleanup, add imports to sagemaker tests * Use correct exception for importing tf_keras * Fixes in modeling_tf_utils * make fixup * Correct version parsing code * Ensure the pipeline tests correctly revert to float32 after each test * Ensure the pipeline tests correctly revert to float32 after each test * More tf.keras -> keras * Add dtype cast * Better imports of tf_keras * Add a cast for tf.assign, just in case * Fix callback imports	2024-01-30 17:26:36 +00:00
Klaus Hipp	39fa400969	Fix input data file extension in examples (#28741 )	2024-01-29 10:06:31 +00:00
Steven Liu	abe0289e6d	[docs] Fix datasets in guides (#28715 ) * change datasets * fix	2024-01-26 09:29:07 -08:00
bofeng huang	deb2b59073	Fix lr_scheduler in no_trainer training scripts (#27872 ) * Fix lr_scheduler * Fix lr scheduler	2024-01-22 14:22:18 +00:00
jheitmann	f0acf7b6d8	Fix id2label assignment in run_classification.py (#28590 )	2024-01-22 11:31:31 +00:00
Amy Roberts	b2748a6efd	v4.38.dev.0	2024-01-19 10:43:28 +00:00
Yoach Lacombe	772307be76	Making CTC training example more general (#28582 ) * add w2v2bert compatibility * Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-01-18 17:01:49 +00:00
Yoach Lacombe	d2cdefb9ec	Add new meta w2v2-conformer BERT-like model (#28165 ) * first commit * correct default value non causal * update config and modeling code * update converting checkpoint * clean modeling and fix tests * make style * add new config parameters to docstring * fix copied from statements * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * make position_embeddings_type docstrings clearer * clean converting script * remove function not used * clean modeling file * apply suggestion for test file + add convert script to not_doctested * modify tests according to review - cleaner logic and more tests * Apply nit suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add checker of valid position embeddings type * instantiate new layer norm layer with the right eps * fix freeze_feature_encoder since it can be None in some cases * add test same output in convert script * restore wav2vec2conformer and add new model * create processor and FE + clean * add new model code * fix convert script and set default config parameters * correct model id paths * make style * make fix-copies and cleaning files * fix copied from statements * complete .md and fixe copies * clean convert script argument defaults * fix config parameters docstrings * fix config docstring * add copied from and enrich FE tests * fix copied from and repo-consistency * add autotokenizer * make test input length shorter and change docstring code * fix docstrings and copied from * add add_adapter to ASR training example * make testing of adapters more robust * adapt to multi adapter layers * refactor input_values->input_features and remove w2v2-bert feature extractor * remove pretraining model * remove depreciated features and useless lines * add copied from and ignore statements to modeling tests * remove pretraining model #2 * change import in convert script * change default in convert script * update readme and remove useless line * Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * refactor BERT to Bert for consistency * remove useless ignore copy statement * add persistent to buffer in rotary * add eps in LayerNorm init and remove copied from * add adapter activation parameters and add copied from statements * Fix copied statements and add unitest.skip reasons * add copied statement in test_processor * refactor processor * make style * replace numpy random by torch rand * remove expected output CTC * improve converting script with processor class * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove gumbel class * remove tests related to previously deleted class * Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * correct typos * remove uused parameters * update processor to takes both text and audio * update checkpoints * update expected output and add ctc expected output * add label_attention_mask * replace pt with np in processor tests * fix typo * revert to behaviour with labels_attention_mask --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-01-18 13:37:34 +00:00
hugo-syn	5d8eb93eee	chore: Fix multiple typos (#28574 )	2024-01-18 13:35:09 +00:00
regisss	0cdcd7a2b3	Remove `task` arg in `load_dataset` in image-classification example (#28408 ) * Remove `task` arg in `load_dataset` in image-classification example * Manage case where "train" is not in dataset * Add new args to manage image and label column names * Similar to audio-classification example * Fix README * Update tests	2024-01-16 08:04:08 +01:00
Timothy Cronin	ff86bc364d	improve dev setup comments and hints (#28495 ) * improve dev setup comments and hints * fix tests for new dev setup hints	2024-01-15 18:36:40 +00:00
Yih-Dar	64bdbd888c	Don't set `finetuned_from` if it is a local path (#28482 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-15 11:38:20 +01:00
Joao Gante	4fb3d3a0f6	TF: purge `TFTrainer` (#28483 )	2024-01-12 16:56:34 +00:00
dependabot[bot]	c45ef1c0d1	Bump jinja2 from 2.11.3 to 3.1.3 in /examples/research_projects/decision_transformer (#28457 ) Bump jinja2 in /examples/research_projects/decision_transformer Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.3 to 3.1.3. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/2.11.3...3.1.3) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-12 15:28:55 +01:00
Alex Hedges	95091e1582	Set `cache_dir` for `evaluate.load()` in example scripts (#28422 ) While using `run_clm.py`,[^1] I noticed that some files were being added to my global cache, not the local cache. I set the `cache_dir` parameter for the one call to `evaluate.load()`, which partially solved the problem. I figured that while I was fixing the one script upstream, I might as well fix the problem in all other example scripts that I could. There are still some files being added to my global cache, but this appears to be a bug in `evaluate` itself. This commit at least moves some of the files into the local cache, which is better than before. To create this PR, I made the following regex-based transformation: `evaluate\.load$(.*?)$` -> `evaluate\.load$$1, cache_dir=model_args.cache_dir$`. After using that, I manually fixed all modified files with `ruff` serving as useful guidance. During the process, I removed one existing usage of the `cache_dir` parameter in a script that did not have a corresponding `--cache-dir` argument declared. [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from v4.34.1 of the library. For the original code, see the following URL: `acc394c4f5/examples/pytorch/language-modeling/run_clm.py`.	2024-01-11 15:38:44 +01:00
dependabot[bot]	932ad8af7a	Bump fonttools from 4.31.1 to 4.43.0 in /examples/research_projects/decision_transformer (#28417 ) Bump fonttools in /examples/research_projects/decision_transformer Bumps [fonttools](https://github.com/fonttools/fonttools) from 4.31.1 to 4.43.0. - [Release notes](https://github.com/fonttools/fonttools/releases) - [Changelog](https://github.com/fonttools/fonttools/blob/main/NEWS.rst) - [Commits](https://github.com/fonttools/fonttools/compare/4.31.1...4.43.0) --- updated-dependencies: - dependency-name: fonttools dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-10 11:22:43 +01:00
Lysandre	3ed3e3190c	Dev version	2023-12-13 18:29:31 +01:00
saswatmeher	a49f4acab3	Fix link in README.md of Image Captioning (#27969 ) Update the link for vision encoder decoder doc used by FlaxVisionEncoderDecoderModel link.	2023-12-12 08:07:15 -05:00
Adam Louly	4850aaba6f	fix no sequence length models error (#27522 ) * fix no sequence length models error * block size check --------- Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-12-11 18:01:26 +00:00
Phuc Van Phan	0410a29a2d	fix: fix gradient accumulate step for learning rate (#27667 )	2023-12-07 07:59:26 +01:00
Dave Berenbaum	fe41647afc	uses dvclive_test mode in examples/pytorch/test_accelerate_examples.py (#27763 )	2023-11-30 14:52:03 +01:00
Peter Pan	ce31508134	docs: replace torch.distributed.run by torchrun (#27528 ) * docs: replace torch.distributed.run by torchrun `transformers` now officially support pytorch >= 1.10. The entrypoint `torchrun`` is present from 1.10 onwards. Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> * Update src/transformers/trainer.py with @ArthurZucker's suggestion Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-11-27 16:26:33 +00:00
Mathias Nielsen	f31af3927f	[ examples] fix loading jsonl with load dataset in run translation example (#26924 ) * Renamed variable extension to builder_name * If builder name is jsonl change to json to align with load_datasets * Apply suggestions from code review Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> --------- Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>	2023-11-20 15:45:42 +01:00
V.Prasanna kumar	ffbcfc0166	Broken links fixed related to datasets docs (#27569 ) fixed the broken links belogs to dataset library of transformers	2023-11-17 13:44:09 -08:00
Arthur	651408a077	[`Styling`] stylify using ruff (#27144 ) * try to stylify using ruff * might need to remove these changes? * use ruf format andruff check * use isinstance instead of type comparision * use # fmt: skip * use # fmt: skip * nits * soem styling changes * update ci job * nits isinstance * more files update * nits * more nits * small nits * check and format * revert wrong changes * actually use formatter instead of checker * nits * well docbuilder is overwriting this commit * revert notebook changes * try to nuke docbuilder * style * fix feature exrtaction test * remve `indent-width = 4` * fixup * more nits * update the ruff version that we use * style * nuke docbuilder styling * leve the print for detected changes * nits * Remove file I/O Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com> * style * nits * revert notebook changes * Add # fmt skip when possible * Add # fmt skip when possible * Fix * More ` # fmt: skip` usage * More ` # fmt: skip` usage * More ` # fmt: skip` usage * NIts * more fixes * fix tapas * Another way to skip * Recommended way * Fix two more fiels * Remove asynch Remove asynch --------- Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>	2023-11-16 17:43:19 +01:00
Lucain	fd65aa9818	Set `usedforsecurity=False` in hashlib methods (FIPS compliance) (#27483 ) * Set usedforsecurity=False in hashlib methods (FIPS compliance) * trigger ci * tokenizers version * deps * bump hfh version * let's try this	2023-11-16 14:29:53 +00:00
Matt	4989e73e2f	Update the TF pin for 2.15 (#27375 ) * Move the TF pin for 2.15 * make fixup	2023-11-16 13:47:43 +00:00
Phuc Van Phan	69c9b89fcb	docs: add docs for map, and add num procs to load_dataset (#27520 )	2023-11-16 13:16:19 +00:00
Matt	2e72bbab2c	Incorrect setting for num_beams in translation and summarization examples (#27519 ) * Remove the torch main_process_first context manager from TF examples * Correctly set num_beams=1 in our examples, and add a guard in GenerationConfig.validate() * Update src/transformers/generation/configuration_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-11-15 18:18:54 +00:00
Adam Louly	e6522e49a7	Fixing the failure of models without max_position_embeddings attribute. (#27499 ) fix max pos issue Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-11-15 18:16:42 +00:00
Zach Mueller	a85ea4b19a	Fix wav2vec2 params (#27515 ) Fix test	2023-11-15 09:24:03 -05:00
Arthur	b97cab7e6d	Remove-auth-token (#27060 ) * don't use `use_auth_token`internally * let's use token everywhere * fixup	2023-11-13 14:20:54 +01:00
Yih-Dar	c8b6052ff6	Final fix of the accelerate installation issue (#27408 ) * fix * [test-all] commit * fix * [test-all] commit * [test-all] commit * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-11-09 18:52:29 +01:00
Dave Berenbaum	791ec370d1	Adds dvclive callback (#27352 ) * dvclive trainer callback * style fixes * dvclive link fixes	2023-11-09 12:19:31 +00:00
Zach Mueller	e9adb0c9cf	Change thresh in test (#27378 ) Change thresh	2023-11-09 04:44:36 -05:00
Zach Mueller	845aa832b7	Remove unused param from example script tests (#27354 ) Unused param	2023-11-08 09:07:32 -05:00
Zach Mueller	efa57cb234	Fix example tests from failing (#27353 ) * Fix example tests from failing * CHange thresh	2023-11-08 07:45:21 -05:00
Hz, Ji	b6dbfee0a2	moving example of benchmarking to legacy dir (#27337 ) move example of benchmarking to legacy	2023-11-08 09:27:37 +01:00
Lysandre	bc78fd1274	Dev version	2023-11-02 18:15:36 +01:00
Dong-geon Lee	25e6e9418c	Unify warning styles for better readability (#27184 )	2023-10-31 18:12:14 +00:00
Hz, Ji	cd19b19378	make tests of pytorch_example device agnostic (#27081 )	2023-10-30 14:56:41 +00:00
Gema Parreño	722e936491	[Typo fix] flag config in WANDB (#27130 ) typo fix flag config	2023-10-29 18:22:26 +00:00
Lucain	66b088faf0	Provide alternative when warning on use_auth_token (#27105 )	2023-10-27 14:32:54 +02:00
dependabot[bot]	3c2692407d	Bump urllib3 from 1.26.17 to 1.26.18 in /examples/research_projects/lxmert (#26888 ) Bump urllib3 in /examples/research_projects/lxmert Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.17 to 1.26.18. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.17...1.26.18) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-26 09:10:29 +02:00
dependabot[bot]	9c5240af14	Bump werkzeug from 2.2.3 to 3.0.1 in /examples/research_projects/decision_transformer (#27072 ) Bump werkzeug in /examples/research_projects/decision_transformer Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.2.3 to 3.0.1. - [Release notes](https://github.com/pallets/werkzeug/releases) - [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/werkzeug/compare/2.2.3...3.0.1) --- updated-dependencies: - dependency-name: werkzeug dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-26 08:56:28 +02:00
Michal Jamroz	e2d6d5ce57	Normalize only if needed (#26049 ) * Normalize only if needed * Update examples/pytorch/image-classification/run_image_classification.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * if else in one line * within block * one more place, sorry for mess * import order * Update examples/pytorch/image-classification/run_image_classification.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update examples/pytorch/image-classification/run_image_classification_no_trainer.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-10-24 13:32:03 +01:00
YQ	f71c9ccf59	fix logit-to-multi-hot conversion in example (#26936 ) * fix logit to multi-hot converstion * add comments * typo	2023-10-23 12:33:05 +02:00
Matt	cbd278f0f6	Pin Keras for now (#26904 ) * Pin Keras for now out of paranoia * Add the keras pin to _tests_requirements.txt too * Make sure the Keras version matches the TF one * make fixup	2023-10-19 14:39:31 +01:00
dependabot[bot]	bece55d8f9	Bump urllib3 from 1.26.17 to 1.26.18 in /examples/research_projects/decision_transformer (#26889 ) Bump urllib3 in /examples/research_projects/decision_transformer Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.17 to 1.26.18. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.17...1.26.18) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-18 13:31:06 +02:00
dependabot[bot]	6d644d6852	Bump urllib3 from 1.26.17 to 1.26.18 in /examples/research_projects/visual_bert (#26890 ) Bump urllib3 in /examples/research_projects/visual_bert Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.17 to 1.26.18. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.17...1.26.18) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-18 04:30:50 -07:00
Tom Aarsen	40ea9ab2a1	Add many missing spaces in adjacent strings (#26751 ) Add missing spaces in adjacent strings	2023-10-12 10:28:40 +02:00
Zach Mueller	1d6a84749b	Fix checkpoint path in `no_trainer` scripts (#26733 ) checkpoint path	2023-10-11 16:16:27 +02:00
Roy Hvaara	fc63914399	[JAX] Replace uses of `jnp.array` in types with `jnp.ndarray`. (#26703 ) `jnp.array` is a function, not a type: https://jax.readthedocs.io/en/latest/_autosummary/jax.numpy.array.html so it never makes sense to use `jnp.array` in a type annotation. Presumably the intent was to write `jnp.ndarray` aka `jax.Array`. Co-authored-by: Peter Hawkins <phawkins@google.com>	2023-10-10 21:35:16 +02:00
jheitmann	3eceaa3637	Fix source_prefix default value (#26654 )	2023-10-10 20:49:10 +02:00
statelesshz	27597fea07	remove SharedDDP as it is deprecated (#25702 ) * remove SharedDDP as it was drepracated * apply review suggestion * make style * Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer. * remove the unnecessary conditional statement * keep the logic of IPEX * clean code * mix precision setup & make fixup --------- Co-authored-by: statelesshz <jihuazhong1@huawei.com>	2023-10-06 16:03:11 +02:00
Phuc Van Phan	6015f91a5a	refactor: change default block_size (#26229 ) * refactor: change default block_size * fix: return tf to origin * fix: change files to origin * rebase * rebase * rebase * rebase * rebase * rebase * rebase * rebase * refactor: add min block_size to files * reformat: add min block_size for run_clm tf	2023-10-04 15:31:38 +01:00
dependabot[bot]	fc296f419e	Bump pillow from 9.3.0 to 10.0.1 in /examples/research_projects/decision_transformer (#26580 ) Bump pillow in /examples/research_projects/decision_transformer Bumps [pillow](https://github.com/python-pillow/Pillow) from 9.3.0 to 10.0.1. - [Release notes](https://github.com/python-pillow/Pillow/releases) - [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst) - [Commits](https://github.com/python-pillow/Pillow/compare/9.3.0...10.0.1) --- updated-dependencies: - dependency-name: pillow dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-04 11:52:46 +02:00
Lysandre	bd6205919a	v4.35.0.dev0	2023-10-03 16:54:37 +02:00
dependabot[bot]	cf345d5f38	Bump urllib3 from 1.26.9 to 1.26.17 in /examples/research_projects/decision_transformer (#26554 ) Bump urllib3 in /examples/research_projects/decision_transformer Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.9 to 1.26.17. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.9...1.26.17) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-03 08:55:12 +02:00
dependabot[bot]	6de6fdd06d	Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/visual_bert (#26552 ) Bump urllib3 in /examples/research_projects/visual_bert Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.5 to 1.26.17. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-03 08:55:01 +02:00
dependabot[bot]	e092b4ad68	Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/lxmert (#26551 ) Bump urllib3 in /examples/research_projects/lxmert Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.5 to 1.26.17. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-03 08:54:50 +02:00
Sanchit Gandhi	68e85fc822	[Flax Examples] Seq2Seq ASR Fine-Tuning Script (#21764 ) * from seq2seq speech * [Flax] Example script for speech seq2seq * tests and fixes * make style * fix: label padding tokens * fix: label padding tokens over list * update ln names for Whisper * try datasets iter loader * create readme and append results * style * make style * adjust lr * use pt dataloader * make fast * pin gen max len * finish * add pt to requirements for test * fix pt -> torch * add accelerate	2023-09-29 16:42:58 +01:00
Phuc Van Phan	ba47efbfe4	docs: change assert to raise and some small docs (#26232 ) * docs: change assert to raise and some small docs * docs: add rule and some document * fix: fix bug * fix: fix bug * chorse: revert logging * chorse: revert	2023-09-28 10:14:17 +02:00
Phuc Van Phan	910faa3e1f	feat: adding num_proc to load_dataset (#26326 ) * feat: adding num_proc to load_dataset * feat: add add_num_proc for run_mlm_flax * feat: add num_proc for bart and t5 * chorse: remove	2023-09-22 19:22:47 +02:00
Nino Risteski	7d4e0c23c8	Update README.md (#26198 ) Fixed a few typos	2023-09-19 00:02:50 +02:00
Phuc Van Phan	8b5da9fc6e	refactor: change default block_size in block size > max position embeddings (#26069 ) * refactor: change default block_size when not initialize * reformat: add the min of block size	2023-09-18 16:47:57 +01:00
Phuc Van Phan	4fb64e285a	chore: correct update_step and correct gradient_accumulation_steps (#26068 )	2023-09-12 18:31:23 +01:00
Phuc Van Phan	5af2c62696	docs: add space to docs (#26067 ) * docs: add space to docs * docs: remove reduntant space	2023-09-11 22:03:26 +01:00
Phuc Van Phan	9cebae64ad	docs: update link huggingface map (#26077 )	2023-09-11 12:57:04 +01:00
Nino Risteski	f6295c6c53	Update README.md (#26003 ) fixed a typo	2023-09-06 10:55:11 +01:00
Joao Gante	9a70d6e56f	Trainer: delegate default generation values to `generation_config` (#25987 )	2023-09-05 14:47:00 +01:00
Traun Leyden	7011cd8667	Update RAG README.md with correct path to examples/seq2seq (#25953 ) Update README.md with correct path to examples/seq2seq	2023-09-05 12:31:59 +01:00
Susnato Dhar	404ff8fc17	Fix typo (#25966 ) * Update feature_extraction_clap.py * changed all lenght to length	2023-09-05 10:12:25 +02:00
Lysandre	d8e13b3e04	v4.34.dev.0	2023-09-04 15:12:11 -04:00
Zach Mueller	be0e189bd3	Revert frozen training arguments (#25903 ) * Revert frozen training arguments * TODO	2023-09-01 11:24:12 -04:00
Younes Belkada	4b79697865	🚨🚨🚨 [`Refactor`] Move third-party related utility files into `integrations/` folder 🚨🚨🚨 (#25599 ) * move deepspeed to `lib_integrations.deepspeed` * more refactor * oops * fix slow tests * Fix docs * fix docs * addess feedback * address feedback * final modifs for PEFT * fixup * ok now * trigger CI * trigger CI again * Update docs/source/en/main_classes/deepspeed.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * import from `integrations` * address feedback * revert removal of `deepspeed` module * revert removal of `deepspeed` module * fix conflicts * ooops * oops * add deprecation warning * place it on the top * put `FutureWarning` * fix conflicts with not_doctested.txt * add back `bitsandbytes` module with a depr warning * fix * fix * fixup * oops * fix doctests --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-08-25 17:13:34 +02:00

1 2 3 4 5 ...

2548 Commits