transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

Author	SHA1	Message	Date
Yih-Dar	bf97d4aa6d	Fix benchmark script (#32635 ) * fix * >= 0.3.0 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-08-22 16:07:47 +02:00
Marc Sun	fd06ad5438	🚨🚨🚨 Update min version of accelerate to 0.26.0 (#32627 ) * Update min version of accelerate to 0.26.0 * dev-ci * update min version in import * remove useless check * dev-ci * style * dev-ci * dev-ci	2024-08-20 11:42:36 +02:00
Arthur Zucker	26a9443dae	dev version 4.45.0	2024-08-06 18:33:18 +02:00
Lysandre	ff0d708fe6	Dev version: v4.44.0.dev0	2024-07-23 17:12:47 +02:00
Sai-Suraj-27	d2c687b3f1	Updated `ruff` to the latest version (#31926 ) * Updated ruff version and fixed the required code accorindg to the latest version. * Updated ruff version and fixed the required code accorindg to the latest version. * Added noqa directive to ignore 1 error shown by ruff	2024-07-23 17:07:31 +02:00
Yih-Dar	765732e92c	unpin `numpy<2.0` (#32018 ) * unpin np * [test_all] trigger full CI --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-07-18 11:26:01 +02:00
Joao Gante	3345ae733b	dependencies: `keras-nlp<0.14` pin (#31684 ) * keras nlp pin * this should use the new docker images:dev * dev-ci	2024-07-01 17:39:33 +01:00
Lysandre	727eea4ab0	v4.43.0.dev0	2024-06-27 17:40:07 +02:00
René Gentzen	74b92c6256	Added version constraint on numpy for version <2.0 (#31569 ) * Contrained numpy to <2.0 * Updated dependency_versions_table --------- Co-authored-by: René Gentzen <rene.gentzen@mittelstand.ai>	2024-06-24 17:47:34 +01:00
Matt	2d4820284d	Add Jinja as a requirement with the right version cutoff (#31536 ) * Add Jinja as a requirement with the right version cutoff * Correct package name!	2024-06-24 14:42:16 +01:00
Albert Villanova del Moral	a14b055b65	Pass datasets trust_remote_code (#31406 ) * Pass datasets trust_remote_code * Pass trust_remote_code in more tests * Add trust_remote_dataset_code arg to some tests * Revert "Temporarily pin datasets upper version to fix CI" This reverts commit `b7672826ca`. * Pass trust_remote_code in librispeech_asr_dummy docstrings * Revert "Pin datasets<2.20.0 for examples" This reverts commit `833fc17a3e`. * Pass trust_remote_code to all examples * Revert "Add trust_remote_dataset_code arg to some tests" to research_projects * Pass trust_remote_code to tests * Pass trust_remote_code to docstrings * Fix flax examples tests requirements * Pass trust_remote_dataset_code arg to tests * Replace trust_remote_dataset_code with trust_remote_code in one example * Fix duplicate trust_remote_code * Replace args.trust_remote_dataset_code with args.trust_remote_code * Replace trust_remote_dataset_code with trust_remote_code in parser * Replace trust_remote_dataset_code with trust_remote_code in dataclasses * Replace trust_remote_dataset_code with trust_remote_code arg	2024-06-17 17:29:13 +01:00
Albert Villanova del Moral	b7672826ca	Temporarily pin datasets upper version to fix CI (#31407 ) Temporarily pin datasets upper version	2024-06-13 18:01:18 +01:00
Marc Sun	254b25abd9	Use huggingface_hub helper function to split state dict (#31091 ) * shard saving from hf hub * index = None * fix tests * indent	2024-06-12 14:10:32 +02:00
Arthur	8e8786e5f0	Update build ci image [push-ci-image] (#30933 ) * [build-ci-image] * correct branch * push ci image * [build-ci-image] * update scheduled as well * [push-ci-image] * [build-ci-image] * [push-ci-image] * update deps * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * oups [build-ci-image] * [push-ci-image] * fix * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * updated * [build-ci-image] update tag * [build-ci-image] * [build-ci-image] * fix tag * [build-ci-image] * [build-ci-image] * [build-ci-image] * [build-ci-image] * github name * commit_title? * fetch * update * it not found * dev * dev * [push-ci-image] * dev * dev * update * dev * dev print dev commit message dev * dev ? dev * dev * dev * dev * dev * [build-ci-image] * [build-ci-image] * [push-ci-image] * revert unwanted * revert convert as well * no you are not important * [build-ci-image] * Update .circleci/config.yml * pin tf probability dev	2024-05-22 10:52:59 +02:00
Arthur	673440d073	update ruff version (#30932 ) * update ruff version * fix research projects * Empty * Fix errors --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2024-05-22 06:40:15 +02:00
Yih-Dar	64e0573a81	[Benchmark] Reuse `optimum-benchmark` (#30615 ) * benchmark * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-21 15:15:19 +02:00
Arthur Zucker	b6eb708bf1	v4.42.dev.0	2024-05-17 17:30:41 +02:00
Aaron Jimenez	47735f5f0f	[docs] Update es/pipeline_tutorial.md (#30684 ) * copy en/ contect to es/ * translate first section * translate the doc * fix typos * run make style	2024-05-09 16:42:01 -07:00
Lucain	835de4c833	Respect `resume_download` deprecation (#30620 ) * Deprecate resume_download * remove default resume_download value --------- Co-authored-by: Lysandre Debut <hi@lysand.re>	2024-05-06 18:01:15 +02:00
Arthur	307f632bb2	[`CI update`] Try to use dockers and no cache (#29202 ) * change cis * nits * update * minor updates * [push-ci-image] * nit [push-ci-image] * nitsssss * [build-ci-image] * [push-ci-image] * [push-ci-image] * both * [push-ci-image] * this? * [push-ci-image] * pypi-kenlm needs g++ * [push-ci-image] * nit * more nits [push-ci-image] * nits [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * add vision * [push-ci-image] * [push-ci-image] * add new dummy file but will need to update them [push-ci-image] * [push-ci-image] * show package size as well * [push-ci-image] * potentially ignore failures * workflow updates * nits [push-ci-image] * [push-ci-image] * fix consistency * clean nciida triton * also show big packages [push-ci-image] * nit * update * another one * line escape? * add accelerate [push-ci-image] * updates [push-ci-image] * nits to run tests, no push-ci * try to parse skip reason to make sure nothing is skipped that should no be skippped * nit? * always show skipped reasons * nits * better parsing of the test outputs * action="store_true", * failure on failed * show matched * debug * update short summary with skipped, failed and errors * nits * nits * coolu pdates * remove docbuilder * fix * always run checks * oups * nits * don't error out on library printing * non zero exi codes * no warning * nit * WAT? * format nit * [push-ci-image] * fail if fail is needed * [push-ci-image] * sound file for torch light? * [push-ci-image] * order is important [push-ci-image] * [push-ci-image] reduce even further * [push-ci-image] * use pytest rich ! * yes [push-ci-image] * oupsy * bring back the full traceback, but pytest rich should help * nit * [push-ci-image] * re run * nit * [push-ci-image] * [push-ci-image] * [push-ci-image] * empty push to trigger * [push-ci-image] * nit? [push-ci-image] * empty * try to install timm with no deps * [push-ci-image] * oups [push-ci-image] * [push-ci-image] * [push-ci-image] ? * [push-ci-image] open ssh client for git checkout fast * empty for torch light * updates [push-ci-image] * nit * @v4 for checkout * [push-ci-image] * [push-ci-image] * fix fetch tests with parallelism * [push-ci-image] * more parallelism * nit * more nits * empty to re-trigger * empty to re-trigger * split by timing * did not work with previous commit * junit.xml * no path? * mmm this? * junitxml format * split by timing * nit * fix junit family * now we can test if the xunit1 is compatible! * this? * fully list tests * update * update * oups * finally * use classname * remove working directory to make sure the path does not interfere * okay no juni should have the correct path * name split? * sort by classname is what make most sense * some testing * naem * oups * test something fun * autodetect * 18? * nit * file size? * uip * 4 is best * update to see versions * better print * [push-ci-image] * [push-ci-image] * please install the correct keras version * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * uv is fucking me up * [push-ci-image] * [push-ci-image] * [push-ci-image] * nits * [push-ci-image] * [push-ci-image] * install issues an pins * tapas as well * nits * more paralellism * short tb * soundfile * soundfile * [push-ci-image] * [push-ci-image] * [push-ci-image] * oups * [push-ci-image] * fix some things * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * use torch-light for hub * small git lfs for hub job * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * fix tf tapas * [push-ci-image] * nits * [push-ci-image] * don't update the test * [push-ci-image] * [push-ci-image] * [push-ci-image] * no use them * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * update tf proba * [push-ci-image] * [push-ci-image] * woops * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * test with built dockers * [push-ci-image] * skip annoying tests * revert fix copy * update test values * update * last skip and fixup * nit * ALL GOOOD * quality * Update tests/models/layoutlmv2/test_image_processing_layoutlmv2.py * Update docker/quality.dockerfile Co-authored-by: Lysandre Debut <hi@lysand.re> * Update src/transformers/models/tapas/modeling_tf_tapas.py Co-authored-by: Lysandre Debut <hi@lysand.re> * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * use torch-speed * updates * [push-ci-image] * [push-ci-image] * [push-ci-image] * [push-ci-image] * fuck ken-lm [push-ci-image] * [push-ci-image] * [push-ci-image] --------- Co-authored-by: Lysandre Debut <hi@lysand.re>	2024-05-06 10:10:32 +02:00
Joao Gante	31921d8d5e	Jax: scipy version pin (#30402 ) scipy pin for jax	2024-04-23 10:42:17 +01:00
Lysandre	ce8e64fbe2	Dev version	2024-04-18 15:53:25 +02:00
Nicolas Patry	8e5f76f511	Upgrading to tokenizers 0.19.0 (#30289 ) * [DO NOT MERGE] Testing tokenizers 0.19.0rc0 * Accounting for the breaking change. * Ruff. * Upgrading to tokenizers `0.19` (new release with preprend_scheme fixed and new surface for BPE tiktoken bug).	2024-04-17 17:17:50 +02:00
Zach Mueller	c78f57729f	Update test reqs to include sentencepiece (#29756 ) * Update test reqs * Clean	2024-03-20 15:53:42 +00:00
Arthur Zucker	1248f09252	v4.40.0.dev.0	2024-03-20 23:31:47 +09:00
Arthur Zucker	1a77f07f65	v4.39.dev.0	2024-02-21 15:23:22 +09:00
Yih-Dar	89439fea64	unpin torch (#28892 ) * unpin torch * check * check * check --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-02-06 17:21:05 +01:00
Matt	74c9cfeaa7	Pin Torch to <2.2.0 (#28785 ) * Pin torch to <2.2.0 * Pin torchvision and torchaudio as well * Playing around with versions to see if this helps * twiddle something to restart the CI * twiddle it back * Try changing the natten version * make fixup * Revert "Try changing the natten version" This reverts commit `de0d6592c3`. * make fixup * fix fix fix * fix fix fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-30 23:01:12 +01:00
amyeroberts	0f8d015a41	Pin pytest version <8.0.0 (#28758 ) * Pin pytest version <8.0.0 * Update setup.py * make deps_table_update	2024-01-29 15:22:14 +00:00
Yih-Dar	f8b7c4345a	Unpin pydantic (#28728 ) * try pydantic v2 * try pydantic v2 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-26 17:39:33 +01:00
Lysandre Debut	008a6a2208	Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599 ) * Initial commit * Requirements & tests * Tests * Tests * Rogue import * Rogue torch import * Cleanup * Apply suggestions from code review Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * bfloat16 management * Sanchit's comments * Import shield * apply suggestions from code review * correct bf16 * rebase --------- Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>	2024-01-23 10:28:23 +01:00
Amy Roberts	b2748a6efd	v4.38.dev.0	2024-01-19 10:43:28 +00:00
Yih-Dar	59cd9de39d	Byebye torch 1.10 (#28207 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-01-11 16:18:27 +01:00
Joao Gante	ee2482b6f8	CI: limit natten version (#28432 )	2024-01-10 12:39:05 +00:00
Lysandre	3ed3e3190c	Dev version	2023-12-13 18:29:31 +01:00
Justin Yu	5fa66df3f3	[integration] Update Ray Tune integration for Ray 2.7 (#26499 ) * fix tune integration for ray 2.7+ Signed-off-by: Justin Yu <justinvyu@anyscale.com> * add version check for ray tune backend availability Signed-off-by: Justin Yu <justinvyu@anyscale.com> * missing import Signed-off-by: Justin Yu <justinvyu@anyscale.com> * pin min version instead Signed-off-by: Justin Yu <justinvyu@anyscale.com> * address comments Signed-off-by: Justin Yu <justinvyu@anyscale.com> * some fixes Signed-off-by: Justin Yu <justinvyu@anyscale.com> * fix unnecessary final checkpoint Signed-off-by: Justin Yu <justinvyu@anyscale.com> * fix lint Signed-off-by: Justin Yu <justinvyu@anyscale.com> * dep table fix Signed-off-by: Justin Yu <justinvyu@anyscale.com> * fix lint Signed-off-by: Justin Yu <justinvyu@anyscale.com> --------- Signed-off-by: Justin Yu <justinvyu@anyscale.com>	2023-12-09 11:04:13 +01:00
Yih-Dar	96f9caa10b	pin `ruff==0.1.5` (#27849 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-12-05 10:17:23 +01:00
Sourab Mangrulkar	a761d6e9a0	Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration (#27652 ) * add code changes 1. Refactor FSDP 2. Add `--save_only_model` option: When checkpointing, whether to only save the model, or also the optimizer, scheduler & rng state. 3. Bump up the minimum `accelerate` version to `0.21.0` * quality * fix quality? * Revert "fix quality?" This reverts commit `149330a6ab`. * fix fsdp doc strings * fix quality * Update src/transformers/training_args.py Co-authored-by: Zach Mueller <muellerzr@gmail.com> * please fix the quality issue 😅 * Apply suggestions from code review Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * address comment * simplify conditional check as per the comment * update documentation --------- Co-authored-by: Zach Mueller <muellerzr@gmail.com> Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>	2023-11-24 11:40:52 +05:30
Arthur	b54993aa94	[`dependency`] update pillow pins (#27409 ) * update pillow pins * Apply suggestions from code review * more freedomin pins	2023-11-22 09:40:30 +01:00
Arthur	651408a077	[`Styling`] stylify using ruff (#27144 ) * try to stylify using ruff * might need to remove these changes? * use ruf format andruff check * use isinstance instead of type comparision * use # fmt: skip * use # fmt: skip * nits * soem styling changes * update ci job * nits isinstance * more files update * nits * more nits * small nits * check and format * revert wrong changes * actually use formatter instead of checker * nits * well docbuilder is overwriting this commit * revert notebook changes * try to nuke docbuilder * style * fix feature exrtaction test * remve `indent-width = 4` * fixup * more nits * update the ruff version that we use * style * nuke docbuilder styling * leve the print for detected changes * nits * Remove file I/O Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com> * style * nits * revert notebook changes * Add # fmt skip when possible * Add # fmt skip when possible * Fix * More ` # fmt: skip` usage * More ` # fmt: skip` usage * More ` # fmt: skip` usage * NIts * more fixes * fix tapas * Another way to skip * Recommended way * Fix two more fiels * Remove asynch Remove asynch --------- Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>	2023-11-16 17:43:19 +01:00
Lucain	fd65aa9818	Set `usedforsecurity=False` in hashlib methods (FIPS compliance) (#27483 ) * Set usedforsecurity=False in hashlib methods (FIPS compliance) * trigger ci * tokenizers version * deps * bump hfh version * let's try this	2023-11-16 14:29:53 +00:00
Matt	4989e73e2f	Update the TF pin for 2.15 (#27375 ) * Move the TF pin for 2.15 * make fixup	2023-11-16 13:47:43 +00:00
Arthur	3d1a7bf476	[`tokenizers`] update `tokenizers` version pin (#27494 ) * update `tokenizers` version pin * force tokenizers>=0.15 * use 0.14 Co-authored-by: Lysandre <lysandre@huggingface.co> --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2023-11-15 10:46:02 +01:00
Lysandre	bc78fd1274	Dev version	2023-11-02 18:15:36 +01:00
Zach Mueller	34a640642b	Save TB logs as part of push_to_hub (#27022 ) * Support runs/ * Upload runs folder as part of push to hub * Add a test * Add to test deps * Update with proposed solution from Slack * Ensure that repo gets deleted in tests	2023-10-26 12:13:19 -04:00
Lysandre Debut	700329493d	Limit to inferior fsspec version (#27010 ) Pin fsspec	2023-10-23 12:34:21 +02:00
Matt	cbd278f0f6	Pin Keras for now (#26904 ) * Pin Keras for now out of paranoia * Add the keras pin to _tests_requirements.txt too * Make sure the Keras version matches the TF one * make fixup	2023-10-19 14:39:31 +01:00
statelesshz	27597fea07	remove SharedDDP as it is deprecated (#25702 ) * remove SharedDDP as it was drepracated * apply review suggestion * make style * Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer. * remove the unnecessary conditional statement * keep the logic of IPEX * clean code * mix precision setup & make fixup --------- Co-authored-by: statelesshz <jihuazhong1@huawei.com>	2023-10-06 16:03:11 +02:00
Lysandre	bd6205919a	v4.35.0.dev0	2023-10-03 16:54:37 +02:00
Arthur	b132c1703e	update hf hub dependency to be compatible with the new tokenizers (#26301 )	2023-09-21 14:57:36 +02:00
Arthur	2da8853775	🚨🚨 🚨🚨 [`Tokenizer`] attemp to fix add_token issues🚨🚨 🚨🚨 (#23909 ) * fix test for bart. Order is correct now let's skip BPEs * ouf * styling * fix bert.... * slow refactoring * current updates * massive refactoring * update * NICE! * update to see where I am at * updates * update * update * revert * updates * updates * start supporting legacy_save * styling * big update * revert some changes * nits * nniiiiiice * small fixes * kinda fix t5 with new behaviour * major update * fixup * fix copies * today's updates * fix byt5 * upfate * update * update * updates * update vocab size test * Barthez does not use not need the fairseq offset ids * super calll must be after * calll super * move all super init * move other super init * fixup * nits * more fixes * nits * more fixes * nits * more fix * remove useless files * ouch all of them are affected * and more! * small imporvements * no more sanitize token * more changes around unique no split tokens * partially fix more things * keep legacy save but add warning * so... more fixes * updates * guess deberta tokenizer could be nuked * fixup * fixup did some bad things * nuke it if it breaks * remove prints and pretrain fast from slow with new format. * fixups * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fiou * nit * by default specials should not be normalized? * update * remove brakpoint * updates * a lot of updates * fixup * fixes revert some changes to match fast * small nits * that makes it cleaner * fix camembert accordingly * update * some lest breaking changes * update * fixup * fix byt5 and whisper mostly * some more fixes, canine's byte vocab * fix gpt2 * fix most of the perceiver tests (4 left) * fix layout lmv3 * fixup * fix copies for gpt2 style * make sure to only warn once * fix perciever and gpt2 tests * some more backward compatibility: also read special tokens map because some ppl use it........////..... * fixup * add else when reading * nits * fresh updates * fix copies * will this make everything faster? * fixes * more fixes * update * more fixes * fixup * is the source of truth right? * sorry camembert for the troubles * current updates * fixup * update led * update * fix regression * fix single word * more model specific fixes * fix t5 tests * fixup * more comments * update * fix nllb * rstrip removed * small fixes * better handle additional_special_tokens and vocab sizes * fixing * styling * fix 4 / 21 * fixup * fix nlbb's tests * some fixes * fix t5 * fixes * style * fix canine tests * damn this is nice * nits * m2m100 nit * fixups * fixes! * fixup * stash * fix merge * revert bad change * fixup * correct order for code Llama * fix speecht5 post merge * styling * revert source of 11 fails * small nits * all changes in one go * fnet hack * fix 2 more tests * update based on main branch of tokenizers * fixup * fix VITS issues * more fixes * fix mgp test * fix camembert issues * oups camembert still has 2 failing tests * mluke fixes * decode fixes * small nits * nits * fix llama and vits * fix camembert * smal nits * more fixes when initialising a fast from a slow and etc * fix one of the last test * fix CPM tokenizer test * fixups * fix pop2piano * fixup * ⚠️ Change tokenizers required version ⚠️ * ⚠️ Change tokenizers required version ⚠️ * "tokenizers>=0.14,<0.15", don't forget smaller than * fix musicgen tests and pretraiendtokenizerfast * fix owlvit and all * update t5 * fix 800 red * fix tests * fix the fix of the fix of t5 * styling * documentation nits * cache _added_tokens_encoder * fixups * Nit * fix red tests * one last nit! * make eveything a lot simpler * Now it's over 😉 * few small nits * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * updates that work for now * tests that should no be skipped / changed and fixed next * fixup * i am ashamed * pushe the fix * update * fixups * nits * fix added_tokens_encoder * fix canine test * fix pegasus vocab * fix transfoXL * fixup * whisper needs to be fixed for train new * pegasus nits * more pegasus fixes * minor update * better error message in failed test * fix whisper failing test * fix whisper failing test * fix pegasus * fixup * fix **** pegasus * reset things * remove another file * attempts to fix the strange custome encoder and offset * nits here and there * update * fixup * nit * fix the whisper test * nits nits * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * updates based on review * some small update to potentially remove * nits * import rlu cache * Update src/transformers/tokenization_utils_base.py Co-authored-by: Lysandre Debut <hi@lysand.re> * move warning to `from_pretrained` * update tests results now that the special tokens are always added --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-09-18 20:28:36 +02:00
Lysandre	d8e13b3e04	v4.34.dev.0	2023-09-04 15:12:11 -04:00
Yih-Dar	3fb1535b09	Update `setup.py` (#25893 ) update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-31 18:54:01 +02:00
Matt	62396cff46	TF 2.14 compatibility (#25630 ) * Update the TF pin and see if anything breaks * make fixup * make fixup * make fixup	2023-08-22 13:13:38 +01:00
Sylvain Gugger	5c67682b16	v4.33.0.dev0	2023-08-21 07:07:04 -04:00
Sylvain Gugger	2defb6b048	More utils doc (#25457 ) * Document and clean more utils. * More documentation and fixes * Switch to Lysandre's token * Address review comments * Actually put else	2023-08-17 07:58:35 +02:00
Sylvain Gugger	baf1daa58e	Migrate Trainer from `Repository` to `upload_folder` (#25095 ) * First draft * Deal with progress bars * Update src/transformers/utils/hub.py Co-authored-by: Lucain <lucainp@gmail.com> * Address review comments * Forgot one * Pin hf_hub * Add argument for push all and fix tests * Fix tests * Address review comments --------- Co-authored-by: Lucain <lucainp@gmail.com>	2023-08-07 17:47:22 +02:00
Sanchit Gandhi	66c240f3c9	[JAX] Bump min version (#25286 ) * [JAX] Bump min version * make fixup	2023-08-03 16:05:02 +01:00
Sylvain Gugger	e9ad51306f	4.32.0.dev0	2023-07-17 13:30:44 -04:00
Georgie Mathews	0866705022	Update setup.py to be compatible with pipenv (#24789 )	2023-07-13 12:56:43 -04:00
Yih-Dar	e538189931	Upgrade jax/jaxlib/flax pin versions (#24791 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-13 13:57:30 +02:00
Yih-Dar	6eedfa6dd1	Pin `Pillow` for now (#24633 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-03 12:24:46 +02:00
Serge Matveenko	d51aa48a76	Limit Pydantic to V1 in dependencies (#24596 ) * Limit Pydantic to V1 in dependencies Pydantic is about to release V2 release which will break a lot of things. This change prevents `transformers` to be used with Pydantic V2 to avoid breaking things. * more --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-01 00:04:03 +02:00
Yih-Dar	299aafe55f	Use protobuf 4 (#24599 ) * fix * fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-30 20:56:55 +02:00
Yih-Dar	11cb6e0f7e	Unpin DeepSpeed and require DS >= 0.9.3 (#24541 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-28 14:01:22 +02:00
Yih-Dar	e84bf1f734	⚠️ Time to say goodbye to py37 (#24091 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-28 07:22:39 +02:00
Matt	8e164c5400	Improved keras imports (#24448 ) * An end to accursed version-specific imports * No more K.is_keras_tensor() either * Update dependency tables * Use a cleaner call context function getter * Add a cap to <2.14 * Add cap to examples requirements too	2023-06-23 19:09:34 +01:00
Sylvain Gugger	26a2ec56d7	Clean up old Accelerate checks (#24279 ) * Clean up old Accelerate checks * Put back imports	2023-06-14 12:44:09 -04:00
Sylvain Gugger	8c5f306719	Update the pin on Accelerate (#24110 )	2023-06-08 10:11:01 -04:00
Sylvain Gugger	ba695c1efd	v4.31.0.dev0	2023-06-07 16:49:00 -04:00
Zachary Mueller	5eb3d3c702	Up pinned accelerate version (#24089 ) * Min accelerate * Also min version * Min accelerate * Also min version * To different minor version * Empty	2023-06-07 16:21:51 -04:00
Sylvain Gugger	9193188276	Pin rhoknp (#23937 )	2023-06-01 10:25:43 -04:00
Zachary Mueller	55451c66ce	Upgrade safetensors version (#23911 ) * Upgrade safetensors * Second table	2023-05-31 11:30:39 -04:00
Sanchit Gandhi	8f915c450d	Unpin numba (#23162 ) * fix for ragged list * unpin numba * make style * np.object -> object * propagate changes to tokenizer as well * np.long -> "long" * revert tokenization changes * check with tokenization changes * list/tuple logic * catch numpy * catch else case * clean up * up * better check * trigger ci * Empty commit to trigger CI	2023-05-31 14:59:30 +01:00
Nicolas Patry	9e8d7066e6	Making `safetensors` a core dependency. (#23254 ) * Making `safetensors` a core dependency. To be merged later, I'm creating the PR so we can try it out. * Update setup.py * Remove duplicates. * Even more redundant.	2023-05-23 15:16:34 +02:00
Sylvain Gugger	9cf4a8b456	Build with non Python files (#23405 ) * Add a test of the built release * Polish everything * Trigger CI	2023-05-16 14:23:10 -04:00
Yih-Dar	a3975f94f3	Only add files with modification outside doc blocks (#23327 ) * min. version for pytest * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 16:35:15 +02:00
Sylvain Gugger	786b9cf5ca	Style	2023-05-11 14:40:38 -04:00
Lysandre Debut	71b19ee251	Agents extras (#23301 ) * Agents extras * Add to docs	2023-05-11 14:25:51 -04:00
José Ángel Rey Liñares	0c65fb7cfa	chore: allow protobuf 3.20.3 requirement (#22759 ) * chore: allow protobuf 3.20.3 Allow latest bugfix release for protobuf (3.20.3) * chore: update auto-generated dependency table update auto-generated dependency table * run in subprocess * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-10 20:22:56 +02:00
Sylvain Gugger	a0c0a78233	v4.30.0.dev0	2023-05-09 14:59:38 -04:00
Sylvain Gugger	94056b57be	New version of Accelerate for the Trainer (#23204 )	2023-05-08 09:47:08 -04:00
Sylvain Gugger	3341bb41cd	Pin urllib3	2023-05-04 12:00:22 -04:00
Sylvain Gugger	4b6aecb48e	Pin numba for now (#23118 )	2023-05-02 22:02:39 -04:00
amyeroberts	e5f3487190	Pin flax & optax version (#22895 ) * Pin optax version * Pin flax too * Fixup	2023-04-20 17:30:14 +01:00
Zachary Mueller	aec10d162f	Update accelerate version + warning check fix (#22833 )	2023-04-18 12:51:32 -04:00
Zachary Mueller	03462875cc	Introduce `PartialState` as the device handler in the `Trainer` (#22752 ) * Use accelerate for device management * Add accelerate to setup Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-04-17 15:09:45 -04:00
Sylvain Gugger	888c4a2ae0	v4.29.0.dev0	2023-04-12 20:04:29 -04:00
Sylvain Gugger	6db23af50c	Revert migration of setup to pyproject.toml (#22658 )	2023-04-07 15:08:44 -04:00
Nicolas Patry	1670be4bde	Adding Llama FastTokenizer support. (#22264 ) * Adding Llama FastTokenizer support. - Requires https://github.com/huggingface/tokenizers/pull/1183 version - Only support byte_fallback for llama, raise otherwise (safety net). - Lots of questions are special tokens How to test: ```python from transformers.convert_slow_tokenizer import convert_slow_tokenizer from transformers import AutoTokenizer from tokenizers import Tokenizer tokenizer = AutoTokenizer.from_pretrained("huggingface/llama-7b") if False: new_tokenizer = Tokenizer.from_file("tok.json") else: new_tokenizer = convert_slow_tokenizer(tokenizer) new_tokenizer.save("tok.json") strings = [ "This is a test", "生活的真谛是", "生活的真谛是[MASK]。", # XXX: This one is problematic because of special tokens # "<s> Something something", ] for string in strings: encoded = tokenizer(string)["input_ids"] encoded2 = new_tokenizer.encode(string).ids assert encoded == encoded2, f"{encoded} != {encoded2}" decoded = tokenizer.decode(encoded) decoded2 = new_tokenizer.decode(encoded2) assert decoded.strip() == decoded2, f"{repr(decoded)} != {repr(decoded2)}" ``` The converter + some test script. The test script. Tmp save. Adding Fast tokenizer + tests. Adding the tokenization tests. Correct combination. Small fix. Fixing tests. Fixing with latest update. Rebased. fix copies + normalized added tokens + copies. Adding doc. TMP. Doc + split files. Doc. Versions + try import. Fix Camembert + warnings -> Error. Fix by ArthurZucker. Not a decorator. * Fixing comments. * Adding more to docstring. * Doc rewriting.	2023-04-06 09:53:03 +02:00
Xuehai Pan	4169dc84bf	[setup] migrate setup script to `pyproject.toml` (#22539 ) * [setup] migrate setup script to `pyproject.toml` * [setup] cleanup configurations * remove unused imports	2023-04-03 14:03:41 -04:00
Xuehai Pan	80d1319e1b	[setup] drop deprecated `distutils` usage (#22531 ) * [setup] drop deprecated `distutils` usage * drop deprecated `distutils.util.strtobool` usage * fix import order * reformat docstring by `doc-builder`	2023-04-03 12:04:24 -04:00
Sylvain Gugger	2194943a34	Pin ruff (#22455 )	2023-03-29 14:07:06 -04:00
Sylvain Gugger	4c295a265b	Update release instructions (#22454 )	2023-03-29 14:05:42 -04:00
Joao Gante	88dae78f4d	TensorFlow: pin maximum version to 2.12 (#22364 )	2023-03-24 18:45:03 +00:00
Sylvain Gugger	6587125c0a	Pin tensorflow-text to go with tensorflow (#22362 ) * Pin tensorflow-text to go with tensorflow * Make it more convenient to pin TensorFlow * setup don't like f-strings	2023-03-24 10:54:06 -04:00
Stas Bekman	89a0a9eace	[deepspeed] offload + non-cpuadam optimizer exception doc (#22044 ) * [deepspeed] offload + non-cpuadam optimizer exception doc * deps	2023-03-21 17:00:05 -07:00
Ali Hassani	5990743fdd	Correct NATTEN function signatures and force new version (#22298 )	2023-03-21 17:21:34 -04:00
Yih-Dar	67c2dbdb54	Time to Say Goodbye, torch 1.7 and 1.8 (#22291 ) * time to say goodbye, torch 1.7 and 1.8 * clean up torch_int_div * clean up is_torch_less_than_1_8-9 * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-21 19:22:01 +01:00
Ali Hassani	3028b20a71	Fix natten (#22229 ) * Add kernel size to NATTEN's QK arguments. The new NATTEN 0.14.5 supports PyTorch 2.0, but also adds an additional argument to the QK operation to allow optional RPBs. This ends up failing NATTEN tests. This commit adds NATTEN back to circleci and adds the arguments to get it working again. * Force NATTEN >= 0.14.5	2023-03-17 11:07:55 -04:00

1 2 3 4 5 ...

533 Commits