transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Lysandre Debut	0eb8fbcdac	Remove task guides auto-update in favor of links towards task pages (#30429 )	2024-04-24 09:38:10 +02:00
Arthur	e68ff30419	[`quality`] update quality check to make sure we check imports 😈 (#29771 ) * update quality check * make it nice * update * let's make sure it runs and we have the logs actually * update workflow * nits	2024-03-22 10:11:59 +01:00
Yih-Dar	bb3bd44739	Fix the check of models supporting FA/SDPA not run (#28202 ) * add check_support_list.py * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-12-22 12:56:11 +01:00
Arthur	651408a077	[`Styling`] stylify using ruff (#27144 ) * try to stylify using ruff * might need to remove these changes? * use ruf format andruff check * use isinstance instead of type comparision * use # fmt: skip * use # fmt: skip * nits * soem styling changes * update ci job * nits isinstance * more files update * nits * more nits * small nits * check and format * revert wrong changes * actually use formatter instead of checker * nits * well docbuilder is overwriting this commit * revert notebook changes * try to nuke docbuilder * style * fix feature exrtaction test * remve `indent-width = 4` * fixup * more nits * update the ruff version that we use * style * nuke docbuilder styling * leve the print for detected changes * nits * Remove file I/O Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com> * style * nits * revert notebook changes * Add # fmt skip when possible * Add # fmt skip when possible * Fix * More ` # fmt: skip` usage * More ` # fmt: skip` usage * More ` # fmt: skip` usage * NIts * more fixes * fix tapas * Another way to skip * Recommended way * Fix two more fiels * Remove asynch Remove asynch --------- Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>	2023-11-16 17:43:19 +01:00
Sylvain Gugger	03af4c42a6	Docstring check (#26052 ) * Fix number of minimal calls to the Hub with peft integration * Alternate design * And this way? * Revert * Nits to fix * Add util * Print when changes are made * Add list to ignore * Add more rules * Manual fixes * deal with kwargs * deal with enum defaults * avoid many digits for floats * Manual fixes * Fix regex * Fix regex * Auto fix * Style * Apply script * Add ignored list * Add check that templates are filled * Adding to CI checks * Add back semi-fix * Ignore more objects * More auto-fixes * Ignore missing objects * Remove temp semi-fix * Fixes * Update src/transformers/models/pvt/configuration_pvt.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update utils/check_docstrings.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Deal with float defaults * Fix small defaults * Address review comment * Treat * Post-rebase cleanup * Address review comment * Update src/transformers/models/deprecated/mctct/configuration_mctct.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comment --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2023-10-04 15:13:37 +02:00
Yih-Dar	fe3c8ab1af	Revert "Reuse the cache created for latest `main` on PRs/branches" (#25466 ) Revert "Reuse the cache created for latest `main` on PRs/branches if `setup.py` is not modified (#25445)" This reverts commit `1d75768695`.	2023-08-11 21:07:08 +02:00
Yih-Dar	1d75768695	Reuse the cache created for latest `main` on PRs/branches if `setup.py` is not modified (#25445 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-11 14:40:51 +02:00
Yih-Dar	f14c7f999d	Fix CircleCI cache (#24880 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-18 13:45:00 +02:00
Yih-Dar	050ef14516	Unpin `huggingface_hub` (#24667 ) * fix * fix * fix * [test all] commit * [test all] commit * [test all] commit --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-05 16:49:10 +02:00
Yih-Dar	2c977e4a90	Save `site-packages` as cache in CircleCI job (#24424 ) * fix * fix * Upgrade complete! --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-22 23:16:35 +02:00
Yih-Dar	8f2ef52fb6	Fix `save_cache` version in `config.yml` (#24419 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-22 16:18:16 +02:00
Yih-Dar	02d255db26	bring back `filtered_test_list_cross_tests.txt` (#24055 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-06 19:35:24 +02:00
Sylvain Gugger	6e4bc67099	Revamp test selection for the example tests (#23737 ) * Revamp test selection for the example tests * Rename old XLA test and fake modif in run_glue * Fixes * Fake Trainer modif * Remove fake modifs	2023-05-25 09:38:21 -04:00
Yih-Dar	ca3df9f0cf	Run doctest (in PRs) only when some doc example(s) are modified (#23387 ) * fix * fix * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-16 23:29:02 +02:00
Sylvain Gugger	69ee46243c	Revert "[Doctests] Refactor doctests + add CI" (#23245 ) Revert "[Doctests] Refactor doctests + add CI (#22987)" This reverts commit `627f44799a`.	2023-05-09 15:26:15 -04:00
Arthur	627f44799a	[Doctests] Refactor doctests + add CI (#22987 ) * intiial commit * new styling * update * just run doctest in CI * remove more test for fast dev * update * update refs * update path and fetch upstream * update documentatyion trests * typo * parse pwd * don't check for files that are in hidden folders * just give paths relative to transformers * update * update * update * major refactoring * make sure options is ok * lest test that mdx is tested * doctest glob * nits * update doctest nightly * some cleaning * run correct test on diff * debug * run on a single worker * skip_cuda_test tampkate * updates * add rA and continue on failure * test options * parse `py` codeblock? * we don't need to replace ignore results, don't remember whyu I put it * cleanup * more cleaning * fix arg * more cleaning * clean an todo * more pre-processing * doctest-module has none so extra `- ` is needed * remove logs * nits * doctest-modules .... * oups * let's use sugar * make dataset go quiet * add proper timeout * nites * spleling timeout * update * properly skip tests that have CUDSA * proper skipping * cleaning main and get tests to run * remove make report? * remove tee * some updates * tee was removed but is the full output still available? * [all-test] * only our tests * don't touch tee in this PR * no atee-sys * proper sub * monkey * only replace call * fix sub * nits * nits * fix invalid syntax * add skip cuda doctest env variable * make sure all packages are installed * move file * update check repo * revert changes * nit * finish cleanup * fix re * findall * update don't test init files * ignore pycache * `-ignore-pycache` when running pytests * try to fix the import missmatch error * install dec * pytest is required as doctest_utils imports things from it * the only log issues were dataset, ignore results should work * more cleaning * Update .circleci/create_circleci_config.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * [ydshieh] empty string if cuda is found * [ydshieh] fix condition * style * [ydshieh] fix * Add comment * style * style * show failure * trigger CI --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-09 20:34:48 +02:00
Yih-Dar	dfeb5aa6a9	extend the test files (#23043 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 22:25:34 +02:00
Sylvain Gugger	c612628045	Test fetch v2 (#22367 ) * Test fetcher v2 * Fix regexes * Remove sanity check * Fake modification to OPT * Fixes some .sep issues * Remove fake OPT change * Fake modif for BERT * Fake modif for init * Exclude SageMaker tests * Fix test and remove fake modif * Fake setup modif * Fake pipeline modif * Remove all fake modifs * Adds options to skip/force tests * [test-all-models] Fake modif for BERT * Try this way * Does the command actually work? * [test-all-models] Try again! * [skip circleci] Remove fake modif * Remove debug statements * Add the list of important models * Quality * Update utils/tests_fetcher.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Address review comments * Fix and add test * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Address review comments --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-03-31 16:18:43 -04:00
Yih-Dar	5110e5748e	🔥py38 + torch 2 🔥🔥🔥🚀 (#22204 ) * py38 + torch 2 * increment cache versions --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-16 22:59:23 +01:00
Yih-Dar	479322bfaa	A new test to check config attributes being used (#21453 ) * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions * Update allowed cases - part 1 * Update allowed cases - part 2 * final --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-07 17:49:30 +01:00
Sylvain Gugger	6f79d26442	Update quality tooling for formatting (#21480 ) * Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies	2023-02-06 18:10:56 -05:00
Maria Khalusova	73a2ff6974	Automated compatible models list for task guides (#21338 ) * initial commit. added tip placeholders and a script * removed unused imports, fixed paths * fixed generated links * make style * split language modeling doc into two: causal language modeling and masked language modeling * added check_task_guides.py to make fix-copies * review feedback addressed	2023-01-27 13:19:28 -05:00
Yih-Dar	857bad6e53	check paths in `utils/documentation_tests.txt` (#21315 ) * check paths in utils/documentation_tests.txt * check paths in utils/documentation_tests.txt Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 15:33:47 +01:00
Yih-Dar	8f09dd89f6	Avoid CI runs under users' own CircleCI personal account (#20981 ) * Avoid null CI * Avoid null CI * rename * more clear error message * Update .circleci/config.yml Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * clean up Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-03 16:19:38 +01:00
Yih-Dar	305e8718b4	Show installed libraries and their versions in CI jobs (#20026 ) * Show versions * check * store outputs * revert Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-02 20:52:39 +01:00
Yih-Dar	8db92dbe26	Fix nightly CircleCI (#19837 ) * Fix nightly CircleCI * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-24 16:00:02 +02:00
ydshieh	6f8064da6b	install GitPython	2022-10-24 09:54:15 +02:00
Sylvain Gugger	b58d4f70f6	Fix nightly test setup (#19792 )	2022-10-21 10:26:30 -04:00
Sylvain Gugger	a929f81e92	Repo utils test (#19696 ) * Create repo utils test job * Last occurence * Add tests for tests_fetcher * Better filtering * Let's learn more * Should fix * Should fix * Remove debug * Style * WiP WiP WiP WiP WiP WiP WiP WiP WiP * Quality * address review comments * Fix link	2022-10-18 13:47:36 -04:00
Sylvain Gugger	69b81c0a5f	Use a dynamic configuration for circleCI tests (#19325 ) * Generate config on the file * Fake modif for all test launch * Upload more artifacts * Typo and quality * Try converting th yml to txt * Leave my long lines alone yaml * Debug prints * Debug prints v2 * Try without sorting * Was it really working before? * Typo * Use a parameter * Use a parameter? * Typo * Here is some JSON * Another try * Learning to read... * Check default is used * Does this work? * With continuation * WiP * Use a parameter for test list * Other fake modif * With the comma * Name the test step so it doesn't blow up * Just one example modification * Final steps * Add nightlies * Move config generator * Add trigger for nightlies * Better workflow * Rebase on recent changes * Fix config creation * Fake modif in an example * Now fake modif in one config file * Fix install step in custom tokenizers test * Fix generated config * Better fix hopefully * Finally test modif in setup * final cleanup	2022-10-11 16:31:24 -04:00
Sylvain Gugger	9ac586b3c8	Rework pipeline tests (#19366 ) * Rework pipeline tests * Try to fix Flax tests * Try to put it before * Use a new decorator instead * Remove ignore marker since it doesn't work * Filter pipeline tests * Woopsie * Use the fitlered list * Clean up and fake modif * Remove init * Revert fake modif	2022-10-07 18:01:58 -04:00
r-terada	2f53ab5745	Add sudachi and jumanpp tokenizers for bert_japanese (#19043 ) * add sudachipy and jumanpp tokenizers for bert_japanese * use ImportError instead of ModuleNotFoundError in SudachiTokenizer and JumanppTokenizer * put test cases of test_tokenization_bert_japanese in one line * add require_sudachi and require_jumanpp decorator for testing * add sudachi and pyknp(jumanpp) to dependencies * remove sudachi_dict_small and sudachi_dict_full from dependencies * empty commit for ci	2022-10-05 11:41:37 -04:00
Sylvain Gugger	655f72a689	Fix test fetching for examples (#19237 ) * Fix test fetching for examples * Fake example modif * Debug statements * Typo * You need to persist the file... * Revert change in example * Remove debug statements	2022-09-29 09:36:42 -04:00
Yih-Dar	64998a57fb	Fix cache names in CircleCI jobs (#19223 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-28 18:26:12 +02:00
Sylvain Gugger	820cb97a3f	Organize test jobs (#19058 ) * Tests conditional run * Syntax * Deps * Try early exit * Another way * Test with no tests to run * Test all * Typo * Try this way * With tests to run * Mostly finished * Typo * With a modification in one file only * No change, no tests * Final cleanup * Address review comments	2022-09-16 09:19:51 -04:00
Sylvain Gugger	f7ce4f1ff7	Fix custom tokenizers test (#19052 ) * Fix CI for custom tokenizers * Add nightly tests * Run CI, run! * Fix paths * Typos * Fix test	2022-09-15 11:31:09 -04:00
Sylvain Gugger	3774010161	Automate check for new pipelines and metadata update (#19029 ) * Automate check for new pipelines and metadata update * Add Datasets to quality extra	2022-09-14 14:06:49 -04:00
Craig Chan	fbf382c84d	Determine framework automatically before ONNX export (#18615 ) * Automatic detection for framework to use when exporting to ONNX * Log message change * Incorporating PR comments, adding unit test * Adding tf for pip install for run_tests_onnxruntime CI * Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py * Fixup * Adding test to fetcher * Updating circleci config to log more * Changing test class name * Comment typo fix in tests/onnx/test_features.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Moving torch_str/tf_str to self.framework_pt/tf * Remove -rA flag in circleci config Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2022-08-25 16:31:34 +02:00
Yih-Dar	84beb8a49b	Unpin detectron2 (#18727 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-23 11:10:07 +02:00
Yih-Dar	2c947d2939	Ping `detectron2` for CircleCI tests (#18680 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-18 12:57:18 +02:00
Matt	6eb51450fa	TF Examples Rewrite (#18451 ) * Finished QA example * Dodge a merge conflict * Update text classification and LM examples * Update NER example * New Keras metrics WIP, fix NER example * Update NER example * Update MC, summarization and translation examples * Add XLA warnings when shapes are variable * Make sure batch_size is consistently scaled by num_replicas * Add PushToHubCallback to all models * Add docs links for KerasMetricCallback * Add docs links for prepare_tf_dataset and jit_compile * Correct inferred model names * Don't assume the dataset has 'lang' * Don't assume the dataset has 'lang' * Write metrics in text classification * Add 'framework' to TrainingArguments and TFTrainingArguments * Export metrics in all examples and add tests * Fix training args for Flax * Update command line args for translation test * make fixup * Fix accidentally running other tests in fp16 * Remove do_train/do_eval from run_clm.py * Remove do_train/do_eval from run_mlm.py * Add tensorflow tests to circleci * Fix circleci * Update examples/tensorflow/language-modeling/run_mlm.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/test_tensorflow_examples.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/translation/run_translation.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/token-classification/run_ner.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Fix save path for tests * Fix some model card kwargs * Explain the magical -1000 * Actually enable tests this time * Skip text classification PR until we fix shape inference * make fixup Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2022-08-10 16:49:51 +01:00
Sylvain Gugger	70b0d4e193	Fix compatibility with 1.12 (#17925 ) * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 09:53:08 -04:00
Yih-Dar	6649133124	Add PYTEST_TIMEOUT for CircleCI test jobs (#18251 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-26 17:57:59 +02:00
Yih-Dar	4b1ed7979f	update cache to v0.5 (#18203 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-20 08:14:10 +02:00
Yih-Dar	05ed569c79	Use next-gen CircleCI convenience images (#18197 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-07-19 15:43:05 +02:00
Sylvain Gugger	1b749a7f8d	Sort doc toc (#18034 ) * Add script to sort doc ToC * Style and fixes * Add check to quality job	2022-07-07 08:17:58 -04:00
Yih-Dar	216499bfcc	Fix CI tests hang forever (#17471 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-02 10:30:54 +02:00
Joao Gante	ca1f1c8685	CLI: tool to convert PT into TF weights and open hub PR (#17497 )	2022-06-01 18:52:07 +01:00
NielsRogge	31ee80d556	Add LayoutLMv3 (#17060 ) * Make forward pass work * More improvements * Remove unused imports * Remove timm dependency * Improve loss calculation of token classifier * Fix most tests * Add docs * Add model integration test * Make all tests pass * Add LayoutLMv3FeatureExtractor * Improve integration test + make fixup * Add example script * Fix style * Add LayoutLMv3Processor * Fix style * Add option to add visual labels * Make more tokenizer tests pass * Fix more tests * Make more tests pass * Fix bug and improve docs * Fix import of processors * Improve docstrings * Fix toctree and improve docs * Fix auto tokenizer * Move tests to model folder * Move tests to model folder * change default behavior add_prefix_space * add prefix space for fast * add_prefix_spcae set to True for Fast * no space before `unique_no_split` token * add test to hightligh special treatment of added tokens * fix `test_batch_encode_dynamic_overflowing` by building a long enough example * fix `test_full_tokenizer` with add_prefix_token * Fix tokenizer integration test * Make the code more readable * Add tests for LayoutLMv3Processor * Fix style * Add model to README and update init * Apply suggestions from code review * Replace asserts by value errors * Add suggestion by @ducviet00 * Add model to doc tests * Simplify script * Improve README * a step ahead to fix * Update pair_input_test * Make all tokenizer tests pass - phew * Make style * Add LayoutLMv3 to CI job * Fix auto mapping * Fix CI job name * Make all processor tests pass * Make tests of LayoutLMv2 and LayoutXLM consistent * Add copied from statements to fast tokenizer * Add copied from statements to slow tokenizer * Remove add_visual_labels attribute * Fix tests * Add link to notebooks * Improve docs of LayoutLMv3Processor * Fix reference to section Co-authored-by: SaulLu <lucilesaul.com@gmail.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-24 09:53:45 +02:00
Sylvain Gugger	ddb1a47ec8	Automatically sort auto mappings (#17250 ) * Automatically sort auto mappings * Better class extraction * Some auto class magic * Adapt test and underlying behavior * Remove re-used config * Quality	2022-05-16 13:24:20 -04:00

1 2 3 4 5

247 Commits