transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 21:00:08 +06:00

Author	SHA1	Message	Date
amyeroberts	a717e0318c	Add TimmBackbone model (#22619 ) * Add test_backbone for convnext * Add TimmBackbone model * Add check for backbone type * Tidying up - config checks * Update convnextv2 * Tidy up * Fix indices & clearer comment * Exceptions for config checks * Correclty update config for tests * Safer imports * Safer safer imports * Fix where decorators go * Update import logic and backbone tests * More import fixes * Fixup * Only import all_models if torch available * Fix kwarg updates in from_pretrained & main rebase * Tidy up * Add tests for AutoBackbone * Tidy up * Fix import error * Fix up * Install nattan in doc_test_job * Revert back to setting self._out_xxx directly * Bug fix - out_indices mapping from out_features * Fix tests * Dont accept output_loading_info for Timm models * Set out_xxx and don't remap * Use smaller checkpoint for test * Don't remap timm indices - check out_indices based on stage names * Skip test as it's n/a * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Cleaner imports / spelling is hard --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-06 17:11:30 +01:00
Sylvain Gugger	0b3d092f63	Empty circleci config (#23913 ) * Try easy first * Add an empty job * Fix name * Fix method	2023-05-31 12:02:05 -04:00
Sylvain Gugger	6e4bc67099	Revamp test selection for the example tests (#23737 ) * Revamp test selection for the example tests * Rename old XLA test and fake modif in run_glue * Fixes * Fake Trainer modif * Remove fake modifs	2023-05-25 09:38:21 -04:00
Yih-Dar	ca3df9f0cf	Run doctest (in PRs) only when some doc example(s) are modified (#23387 ) * fix * fix * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-16 23:29:02 +02:00
Yih-Dar	ba71d9e94c	unpin tf prob (#23293 ) * unpin tf prob --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-11 21:28:08 +02:00
Yih-Dar	6a6225beab	Fix doctest files fetch issue (#23277 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-11 17:14:06 +02:00
Yih-Dar	9088fcae82	Bring back the PR `Refactor doctests + add CI` to `main` (#23271 ) * Revert "Revert "[Doctests] Refactor doctests + add CI" (#23245)" This reverts commit `69ee46243c`. * try not expose HfDocTestParser * move into testing_utils.py * remove pytest install --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-10 22:00:48 +02:00
Sylvain Gugger	69ee46243c	Revert "[Doctests] Refactor doctests + add CI" (#23245 ) Revert "[Doctests] Refactor doctests + add CI (#22987)" This reverts commit `627f44799a`.	2023-05-09 15:26:15 -04:00
Arthur	627f44799a	[Doctests] Refactor doctests + add CI (#22987 ) * intiial commit * new styling * update * just run doctest in CI * remove more test for fast dev * update * update refs * update path and fetch upstream * update documentatyion trests * typo * parse pwd * don't check for files that are in hidden folders * just give paths relative to transformers * update * update * update * major refactoring * make sure options is ok * lest test that mdx is tested * doctest glob * nits * update doctest nightly * some cleaning * run correct test on diff * debug * run on a single worker * skip_cuda_test tampkate * updates * add rA and continue on failure * test options * parse `py` codeblock? * we don't need to replace ignore results, don't remember whyu I put it * cleanup * more cleaning * fix arg * more cleaning * clean an todo * more pre-processing * doctest-module has none so extra `- ` is needed * remove logs * nits * doctest-modules .... * oups * let's use sugar * make dataset go quiet * add proper timeout * nites * spleling timeout * update * properly skip tests that have CUDSA * proper skipping * cleaning main and get tests to run * remove make report? * remove tee * some updates * tee was removed but is the full output still available? * [all-test] * only our tests * don't touch tee in this PR * no atee-sys * proper sub * monkey * only replace call * fix sub * nits * nits * fix invalid syntax * add skip cuda doctest env variable * make sure all packages are installed * move file * update check repo * revert changes * nit * finish cleanup * fix re * findall * update don't test init files * ignore pycache * `-ignore-pycache` when running pytests * try to fix the import missmatch error * install dec * pytest is required as doctest_utils imports things from it * the only log issues were dataset, ignore results should work * more cleaning * Update .circleci/create_circleci_config.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * [ydshieh] empty string if cuda is found * [ydshieh] fix condition * style * [ydshieh] fix * Add comment * style * style * show failure * trigger CI --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-09 20:34:48 +02:00
Sylvain Gugger	006da469dd	Pin tensorflow-probability (#23220 ) * Pin tensorflow-probability * [all-test] * [all-test] Fix syntax for bash	2023-05-08 18:36:22 -04:00
Yih-Dar	dfeb5aa6a9	extend the test files (#23043 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 22:25:34 +02:00
Sylvain Gugger	c612628045	Test fetch v2 (#22367 ) * Test fetcher v2 * Fix regexes * Remove sanity check * Fake modification to OPT * Fixes some .sep issues * Remove fake OPT change * Fake modif for BERT * Fake modif for init * Exclude SageMaker tests * Fix test and remove fake modif * Fake setup modif * Fake pipeline modif * Remove all fake modifs * Adds options to skip/force tests * [test-all-models] Fake modif for BERT * Try this way * Does the command actually work? * [test-all-models] Try again! * [skip circleci] Remove fake modif * Remove debug statements * Add the list of important models * Quality * Update utils/tests_fetcher.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Address review comments * Fix and add test * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Address review comments --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-03-31 16:18:43 -04:00
Joao Gante	c746eb1603	TensorFlow: additional missing `cmake` dependencies in CI (#22383 ) * missing cmake * more cmake	2023-03-27 09:20:56 -04:00
Sylvain Gugger	cfab34e188	Fix TF pipeline job	2023-03-24 16:16:43 -04:00
Joao Gante	88dae78f4d	TensorFlow: pin maximum version to 2.12 (#22364 )	2023-03-24 18:45:03 +00:00
Ali Hassani	3028b20a71	Fix natten (#22229 ) * Add kernel size to NATTEN's QK arguments. The new NATTEN 0.14.5 supports PyTorch 2.0, but also adds an additional argument to the QK operation to allow optional RPBs. This ends up failing NATTEN tests. This commit adds NATTEN back to circleci and adds the arguments to get it working again. * Force NATTEN >= 0.14.5	2023-03-17 11:07:55 -04:00
Yih-Dar	97a3d16a69	Hotfix for natten issue with torch 2.0.0 on CircleCI (#22218 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-16 23:57:26 +01:00
Yih-Dar	5110e5748e	🔥py38 + torch 2 🔥🔥🔥🚀 (#22204 ) * py38 + torch 2 * increment cache versions --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-16 22:59:23 +01:00
Sylvain Gugger	f7329751fe	Run all tests by default (#22162 )	2023-03-14 17:30:43 -04:00
Yih-Dar	b05e0bec88	Use large VM for `repo_utils_job` (#21928 ) upgrade to large VM Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-03 14:43:03 +01:00
Sylvain Gugger	50a8ed3ee0	Mark pipeline tests to skip them easily (#21887 ) * Mark pipeline tests to skip them easily * Mark the mixin as pipeline test * Update src/transformers/testing_utils.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-03-02 10:55:36 -05:00
Yih-Dar	53735d7c3b	Add an utility file to get information from test files (#21856 ) * Add an utility file to get information from test files --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-03-01 17:53:29 +01:00
Yih-Dar	479322bfaa	A new test to check config attributes being used (#21453 ) * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Add a new test to check config attributes being used * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply suggestions * Update allowed cases - part 1 * Update allowed cases - part 2 * final --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-02-07 17:49:30 +01:00
Sylvain Gugger	6f79d26442	Update quality tooling for formatting (#21480 ) * Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies	2023-02-06 18:10:56 -05:00
NielsRogge	5451f8896c	Add DETA (#20983 ) * First draft * Add initial draft of conversion script * Convert all weights * Fix config * Add image processor * Fix DetaImageProcessor * Run make fix copies * Remove timm dependency * Fix dummy objects * Improve loss function * Remove conv_encoder attribute * Update conversion scripts * Improve postprocessing + docs * Fix copied from statements * Add tests * Improve postprocessing * Improve postprocessing * Update READMEs * More improvements * Fix rebase * Add is_torchvision_available * Add torchvision dependency * Fix typo and README * Fix bug * Add copied from * Fix style * Apply suggestions * Fix thanks to @ydshieh * Fix another dependency check * Simplify image processor * Add scipy * Improve code * Add threshold argument * Fix bug * Set default threshold * Improve integration test * Add another integration test * Update setup.py * Address review * Improve deformable attention function * Improve copied from * Use relative imports * Address review * Replace assertions * Address review * Update dummies * Remove dummies * Address comments, update READMEs * Remove custom kernel code * Add image processor tests * Add requires_backends * Add minor comment * Update scripts * Update organization name * Fix defaults, add doc tests * Add id2label for object 365 * Fix tests * Update task guide	2023-01-31 10:43:10 +01:00
Yih-Dar	c749bd405e	Pipeline testing - using tiny models on Hub (#20426 ) * rework pipeline tests * run pipeline tests * fix * fix * fix * revert the changes in get_test_pipeline() parameter list * fix expected error message * skip a test * clean up --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-30 10:39:43 +01:00
Maria Khalusova	73a2ff6974	Automated compatible models list for task guides (#21338 ) * initial commit. added tip placeholders and a script * removed unused imports, fixed paths * fixed generated links * make style * split language modeling doc into two: causal language modeling and masked language modeling * added check_task_guides.py to make fix-copies * review feedback addressed	2023-01-27 13:19:28 -05:00
Yih-Dar	857bad6e53	check paths in `utils/documentation_tests.txt` (#21315 ) * check paths in utils/documentation_tests.txt * check paths in utils/documentation_tests.txt Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-26 15:33:47 +01:00
Yih-Dar	b0969cafd0	Make `parallelism` for CircleCI jobs work - but keep it `1` for now (#21157 ) * split tests * test CI * add if else Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-20 16:41:33 +01:00
Yih-Dar	8f09dd89f6	Avoid CI runs under users' own CircleCI personal account (#20981 ) * Avoid null CI * Avoid null CI * rename * more clear error message * Update .circleci/config.yml Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * clean up Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-01-03 16:19:38 +01:00
Yih-Dar	fe9152f67c	Install vision for TF pipeline tests (#20771 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-15 11:16:37 +01:00
Yih-Dar	f41a11a16f	rename `layoutlm_job` to `exotic_models_job` (#20736 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-12 20:02:16 +01:00
Nathan Raw	9e56aff58a	Add video classification pipeline (#20151 ) * 🚧 wip video classification pipeline * 🚧 wip - add is_decord_available check * 🐛 add missing import * ✅ add tests * 🔧 add decord to setup extras * 🚧 add is_decord_available * ✨ add video-classification pipeline * 📝 add video classification pipe to docs * 🐛 add missing VideoClassificationPipeline import * 📌 add decord install in test runner * ✅ fix url inputs to video-classification pipeline * ✨ updates from review * 📝 add video cls pipeline to docs * 📝 add docstring * 🔥 remove unused import * 🔥 remove some code * 📝 docfix	2022-12-08 16:22:43 -05:00
Sylvain Gugger	72eaaf6d55	Fix nightly runs (#20352 ) * Fix nightly runs * Fix type * Address review comment	2022-11-22 10:38:38 -05:00
Ali Hassani	fc4a993e1b	Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219 ) * Add DiNAT * Adds DiNAT + tests * Minor fixes * Added HF model * Add natten to dependencies. * Cleanup * Minor fixup * Reformat * Optional NATTEN import. * Reformat & add doc to _toctree * Reformat (finally) * Dummy objects for DiNAT * Add NAT + minor changes Adds NAT as its own independent model + docs, tests Adds NATTEN to ext deps to ensure ci picks it up. * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests * Minor fixes. * Fix READMEs. * Requested changes to docs + minor fixes. * Requested changes. * Add NAT/DiNAT tests to layoutlm_job * Correction to Dinat doc. * Requested changes.	2022-11-18 13:08:26 -05:00
Sylvain Gugger	9643ecf8ca	Enable PyTorch 1.13 (#20168 ) * Try PT1.13 by removing torch scatter * Skip failing tests * Style * Remvoe testing extras for repo utils * Try with all decorators * Try to wipe the cache * Fix all tests? * Try this way * Fix comma * Update to main * Try with less deps * Quality	2022-11-15 11:33:09 -05:00
Yih-Dar	305e8718b4	Show installed libraries and their versions in CI jobs (#20026 ) * Show versions * check * store outputs * revert Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-02 20:52:39 +01:00
raghavanone	7829c890db	Change the import of kenlm from github to pypi (#19770 ) * Change the import of kenlm from github to pypi * Change the import of kenlm from github to pypi in circleci config * Fix code quality issues * Fix isort issue, add kenlm in extras for audio * Add kenlm to deps * Add kenlm to deps * Commit 'make fixup' changes * Remove version from kenlm deps * commit make fixup changes * Remove manual installation of kenlm * Remove manual installation of kenlm * Remove manual installation of kenlm	2022-10-26 17:06:46 +02:00
Yih-Dar	8db92dbe26	Fix nightly CircleCI (#19837 ) * Fix nightly CircleCI * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-24 16:00:02 +02:00
ydshieh	6f8064da6b	install GitPython	2022-10-24 09:54:15 +02:00
Sylvain Gugger	b58d4f70f6	Fix nightly test setup (#19792 )	2022-10-21 10:26:30 -04:00
Sylvain Gugger	a929f81e92	Repo utils test (#19696 ) * Create repo utils test job * Last occurence * Add tests for tests_fetcher * Better filtering * Let's learn more * Should fix * Should fix * Remove debug * Style * WiP WiP WiP WiP WiP WiP WiP WiP WiP * Quality * address review comments * Fix link	2022-10-18 13:47:36 -04:00
Sylvain Gugger	69b81c0a5f	Use a dynamic configuration for circleCI tests (#19325 ) * Generate config on the file * Fake modif for all test launch * Upload more artifacts * Typo and quality * Try converting th yml to txt * Leave my long lines alone yaml * Debug prints * Debug prints v2 * Try without sorting * Was it really working before? * Typo * Use a parameter * Use a parameter? * Typo * Here is some JSON * Another try * Learning to read... * Check default is used * Does this work? * With continuation * WiP * Use a parameter for test list * Other fake modif * With the comma * Name the test step so it doesn't blow up * Just one example modification * Final steps * Add nightlies * Move config generator * Add trigger for nightlies * Better workflow * Rebase on recent changes * Fix config creation * Fake modif in an example * Now fake modif in one config file * Fix install step in custom tokenizers test * Fix generated config * Better fix hopefully * Finally test modif in setup * final cleanup	2022-10-11 16:31:24 -04:00
Sylvain Gugger	9ac586b3c8	Rework pipeline tests (#19366 ) * Rework pipeline tests * Try to fix Flax tests * Try to put it before * Use a new decorator instead * Remove ignore marker since it doesn't work * Filter pipeline tests * Woopsie * Use the fitlered list * Clean up and fake modif * Remove init * Revert fake modif	2022-10-07 18:01:58 -04:00
r-terada	2f53ab5745	Add sudachi and jumanpp tokenizers for bert_japanese (#19043 ) * add sudachipy and jumanpp tokenizers for bert_japanese * use ImportError instead of ModuleNotFoundError in SudachiTokenizer and JumanppTokenizer * put test cases of test_tokenization_bert_japanese in one line * add require_sudachi and require_jumanpp decorator for testing * add sudachi and pyknp(jumanpp) to dependencies * remove sudachi_dict_small and sudachi_dict_full from dependencies * empty commit for ci	2022-10-05 11:41:37 -04:00
Sylvain Gugger	655f72a689	Fix test fetching for examples (#19237 ) * Fix test fetching for examples * Fake example modif * Debug statements * Typo * You need to persist the file... * Revert change in example * Remove debug statements	2022-09-29 09:36:42 -04:00
Yih-Dar	64998a57fb	Fix cache names in CircleCI jobs (#19223 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-28 18:26:12 +02:00
Sylvain Gugger	820cb97a3f	Organize test jobs (#19058 ) * Tests conditional run * Syntax * Deps * Try early exit * Another way * Test with no tests to run * Test all * Typo * Try this way * With tests to run * Mostly finished * Typo * With a modification in one file only * No change, no tests * Final cleanup * Address review comments	2022-09-16 09:19:51 -04:00
Sylvain Gugger	f7ce4f1ff7	Fix custom tokenizers test (#19052 ) * Fix CI for custom tokenizers * Add nightly tests * Run CI, run! * Fix paths * Typos * Fix test	2022-09-15 11:31:09 -04:00
Sylvain Gugger	3774010161	Automate check for new pipelines and metadata update (#19029 ) * Automate check for new pipelines and metadata update * Add Datasets to quality extra	2022-09-14 14:06:49 -04:00
Craig Chan	fbf382c84d	Determine framework automatically before ONNX export (#18615 ) * Automatic detection for framework to use when exporting to ONNX * Log message change * Incorporating PR comments, adding unit test * Adding tf for pip install for run_tests_onnxruntime CI * Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py * Fixup * Adding test to fetcher * Updating circleci config to log more * Changing test class name * Comment typo fix in tests/onnx/test_features.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Moving torch_str/tf_str to self.framework_pt/tf * Remove -rA flag in circleci config Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2022-08-25 16:31:34 +02:00
Yih-Dar	84beb8a49b	Unpin detectron2 (#18727 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-23 11:10:07 +02:00
Yih-Dar	2c947d2939	Ping `detectron2` for CircleCI tests (#18680 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-18 12:57:18 +02:00
Matt	6eb51450fa	TF Examples Rewrite (#18451 ) * Finished QA example * Dodge a merge conflict * Update text classification and LM examples * Update NER example * New Keras metrics WIP, fix NER example * Update NER example * Update MC, summarization and translation examples * Add XLA warnings when shapes are variable * Make sure batch_size is consistently scaled by num_replicas * Add PushToHubCallback to all models * Add docs links for KerasMetricCallback * Add docs links for prepare_tf_dataset and jit_compile * Correct inferred model names * Don't assume the dataset has 'lang' * Don't assume the dataset has 'lang' * Write metrics in text classification * Add 'framework' to TrainingArguments and TFTrainingArguments * Export metrics in all examples and add tests * Fix training args for Flax * Update command line args for translation test * make fixup * Fix accidentally running other tests in fp16 * Remove do_train/do_eval from run_clm.py * Remove do_train/do_eval from run_mlm.py * Add tensorflow tests to circleci * Fix circleci * Update examples/tensorflow/language-modeling/run_mlm.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/test_tensorflow_examples.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/translation/run_translation.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/token-classification/run_ner.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Fix save path for tests * Fix some model card kwargs * Explain the magical -1000 * Actually enable tests this time * Skip text classification PR until we fix shape inference * make fixup Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2022-08-10 16:49:51 +01:00
Sylvain Gugger	70b0d4e193	Fix compatibility with 1.12 (#17925 ) * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 09:53:08 -04:00
Yih-Dar	6649133124	Add PYTEST_TIMEOUT for CircleCI test jobs (#18251 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-26 17:57:59 +02:00
Yih-Dar	4b1ed7979f	update cache to v0.5 (#18203 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-07-20 08:14:10 +02:00
Yih-Dar	05ed569c79	Use next-gen CircleCI convenience images (#18197 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-07-19 15:43:05 +02:00
Sylvain Gugger	1b749a7f8d	Sort doc toc (#18034 ) * Add script to sort doc ToC * Style and fixes * Add check to quality job	2022-07-07 08:17:58 -04:00
Yih-Dar	216499bfcc	Fix CI tests hang forever (#17471 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-02 10:30:54 +02:00
Joao Gante	ca1f1c8685	CLI: tool to convert PT into TF weights and open hub PR (#17497 )	2022-06-01 18:52:07 +01:00
NielsRogge	31ee80d556	Add LayoutLMv3 (#17060 ) * Make forward pass work * More improvements * Remove unused imports * Remove timm dependency * Improve loss calculation of token classifier * Fix most tests * Add docs * Add model integration test * Make all tests pass * Add LayoutLMv3FeatureExtractor * Improve integration test + make fixup * Add example script * Fix style * Add LayoutLMv3Processor * Fix style * Add option to add visual labels * Make more tokenizer tests pass * Fix more tests * Make more tests pass * Fix bug and improve docs * Fix import of processors * Improve docstrings * Fix toctree and improve docs * Fix auto tokenizer * Move tests to model folder * Move tests to model folder * change default behavior add_prefix_space * add prefix space for fast * add_prefix_spcae set to True for Fast * no space before `unique_no_split` token * add test to hightligh special treatment of added tokens * fix `test_batch_encode_dynamic_overflowing` by building a long enough example * fix `test_full_tokenizer` with add_prefix_token * Fix tokenizer integration test * Make the code more readable * Add tests for LayoutLMv3Processor * Fix style * Add model to README and update init * Apply suggestions from code review * Replace asserts by value errors * Add suggestion by @ducviet00 * Add model to doc tests * Simplify script * Improve README * a step ahead to fix * Update pair_input_test * Make all tokenizer tests pass - phew * Make style * Add LayoutLMv3 to CI job * Fix auto mapping * Fix CI job name * Make all processor tests pass * Make tests of LayoutLMv2 and LayoutXLM consistent * Add copied from statements to fast tokenizer * Add copied from statements to slow tokenizer * Remove add_visual_labels attribute * Fix tests * Add link to notebooks * Improve docs of LayoutLMv3Processor * Fix reference to section Co-authored-by: SaulLu <lucilesaul.com@gmail.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-05-24 09:53:45 +02:00
Sylvain Gugger	ddb1a47ec8	Automatically sort auto mappings (#17250 ) * Automatically sort auto mappings * Better class extraction * Some auto class magic * Adapt test and underlying behavior * Remove re-used config * Quality	2022-05-16 13:24:20 -04:00
Sylvain Gugger	afe5d42d8d	Black preview (#17217 ) * Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black	2022-05-12 16:25:55 -04:00
Zachary Mueller	2fbb237967	Add the auto_find_batch_size capability from Accelerate into Trainer (#17068 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop	2022-05-09 12:29:18 -04:00
Zachary Mueller	ef20390291	Update to build via git for accelerate (#17084 )	2022-05-04 09:42:36 -04:00
lewtun	4bb1d0ec84	Skip RoFormer ONNX test if rjieba not installed (#16981 ) * Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI	2022-05-04 10:04:10 +02:00
Yih-Dar	19420fd99e	Move test model folders (#17034 ) * move test model folders (TODO: fix imports and others) * fix (potentially partially) imports (in model test modules) * fix (potentially partially) imports (in tokenization test modules) * fix (potentially partially) imports (in feature extraction test modules) * fix import utils.test_modeling_tf_core * fix path ../fixtures/ * fix imports about generation.test_generation_flax_utils * fix more imports * fix fixture path * fix get_test_dir * update module_to_test_file * fix get_tests_dir from wrong transformers.utils * update config.yml (CircleCI) * fix style * remove missing imports * update new model script * update check_repo * update SPECIAL_MODULE_TO_TEST_MAP * fix style * add __init__ * update self-scheduled * fix add_new_model scripts * check one way to get location back * python setup.py build install * fix import in test auto * update self-scheduled.yml * update slack notification script * Add comments about artifact names * fix for yolos Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-03 14:42:02 +02:00
Sylvain Gugger	1073f00d4e	Clean up setup.py (#17045 ) * Clean up setup.py * Trigger CI * Upgrade Python used	2022-05-02 12:58:17 -04:00
Yih-Dar	ede5e04191	Add a check on config classes docstring checkpoints (#17012 ) * Add the check * add missing ckpts * add a list to ignore * call the added check script * better regex pattern Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-30 10:40:46 +02:00
Stas Bekman	5da33f8729	[modeling utils] revamp `from_pretrained(..., low_cpu_mem_usage=True)` + tests (#16657 ) * add low_cpu_mem_usage tests * wip: revamping * wip * install /usr/bin/time * wip * cleanup * cleanup * cleanup * cleanup * cleanup * fix assert * put the wrapper back * cleanup; switch to bert-base-cased * Trigger CI * Trigger CI	2022-04-14 18:10:05 -07:00
Zachary Mueller	89293a0f6b	Make nightly install dev accelerate (#16783 )	2022-04-14 09:41:02 -04:00
Zachary Mueller	d57da99237	Add tests for no_trainer and fix existing examples (#16656 ) * Fixed some bugs involving saving during epochs * Added tests mimicking the existing examples tests * Added in json exporting to all `no_trainer` examples for consistency	2022-04-08 10:03:56 -04:00
Sylvain Gugger	473709fc76	Use doc builder styler (#16412 ) * Config update * Use doc-builder styler * Cleanup * Adapt import * We need it there too!	2022-03-28 07:45:18 -04:00
Lysandre Debut	eca77f4719	Updates the default branch from master to main (#16326 ) * Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-23 03:46:59 -04:00
Lysandre Debut	0868fdef85	Fix torch-scatter version (#16072 )	2022-03-11 09:03:27 -05:00
lewtun	50dd314d93	Add ONNX export for ViT (#15658 ) * Add ONNX support for ViT * Refactor to use generic preprocessor * Add vision dep to tests * Extend ONNX slow tests to ViT * Add dummy image generator * Use model_type to determine modality * Add deprecation warnings for tokenizer argument * Add warning when overwriting the preprocessor * Add optional args to docstrings * Add minimum PyTorch version to OnnxConfig * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case * Add reasonable value for default atol Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-09 17:36:59 +01:00
SaulLu	e93763d420	fix CLIP fast tokenizer and change some properties of the slow version (#15067 ) Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-02-18 10:21:30 +01:00
cody-moveworks	a54961c5f7	Make OpenAIGPTTokenizer work with SpaCy 2.x and 3.x (#15019 ) * Make OpenAIGPTTokenizer work with SpaCy 3.x SpaCy 3.x introduced an API change to creating the tokenizer that breaks OpenAIGPTTokenizer. The old API for creating the tokenizer in SpaCy 2.x no longer works under SpaCy 3.x, but the new API for creating the tokenizer in SpaCy 3.x DOES work under SpaCy 2.x. Switching to the new API should allow OpenAIGPTTokenizer to work under both SpaCy 2.x and SpaCy 3.x versions. * Add is_spacy_available and is_ftfy_available methods to file utils * Add spacy and ftfy unittest decorator to testing utils * Add tests for OpenAIGPTTokenizer that require spacy and ftfy * Modify CircleCI config to run tests that require spacy and ftfy * Remove unneeded unittest decorators are reuse test code * Run make fixup	2022-01-10 07:53:20 -05:00
Sylvain Gugger	87e6e4fe5c	Doc styler v2 (#14950 ) * New doc styler * Fix issue with args at the start * Code sample fixes * Style code examples in MDX * Fix more patterns * Typo * Typo * More patterns * Do without black for now * Get more info in error * Docstring style * Re-enable check * Quality * Fix add_end_docstring decorator * Fix docstring	2021-12-27 16:31:21 -05:00
Sylvain Gugger	7af80f6618	Convert docstrings of modeling files (#14850 ) * Convert file_utils docstrings to Markdown * Test on BERT * Return block indent * Temporarily disable doc styler * Remove from quality checks as well * Remove doc styler mess * Remove check from circleCI * Fix typo * Convert file_utils docstrings to Markdown * Test on BERT * Return block indent * Temporarily disable doc styler * Remove from quality checks as well * Remove doc styler mess * Remove check from circleCI * Fix typo * Let's go on all other model files * Add templates too * Styling and quality	2021-12-21 05:37:32 -05:00
Patrick von Platen	c4a96cecbc	Wav2Vec2 meets phonemes (#14353 ) * up * add tokenizer * improve more * finish tokenizer * finish * adapt speech recognition script * adapt convert * more fixes * more fixes * update phonemizer wav2vec2 * better naming * fix more tests * more fixes swedish * correct tests * finish * improve script * remove file * up * lets get those 100 model architectures until the end of the month * make fix-copies * correct more * correct script * more fixes * more fixes * add to docs * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace assert * fix copies * fix docs * new try docs * boom boom * update * add phonemizer to audio tests * make fix-copies * up * upload models * some changes * Update tests/test_tokenization_wav2vec2_phoneme.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * more fixes * remove @ Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2021-12-17 19:56:44 +01:00
Sylvain Gugger	e9800122a6	Add kenlm dep to missing tests	2021-12-08 19:59:44 -05:00
Patrick von Platen	961732c276	[Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339 ) * up * up * up * make it cleaner * correct * make styhahalal * add more tests * finish * small fix * make style * up * tryout to solve cicrle ci * up * fix more tests * fix more tests * apply sylvains suggestions * fix import * correct docs * add pyctcdecode only to speech tests * fix more tests * add tf, flax and pt tests * add pt * fix last tests * fix more tests * Apply suggestions from code review * change lines * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * correct tests * correct tests * add doc string Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2021-12-08 12:07:54 +01:00
Suraj Patil	c824d7ed48	add flax example tests in CI workflow (#14637 )	2021-12-06 14:50:43 +05:30
Suraj Patil	c5bd732ac6	Add Flax example tests (#14599 ) * add test for glue * add tests for clm * fix clm test * add summrization tests * more tests * fix few tests * add test for t5 mlm * fix t5 mlm test * fix tests for multi device * cleanup * ci job * fix metric file name * make t5 more robust	2021-12-06 10:48:58 +05:30
Lysandre Debut	e4c67d60ec	Python 3.6 -> Python 3.7 for TF runs (#14598 )	2021-12-02 04:09:17 -05:00
Sylvain Gugger	4df7d05a87	Doc new front (#14590 ) * Convert PretrainedConfig doc to Markdown * Use syntax * Add necessary doc files (#14496) * Doc fixes (#14499) * Fixes for the new front * Convert DETR file for table * Title is needed * Simplify a bit * Even simpler * Remove imports * Fix typo in toctree (#14516) * Fix checkpoints badge * Update versions.yml format (#14517) * Doc new front github actions (#14512) * Doc new front github actions * Fix docstring * Fix feature extraction utils import (#14515) * Address Julien's comments * Push to doc-builder * Ready for merge * Remove old build and deploy * Doc misc fixes (#14583) * Rm versions.yml from doc * Fix converting.rst * Rm pretrained_models from toctree * Fix index links (#14567) * Fix links in README * Localized READMEs * Fix copy script * Fix find doc script * Update README_ko.md Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co> * Adapt build command to new CLI tools (#14578) * Fix typo * Fix doc interlinks (#14589) * Convert PretrainedConfig doc to Markdown * Use syntax * Rm pattern <[a-z]+(.html).> Rm huggingface.co/transformers/master * Rm .html * Rm .html from index.mdx * Rm .html from model_summary.rst * Update index.mdx rm html * Update remove .html * Fix inner doc links * Fix interlink in preprocssing.rst * Update pr_checks Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Convert PretrainedConfig doc to Markdown * Use syntax * Add necessary doc files (#14496) * Doc fixes (#14499) * Fixes for the new front * Convert DETR file for table * Title is needed * Simplify a bit * Even simpler * Remove imports * Fix checkpoints badge * Fix typo in toctree (#14516) * Update versions.yml format (#14517) * Doc new front github actions (#14512) * Doc new front github actions * Fix docstring * Fix feature extraction utils import (#14515) * Address Julien's comments * Push to doc-builder * Ready for merge * Remove old build and deploy * Doc misc fixes (#14583) * Rm versions.yml from doc * Fix converting.rst * Rm pretrained_models from toctree * Fix index links (#14567) * Fix links in README * Localized READMEs * Fix copy script * Fix find doc script * Update README_ko.md Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co> * Adapt build command to new CLI tools (#14578) * Fix typo * Fix doc interlinks (#14589) * Convert PretrainedConfig doc to Markdown * Use syntax * Rm pattern <[a-z]+(.html).> Rm huggingface.co/transformers/master * Rm .html * Rm .html from index.mdx * Rm .html from model_summary.rst * Update index.mdx rm html * Update remove .html * Fix inner doc links * Fix interlink in preprocssing.rst * Update pr_checks Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Styling Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co>	2021-12-01 14:13:02 -05:00
Kamal Raj	c468a87a69	Tapas tf (#13393 ) * TF Tapas first commit * updated docs * updated logger message * updated pytorch weight conversion script to support scalar array * added use_cache to tapas model config to work properly with tf input_processing * 1. rm embeddings_sum 2. added # Copied 3. + TFTapasMLMHead 4. and lot other small fixes * updated docs * + test for tapas * updated testing_utils to check is_tensorflow_probability_available * converted model logits post processing using numpy to work with both PT and TF models * + TFAutoModelForTableQuestionAnswering * added TF support * added test for TFAutoModelForTableQuestionAnswering * added test for TFAutoModelForTableQuestionAnswering pipeline * updated auto model docs * fixed typo in import * added tensorflow_probability to run tests * updated MLM head * updated tapas.rst with TF model docs * fixed optimizer import in docs * updated convert to np data from pt model is not `transformers.tokenization_utils_base.BatchEncoding` after pipeline upgrade * updated pipeline: 1. with torch.no_gard removed, pipeline forward handles 2. token_type_ids converted to numpy * updated docs. * removed `use_cache` from config * removed floats_tensor * updated code comment * updated Copyright Year and logits_aggregation Optional * updated docs and comments * updated docstring * fixed model weight loading * make fixup * fix indentation * added tf slow pipeline test * pip upgrade * upgrade python to 3.7 * removed from_pt from tests * revert commit `f18cfa9`	2021-11-30 11:07:55 +01:00
NielsRogge	3772af49ce	[Tests] Improve vision tests (#14458 ) * Improve tests * Install vision for tf tests	2021-11-24 15:22:20 +01:00
Shang Zhang	a59e7c1ed4	Add QDQBert model and quantization examples of SQUAD task (#14066 ) * clean up branch for add-qdqbert-model * README update for QAT example; update docstrings in modeling_qdqbert.py * Update qdqbert.rst * Update README.md * Update README.md * calibration data using traning set; QAT example runs in fp32 * re-use BERTtokenizer for qdqbert * Update docs/source/model_doc/qdqbert.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/qdqbert.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/qdqbert.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove qdqbert tokenizer * Update qdqbert.rst * update evaluate-hf-trt-qa.py * update configuration_qdqbert.py * update modeling_qdqbert.py: add copied statement; replace assert with ValueError * update copied from statement * add is_quantization_available; run make fix-copies * unittest add require_quantization * add backend dependency to qdqbert model * update README; update evaluate script; make style * lint * docs qdqbert update * circleci build_doc add pytorch-quantization for qdqbert * update README * update example readme with instructions to upgrade TensorRT to 8.2 * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/models/qdqbert/configuration_qdqbert.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * change quantization to pytorch_quantization for backend requirement * feed_forward_chunking not supported in QDQBert * make style * update model docstrings and comments in testing scripts * rename example to quantization-qdqbert; rename example scripts from qat to quant * Update src/transformers/models/qdqbert/modeling_qdqbert.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * rm experimental functions in quant_trainer * qa cleanup * make fix-copies for docs index.rst * fix doctree; use post_init() for qdqbert * fix early device assignment for qdqbert * fix CI:Model templates runner Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-11-19 13:33:39 -05:00
Lysandre Debut	331c3d2aa0	Add GitPython to quality tools (#14459 ) * Update setup.py * Update setup.py * Update setup.py * Remove GitPython install	2021-11-19 08:43:48 -05:00
Lysandre	c6c075544d	Docs for version v4.12.5	2021-11-17 11:39:12 -05:00
Lysandre	888fb21159	Docs for v4.12.4	2021-11-16 17:40:58 -05:00
Sylvain Gugger	f0d6e952c0	Quality explain (#14264 ) * Start PR doc * Cleanup the quality checks and document them * Add reference in the contributing guide * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Rename file as per review suggestion Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-11-03 17:43:19 -04:00
Lysandre	9fc1951711	Docs for v4.12.2	2021-10-29 14:51:05 -04:00
Lysandre	513fa30a63	Docs for v4.12.1	2021-10-29 13:49:50 -04:00
Lysandre Debut	63d91f449c	Torch 1.10 (#14169 ) * Torch 1.10 * torch scatter for 1.10 * style * Skip tests ok	2021-10-29 13:43:43 -04:00
Sylvain Gugger	4ab6a4a086	Fix pipeline tests env and fetch (#14209 ) * Fix pipeline tests env and fetch * Fix quality	2021-10-29 09:35:05 -04:00
Lysandre	b8fad022a0	v4.13.0.dev0	2021-10-28 12:56:46 -04:00
Lysandre Debut	5b317f7ea4	Scatter dummies + skip pipeline tests (#13996 ) * Scatter dummies + skip pipeline tests * Add torch scatter to build docs	2021-10-14 15:30:27 -04:00
Lysandre	5be59a3649	Deploy docs for v4.11.3	2021-10-06 12:58:47 -04:00
Sylvain Gugger	5f25855b3e	Update doc for v4.11.2	2021-09-30 11:58:33 -04:00
Sylvain Gugger	cf4aa3597f	Update doc for v4.11.1	2021-09-29 12:09:40 -04:00
Lysandre	11c69b8045	Docs for version v4.11.0	2021-09-27 14:19:38 -04:00
Sylvain Gugger	a8ec002926	Update test dependence for torch examples (#13738 )	2021-09-25 18:47:39 +02:00
Sylvain Gugger	af5c6ae5ed	Properly use test_fetcher for examples (#13604 ) * Properly use test_fetcher for examples * Fake example modification * Fake modeling file modification * Clean fake modifications * Run example tests for any modification.	2021-09-16 15:13:00 -04:00
patrickvonplaten	72ec2f3eb5	Docs for v4.10.1	2021-09-10 16:45:19 +02:00
Sylvain Gugger	c1b20e42f5	Redeploy stable documentation	2021-09-01 09:21:50 -04:00
Li-Huai (Allan) Lin	85cb447766	Revert "Correct wrong function signatures on the docs website (#13198 )" This reverts commit `ffecfea949`.	2021-09-01 09:17:08 -04:00
Lysandre	e53af030c0	Re-deploy documentation	2021-08-31 16:18:14 +02:00
Lysandre	5ee67a4412	Docs for v4.10.0	2021-08-31 16:02:31 +02:00
Patrick von Platen	062300ba7f	[Testing] Add Flax Tests on GPU, Add Speech and Vision to Flax & TF tests (#13313 ) * up * finish * Apply suggestions from code review * apply Lysandres suggestions * adapt circle ci as well * finish * Update setup.py	2021-08-31 11:08:22 +02:00
Li-Huai (Allan) Lin	ffecfea949	Correct wrong function signatures on the docs website (#13198 ) * Correct outdated function signatures on website. * Upgrade sphinx to 3.5.4 (latest 3.x) * Test * Test * Test * Test * Test * Test * Revert unnecessary changes. * Change sphinx version to 3.5.4" * Test python 3.7.11	2021-08-30 11:40:25 -04:00
NielsRogge	b6ddb08a66	Add LayoutLMv2 + LayoutXLM (#12604 ) * First commit * Make style * Fix dummy objects * Add Detectron2 config * Add LayoutLMv2 pooler * More improvements, add documentation * More improvements * Add model tests * Add clarification regarding image input * Improve integration test * Fix bug * Fix another bug * Fix another bug * Fix another bug * More improvements * Make more tests pass * Make more tests pass * Improve integration test * Remove gradient checkpointing and add head masking * Add integration test * Add LayoutLMv2ForSequenceClassification to the tests * Add LayoutLMv2ForQuestionAnswering * More improvements * More improvements * Small improvements * Fix _LazyModule * Fix fast tokenizer * Move sync_batch_norm to a separate method * Replace dummies by requires_backends * Move calculation of visual bounding boxes to separate method + update README * Add models to main init * First draft * More improvements * More improvements * More improvements * More improvements * More improvements * Remove is_split_into_words * More improvements * Simply tesseract - no use of pandas anymore * Add LayoutLMv2Processor * Update is_pytesseract_available * Fix bugs * Improve feature extractor * Fix bug * Add print statement * Add truncation of bounding boxes * Add tests for LayoutLMv2FeatureExtractor and LayoutLMv2Tokenizer * Improve tokenizer tests * Make more tokenizer tests pass * Make more tests pass, add integration tests * Finish integration tests * More improvements * More improvements - update API of the tokenizer * More improvements * Remove support for VQA training * Remove some files * Improve feature extractor * Improve documentation and one more tokenizer test * Make quality and small docs improvements * Add batched tests for LayoutLMv2Processor, remove fast tokenizer * Add truncation of labels * Apply suggestions from code review * Improve processor tests * Fix failing tests and add suggestion from code review * Fix tokenizer test * Add detectron2 CI job * Simplify CI job * Comment out non-detectron2 jobs and specify number of processes * Add pip install torchvision * Add durations to see which tests are slow * Fix tokenizer test and make model tests smaller * Frist draft * Use setattr * Possible fix * Proposal with configuration * First draft of fast tokenizer * More improvements * Enable fast tokenizer tests * Make more tests pass * Make more tests pass * More improvements * Addd padding to fast tokenizer * Mkae more tests pass * Make more tests pass * Make all tests pass for fast tokenizer * Make fast tokenizer support overflowing boxes and labels * Add support for overflowing_labels to slow tokenizer * Add support for fast tokenizer to the processor * Update processor tests for both slow and fast tokenizers * Add head models to model mappings * Make style & quality * Remove Detectron2 config file * Add configurable option to label all subwords * Fix test * Skip visual segment embeddings in test * Use ResNet-18 backbone in tests instead of ResNet-101 * Proposal * Re-enable all jobs on CI * Fix installation of tesseract * Fix failing test * Fix index table * Add LayoutXLM doc page, first draft of code examples * Improve documentation a lot * Update expected boxes for Tesseract 4.0.0 beta * Use offsets to create labels instead of checking if they start with ## * Update expected boxes for Tesseract 4.1.1 * Fix conflict * Make variable names cleaner, add docstring, add link to notebooks * Revert "Fix conflict" This reverts commit a9b46ce9afe47ebfcfe7b45e6a121d49e74ef2c5. * Revert to make integration test pass * Apply suggestions from @LysandreJik's review * Address @patrickvonplaten's comments * Remove fixtures DocVQA in favor of dataset on the hub Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-08-30 12:35:42 +02:00
Sylvain Gugger	b0a917c48a	Fix CircleCI nightly tests (#13113 )	2021-08-13 08:57:30 +02:00
Sylvain Gugger	9e9b8f1d99	Roll out the test fetcher on push tests (#13055 ) * Use test fetcher for push tests as well * Force diff with last commit for circleCI on master * Fix syntax error * Style * Schedule nightly tests	2021-08-10 14:54:52 +02:00
Lysandre	a8bf2fa76e	Documentation for patch v4.9.2	2021-08-09 16:14:17 +02:00
Sylvain Gugger	a492aec82d	Update doc	2021-07-26 10:27:14 -04:00
Lysandre	40de2d5a4f	Docs for v4.10.0dev0	2021-07-22 12:52:25 +02:00
Stas Bekman	7fae535052	add troubleshooting docs (#12791 )	2021-07-20 03:32:02 -04:00
Sylvain Gugger	084873b025	Only test the files impacted by changes in the diff (#12644 ) * Base test * More test * Fix mistake * Add a docstring change * Add doc ignore * Add changes * Add recursive dep search * Add recursive dep search * save * Finalize test mapping * Fix bug * Print prettier * Ignore comments and empty lines * Make script runnable from anywhere * Need dev install * Like that * Adapt * Add as artifact * Try on torch tests * Fix yaml error * Install GitPython * Apply everywhere * Be more defensive * Revert to all tests if something is wrong * Install GitPython * Test if there are tests before launching. * Fixes * Fixes * Fixes * Fixes * Bash syntax is horrible * Be less stupid * Try differently * Typo * Typo * Typo * Style * Better name * Escape quotes * Ignore black unhelpful re-formatting * Not a docstring * Deal with inits in dependency map * Run all tests once PR is merged. * Add last job * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Stronger dependencies gather * Ignore empty lines too! * Clean up * Fix quality Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>	2021-07-14 10:56:55 -04:00
Funtowicz Morgan	2aa3cd935d	[RFC] Laying down building stone for more flexible ONNX export capabilities (#11786 ) * Laying down building stone for more flexible ONNX export capabilities * Ability to provide a map of config key to override before exporting. * Makes it possible to export BART with/without past keys. * Supports simple mathematical syntax for OnnxVariable.repeated * Effectively apply value override from onnx config for model * Supports export with additional features such as with-past for seq2seq * Store the output path directly in the args for uniform usage across. * Make BART_ONNX_CONFIG_* constants and fix imports. * Support BERT model. * Use tokenizer for more flexibility in defining the inputs of a model. * Add TODO as remainder to provide the batch/sequence_length as CLI args * Enable optimizations to be done on the model. * Enable GPT2 + past * Improve model validation with outputs containing nested structures * Enable Roberta * Enable Albert * Albert requires opset >= 12 * BERT-like models requires opset >= 12 * Remove double printing. * Enable XLM-Roberta * Enable DistilBERT * Disable optimization by default * Fix missing setattr when applying optimizer_features * Add value field to OnnxVariable to define constant input (not from tokenizers) * Add T5 support. * Simplify model type retrieval * Example exporting token_classification pipeline for DistilBERT. * Refactoring to package `transformers.onnx` * Solve circular dependency & __main__ * Remove unnecessary imports in `__init__` * Licences * Use @Narsil's suggestion to forward the model's configuration to the ONNXConfig to avoid interpolation. * Onnx export v2 fixes (#12388) * Tiny fixes Remove `convert_pytorch` from onnxruntime-less runtimes Correct reference to model * Style * Fix Copied from * LongFormer ONNX config. * Removed optimizations * Remvoe bad merge relicas. * Remove unused constants. * Remove some deleted constants from imports. * Fix unittest to remove usage of PyTorch model for onnx.utils. * Fix distilbert export * Enable ONNX export test for supported model. * Style. * Fix lint. * Enable all supported default models. * GPT2 only has one output * Fix bad property name when overriding config. * Added unittests and docstrings. * Disable with_past tests for now. * Enable outputs validation for default export. * Remove graph opt lvls. * Last commit with on-going past commented. * Style. * Disabled `with_past` for now * Remove unused imports. * Remove framework argument * Remove TFPreTrainedModel reference * Add documentation * Add onnxruntime tests to CircleCI * Add test * Rename `convert_pytorch` to `export` * Use OrderedDict for dummy inputs * WIP Wav2Vec2 * Revert "WIP Wav2Vec2" This reverts commit f665efb04c92525c3530e589029f0ae7afdf603e. * Style * Use OrderedDict for I/O * Style. * Specify OrderedDict documentation. * Style :) Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-07-08 10:54:42 -04:00
Lysandre	2870fd198f	Bump CircleCI machine sizes	2021-07-06 17:46:39 +02:00
Lysandre	89073a95ba	Document patch release v4.8.2	2021-06-30 14:39:52 +02:00
Stas Bekman	d25ad34c82	[CI] add dependency table sync verification (#12364 ) * add dependency table sync verification * improve the message * improve the message * revert * ready to merge	2021-06-28 08:55:59 -07:00
Sylvain Gugger	5b1b5635d3	Document patch release v4.8.1	2021-06-24 10:15:15 -04:00
Sylvain Gugger	2150dfed31	v4.9.0.dev0	2021-06-23 13:31:19 -04:00
Lysandre	0daadc1919	Docs for v4.8.0	2021-06-17 18:17:42 +02:00
Lysandre Debut	3a960c4857	Support for torch 1.9.0 (#12224 ) * Support for torch 1.9.0 * Torch scatter for 1.9.0 * Github Actions run on 1.9.0	2021-06-17 11:29:01 -04:00
Lysandre Debut	52c7ca0488	Temporarily deactivate torch-scatter while we wait for new release (#12181 ) * Temporarily deactivate torch-scatter while we wait for new release * torch-1.8.1 binary for scatter * Revert to 1.8.0 * Pin torch dependency * torchaudio and torchvision	2021-06-15 16:03:58 -04:00
NielsRogge	d3eacbb829	Add DETR (#11653 ) * Squash all commits of modeling_detr_v7 branch into one * Improve docs * Fix tests * Style * Improve docs some more and fix most tests * Fix slow tests of ViT, DeiT and DETR * Improve replacement of batch norm * Restructure timm backbone forward * Make DetrForSegmentation support any timm backbone * Fix name of output * Address most comments by @LysandreJik * Give better names for variables * Conditional imports + timm in setup.py * Address additional comments by @sgugger * Make style, add require_timm and require_vision to testsé * Remove train_backbone attribute of DetrConfig, add methods to freeze/unfreeze backbone * Add png files to fixtures * Fix type hint * Add timm to workflows * Add `BatchNorm2d` to the weight initialization * Fix retain_grad test * Replace model checkpoints by Facebook namespace * Fix name of checkpoint in test * Add user-friendly message when scipy is not available * Address most comments by @patrickvonplaten * Remove return_intermediate_layers attribute of DetrConfig and simplify Joiner * Better initialization * Scipy is necessary to get sklearn metrics * Rename TimmBackbone to DetrTimmConvEncoder and rename DetrJoiner to DetrConvModel * Make style * Improve docs and add 2 community notebooks Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-06-09 11:51:13 -04:00
Sylvain Gugger	252082001d	Fix v4.6.0 doc	2021-05-13 10:45:28 -04:00
Sylvain Gugger	cbbf49f644	Fix doc deployment	2021-05-13 10:34:14 -04:00
Lysandre	d77eb0cf92	Docs for v4.7.0.dev0	2021-05-12 17:08:35 +02:00
Sylvain Gugger	2ce0fb84cc	Make quality scripts work when one backend is missing. (#11573 ) * Make quality scripts work when one backend is missing. * Check env variable is properly set * Add default * With print statements * Fix typo * Set env variable * Remove debug code	2021-05-04 09:53:44 -04:00
Sylvain Gugger	81a6c7cd39	Use 3 workers for torch tests	2021-04-23 18:47:46 -04:00
Sylvain Gugger	ca6b80cadb	Wrong branch Sylvain...	2021-04-23 12:46:54 -04:00
Sylvain Gugger	3951fc55ee	Try to trigger failure more	2021-04-23 12:44:54 -04:00
Sylvain Gugger	bf2e0cf70b	Trainer push to hub (#11328 ) * Initial support for upload to hub * push -> upload * Fixes + examples * Fix torchhub test * Torchhub test I hate you * push_model_to_hub -> push_to_hub * Apply mixin to other pretrained models * Remove ABC inheritance * Add tests * Typo * Run tests * Install git-lfs * Change approach * Add push_to_hub to all * Staging test suite * Typo * Maybe like this? * More deps * Cache * Adapt name * Quality * MOAR tests * Put it in testing_utils * Docs + torchhub last hope * Styling * Wrong method * Typos * Update src/transformers/file_utils.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-04-23 09:17:37 -04:00
Sylvain Gugger	dabeb15292	Examples reorg (#11350 ) * Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-21 11:11:20 -04:00
Sylvain Gugger	893e51a53f	Document v4.5.1	2021-04-13 11:28:17 -04:00
Sylvain Gugger	26212c14e5	Reactivate Megatron tests an use less workers	2021-04-09 18:09:53 -04:00
Kevin Canwen Xu	fb41f9f50c	Add a special tokenizer for CPM model (#11068 ) * Add a special tokenizer for CPM model * make style * fix * Add docs * styles * cpm doc * fix ci * fix the overview * add test * make style * typo * Custom tokenizer flag * Add REAMDE.md Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-04-10 02:07:47 +08:00
Stas Bekman	97ccf67bb3	[setup] extras[docs] must include 'all' (#11148 ) * extras[doc] must include 'all' * fix * better * regroup	2021-04-08 18:10:44 -04:00
Lysandre	9853c5dd58	Development on v4.6.0dev0	2021-04-06 12:53:25 -04:00
Sylvain Gugger	b0d49fd536	Add a script to check inits are consistent (#11024 )	2021-04-04 20:41:34 -04:00
NielsRogge	30677dc743	Add Vision Transformer and ViTFeatureExtractor (#10950 ) * Squash all commits into one * Update ViTFeatureExtractor to use image_utils instead of torchvision * Remove torchvision and add Pillow * Small docs improvement * Address most comments by @sgugger * Fix tests * Clean up conversion script * Pooler first draft * Fix quality * Improve conversion script * Make style and quality * Make fix-copies * Minor docs improvements * Should use fix-copies instead of manual handling * Revert "Should use fix-copies instead of manual handling" This reverts commit `fd4e591bce`. * Place ViT in alphabetical order Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-01 11:16:05 -04:00
Sylvain Gugger	d0b3797a3b	Add more metadata to the user agent (#10972 ) * Add more metadata to the user agent * Fix typo * Use DISABLE_TELEMETRY * Address review comments * Use global env * Add clean envs on circle CI	2021-03-31 09:36:07 -04:00
Lysandre	3f48b2bc3e	Update stable docs	2021-03-23 11:01:16 -04:00

1 2 3 4 5 ...

432 Commits