transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 09:42:22 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	655f72a689	Fix test fetching for examples (#19237 ) * Fix test fetching for examples * Fake example modif * Debug statements * Typo * You need to persist the file... * Revert change in example * Remove debug statements	2022-09-29 09:36:42 -04:00
atturaioe	b79028f0b6	Fix TrainingArgs argument serialization (#19239 )	2022-09-29 09:13:56 -04:00
Lucain	902d30b31a	Use `hf_raise_for_status` instead of deprecated `_raise_for_status` (#19244 ) * Use instead of from huggingface_hub * bump huggingface_hub to 0.10.0 + make deps_table_update	2022-09-29 08:58:39 -04:00
Younes Belkada	3a27ba3d18	Fix opt softmax small nit (#19243 ) * fix opt softmax nit - Use the same logic as `1eb0953755` for consistency * Update src/transformers/models/opt/modeling_opt.py	2022-09-29 13:40:55 +02:00
mustapha ajeghrir	ba9e336fa3	Fix `m2m_100.mdx` doc example missing `labels` (#19149 ) The `labels` variable is not defined, the `model_inputs` already contain this information.	2022-09-29 13:27:58 +02:00
Aritra Roy Gosthipaty	0dc7b3a785	[TensorFlow] Adding GroupViT (#18020 ) * chore: initial commit * chore: adding util methods yet to work on the nn.functional.interpolate port with align_corener=True * chore: refactor the utils * used tf.compat.v1.image.resize to align the F.interpolate function * added type hints to the method signatures * added references to the gists where one 2 one alignment of torch and tf has been shown * chore: adding the layers * chore: porting all the layers from torch to tf This is the initial draft, nothing is tested yet. * chore: aligning the layers with reference to tf clip * chore: aligning the modules * added demaraction comments * added copied and adapted from comments * chore: aligning with CLIP * chore: wrangling the layers to keep it tf compatible * chore: aligning the names of the layers for porting * chore: style changes * chore: adding docs and inits * chore: adding tfp dependencis the code is taken from TAPAS * chore: initial commit for testing * chore: aligning the vision embeddings with the vit implementatino * chore: changing model prefix * chore: fixing the name of the model and the layer normalization test case * chore: every test passes but the slow ones * chore: fix style and integration test * chore: moving comments below decorators * chore: make fixup and fix-copies changes * chore: adding the Vision and Text Model to check_repo * chore: modifying the prefix name to align it with the torch implementation * chore: fix typo in configuration * choer: changing the name of the model variable * chore: adding segmentation flag * chore: gante's review * chore: style refactor * chore: amy review * chore: adding shape_list to parts that have been copied from other snippets * chore: init batchnorm with torch defaults * chore: adding shape_list to pass the tests * test fix: adding seed as 0 * set seed * chore: changing the straight through trick to fix -ve dimensinos * chore: adding a dimension to the loss * chore: adding reviewers and contributors names to the docs * chore: added changes after review * chore: code quality fixup * chore: fixing the segmentation snippet * chore: adding to the layer calls * chore: changing int32 to int64 for inputs of serving * chore: review changes * chore: style changes * chore: remove from_pt=True * fix: repo consistency Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-29 10:48:04 +01:00
Michael Benayoun	bb6fa06f2d	Add a getattr method, which replaces _module_getattr in torch.fx.Tracer from PyTorch 1.13+ (#19233 )	2022-09-29 11:04:49 +02:00
Gabriele Sarti	9d732fd2dd	XGLM - Fix Softmax NaNs when using FP16 (#18057 ) * fix fp16 for xglm * Removed misleading comment * Fix undefined variable Co-authored-by: Gabriele Sarti <gsarti@amazon.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2022-09-29 10:42:07 +02:00
Yih-Dar	99c32493e0	Fix confusing working directory in Push CI (#19234 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-29 08:36:46 +02:00
Steven Liu	6957350c2b	Focus doc around preprocessing classes (#18768 ) * 📝 reframe docs around preprocessing classes * small edits * edits and review * fix typo * apply review * clarify processor	2022-09-28 17:09:44 -07:00
Steven Liu	990936a868	Move AutoClasses under Main Classes (#19163 ) * move autoclasses to main classes * keep auto.mdx in model_doc	2022-09-28 17:09:29 -07:00
Sylvain Gugger	0fc68a7e14	Fix seq2seq QA example	2022-09-28 15:45:49 -04:00
Yih-Dar	64998a57fb	Fix cache names in CircleCI jobs (#19223 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-28 18:26:12 +02:00
Tatsuki Okada	4a0b958d61	Fix trainer seq2seq qa.py evaluate log and ft script (#19208 ) * fix args option * fix trainer eval log * fix out of memory qa script * do isort, black, flake * fix tokenize target * take it back. * fix: comment	2022-09-28 10:55:46 -04:00
Nick Doiron	9c6aeba353	Document and validate typical_p in generation (#19128 ) * Document and validate typical_p in generation	2022-09-28 15:45:05 +01:00
Yih-Dar	de359c4593	Fix doctest for `TFDeiTForImageClassification` (#19173 ) * Fix doctest for TFDeiTForImageClassification * Remove unnecessary tf.random.set_seed Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-28 15:53:21 +02:00
Gabriel Luiz Freitas Almeida	22d37a9d2c	Fix deprecation warning for return_all_scores (#19217 ) * Improve deprecation warning for return_all_scores * Fix formatting	2022-09-28 08:57:43 -04:00
Joao Gante	a357ed50e7	Generate: add warning when left padding should be used (#19067 ) * add warning when left padding should be used * PT: check for pad token; FLAX: can only check while not tracing	2022-09-28 13:07:08 +01:00
Ankur Goyal	942fa8ced8	Fix small use_cache typo in the docs (#19191 )	2022-09-28 13:03:20 +01:00
IMvision12	2df602870b	Added tests for yaml and json parser (#19219 ) * Added tests for yaml and json * Added tests for yaml and json	2022-09-27 16:25:57 -04:00
Yih-Dar	2d95695825	Use `math.pi` instead of `torch.pi` in `MaskFormer` (#19201 ) * Use math.pi Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-27 17:30:58 +02:00
Sylvain Gugger	34be08efcd	More tests for regression in cached non existence (#19216 ) * More tests for regression in cached non existence * Style	2022-09-27 09:36:34 -04:00
Nicola Procopio	e3a30e2b99	translated add_new_pipeline (#19215 )	2022-09-27 08:55:41 -04:00
wangxu	226b0e46d5	Add a use_parallel_residual argument to control the residual computing way (#18695 ) * Add a gpt_j_residual argument to control the residual computing way * Put duplicate code outside of the if block * Rename parameter "gpt_j_residual" to "use_parallel_residual" and set the default value to True	2022-09-27 07:54:05 -04:00
Wang, Yi	88f597ba6a	add doc for hyperparameter search (#19192 ) * add doc for hyperparameter search * update doc	2022-09-27 07:51:51 -04:00
Arijit Mukherjee	ea540a5977	add wav2vec2_alignment (#16782 ) * add wav2vec2_alignment * Update alignment.py * Update examples/research_projects/wav2vec2/alignment.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/research_projects/wav2vec2/alignment.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/research_projects/wav2vec2/alignment.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/research_projects/wav2vec2/alignment.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update README.md * fix style * fix imports * fix multithread * fix bash script * [@anton-l] Style fixes and docstrings * [@anton-l] Style fixes and docstrings * Update alignment.py fix blank id in backtrack Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: anton-l <aglozhkov@gmail.com>	2022-09-27 13:12:56 +02:00
Ekagra Ranjan	7132d55ca1	Remove unused `cur_len` in generation_utils.py (#18874 ) * remove unused cur_len in generation_utils.py * linting	2022-09-27 10:39:31 +02:00
Sylvain Gugger	a32f97c37d	Fix cached_file in offline mode for cached non-existing files (#19206 ) * Fix cached_file in offline mode for cached non-existing files * Add tests * Test with offline mode	2022-09-26 18:01:00 -04:00
Yih-Dar	ca0886395b	Add warning for torchaudio <= 0.10 in MCTCTFeatureExtractor (#19203 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-26 23:58:02 +02:00
IMvision12	be4f269979	Updated hf_argparser.py (#19188 ) * Changed json_file_parser function and added yaml parser function * update hf_argparser * Added allow_extra_keys argument	2022-09-26 17:02:57 -04:00
Sylvain Gugger	c20b2c7e18	Use repo_type instead of deprecated datasets repo IDs (#19202 ) * Use repo_type instead of deprecated datasets repo IDs * Add missing one in doc	2022-09-26 09:50:48 -04:00
Ankur Goyal	216b2f9e80	Move the model type check (#19027 ) Co-authored-by: Ankur Goyal <ankur@impira.com>	2022-09-26 09:43:34 -04:00
Yih-Dar	ea75e9f10e	Use `assertAlmostEqual` in `BloomEmbeddingTest.test_logits` (#19200 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-26 14:56:41 +02:00
dependabot[bot]	98af4f9b54	Bump protobuf in /examples/research_projects/decision_transformer (#19176 ) Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.19.4 to 3.19.5. - [Release notes](https://github.com/protocolbuffers/protobuf/releases) - [Changelog](https://github.com/protocolbuffers/protobuf/blob/main/generate_changelog.py) - [Commits](https://github.com/protocolbuffers/protobuf/compare/v3.19.4...v3.19.5) --- updated-dependencies: - dependency-name: protobuf dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-09-26 14:55:16 +02:00
Ahmad Elawady	408b5e307b	Remove pos arg from Perceiver's Pre/Postprocessors (#18602 ) * Remove pos arg from Perceiver's Pre/Postprocessors * Revert the removed pos args in public methods	2022-09-26 08:50:58 -04:00
Yih-Dar	71fc331746	Separate Push CI images from Scheduled CI (#19170 ) * separate images * Fix condition Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-26 10:55:42 +02:00
flozi00	fa4eeb4fd3	german training, accelerate and model sharing (#19171 ) * correct spelling in README * processing * german training * accelerate * german model sharing * build doc * ttf links * casing	2022-09-23 14:52:09 -04:00
Enze	5da6afdd8d	Update run_clip.py (#19130 ) The overwrite_cache parameter is declared twice.	2022-09-23 20:48:41 +02:00
Fei Wang	6395d1227f	Fixed type hint for pipelines/check_task (#19150 )	2022-09-23 20:35:19 +02:00
Tianqi Zhang (张天启)	ece762443e	Fix incorrect comments about atten mask for pytorch backend (#18728 ) * fix incorrect comments about atten mask * typo * Update for CodeGen Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-23 13:52:27 -04:00
Yih-Dar	0cea8d5555	Add offline runners info in the Slack report (#19169 ) * send slack report for offline runners Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-23 19:23:05 +02:00
Steven Anton	49bf569830	Add doctests to Perceiver examples (#19129 ) * Fix bug in example and add to tests * Fix failing tests * Check the size of logits * Code style * Try again... * Add expected loss for PerceiverForMaskedLM doctest Co-authored-by: Steven Anton <antonstv@amazon.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-23 19:19:35 +02:00
Alara Dirik	fe01ec343b	Detr preprocessor fix (#19007 ) * fix in-place preprocessing of inputs	2022-09-23 18:49:31 +03:00
Alara Dirik	7e84723fe4	Add semantic segmentation post-processing method to MobileViT (#19105 ) * add post-processing method for semantic segmentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-09-23 16:24:28 +03:00
Tim Baumgärtner	905635f5d3	[WIP] Trainer supporting evaluation on multiple datasets (#19158 ) * support for multiple eval datasets * support multiple datasets in seq2seq trainer * add documentation * update documentation * make fixup * revert option for multiple compute_metrics * revert option for multiple compute_metrics * revert added empty line	2022-09-23 09:14:53 -04:00
Wang, Yi	49629e7ba8	fix HPO DDP GPU problem (#19168 ) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2022-09-23 09:13:35 -04:00
Sylvain Gugger	8d59385f12	Fix TrainingArguments documentation (#19162 ) * Fix TrainingArguments documentation * Fix TFTrainingArguments documentation	2022-09-22 14:38:32 -04:00
Sayak Paul	3a396c59b8	fix: ckpt paths. (#19159 )	2022-09-22 11:03:01 -04:00
dependabot[bot]	74a3ea4737	Bump oauthlib in /examples/research_projects/decision_transformer (#19080 ) Bumps [oauthlib](https://github.com/oauthlib/oauthlib) from 3.2.0 to 3.2.1. - [Release notes](https://github.com/oauthlib/oauthlib/releases) - [Changelog](https://github.com/oauthlib/oauthlib/blob/master/CHANGELOG.rst) - [Commits](https://github.com/oauthlib/oauthlib/compare/v3.2.0...v3.2.1) --- updated-dependencies: - dependency-name: oauthlib dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-09-22 17:01:40 +02:00
Wang, Yi	e5b7cff5fe	update perf_train_cpu_many doc (#19151 ) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2022-09-22 09:20:15 -04:00

... 11 12 13 14 15 ...

11371 Commits