transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

Author	SHA1	Message	Date
Alara Dirik	cd2457809f	Improve OWL-ViT postprocessing (#20980 ) * add post_process_object_detection method * style changes	2023-01-03 19:25:09 +03:00
NielsRogge	9c6f7485a6	Add GIT (GenerativeImage2Text) (#20295 ) * First draft * Make model instantiation work * Fix copied from statement * More fixes * Add correct output head * Improve configuration * Add conversion script * Improve conversion script * Remove token_type_ids * Fix conversion of projection layers * Convert all weights * Use cats image * Make logits match * Generate caption on cats image * Add GITProcessor * Update conversion script * Add support for more checkpoints * Fix conversion script * Add initial tests * Remove cross-attention * More improvements * Remove is_decoder * Improve model tests * Improve tests * Improve model outputs * Fix model outputs equivalence * Fix more tests * Remove unused code * Use generate to generate text, no use of cache for now * Use generate more appropriately * Fix config tests * Fix style * Add support for use_cache Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Fix style * Fix GIT vision encoder * Update README * Fix integration test * Set bos and eos token ids * Improve docs * Improve code * Add support for provided attention_mask * Add copied from statement * Fix gradient checkpointing test * Set model_input_names * Investigate model_input_names * Remove script * Fix model inputs * Fix docstring * Rename GIT to Git * Support more models * Add support for textvqa model * Add video support * Extend conversion script for video * Add support for large variant * Add support for more models * Fix config archive map * Update integration test * Fix README * Fix CLIP mean and std * Update processor * Fix use_cache for video, thanks @gante * Remove print statements * Remove assertion * Add processor tests * Fix model_input_names * Use Auto API for processor * Fix processor tests * Fix integration test * Fix pipeline test * Make tests faster * Update conversion script * Update conversion script * Convert more checkpoints * Update conversion script * Fix typo * Update docstrings * Improve code snippets * 
Fix doc tests * Add more code examples * Fix doc tests * Add integration tests * Fix unused variable * revert * Add GIT to Japanese README Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-01-03 14:17:18 +01:00
bofeng huang	47c9b22d08	Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952 ) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs	2022-12-31 01:13:28 -05:00
bofeng huang	fe65657de1	Fix FP16 inference in TextGenerationPipeline (#20913 ) * add torch_dtype attribute to Pipeline * Use torch_dtype to cast input tensor type in AutomaticSpeechRecognitionPipeline * Fix code quality * Add TextGenerationPipeline fp16 test * Fix code quality * Remove useless require in tests Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>	2022-12-29 02:19:25 -05:00
Nicolas Patry	f7f0ec2f54	Adding support for `fp16` for asr pipeline. (#20864 ) * Supporting `fp16` for asr pipeline * Adding test. * Style. * Oops. * Flake8 update ? * Fixing flake8 ? * Revert "Flake8 update ?" This reverts commit `0b917fcb52`. * Style (acctidentally deleted flake8 F401.) * Move to a bigger test (no small whisper model, and s2t doesn't seem to accept torch_dtype=fp16). Also we need to use a GPU to actually compute on fp16. * Using BatchFeature capability.	2022-12-23 10:18:45 +01:00
Andreas Madsen	b4b613b102	Implement Roberta PreLayerNorm (#20305 ) * Copy RoBERTa * formatting * implement RoBERTa with prelayer normalization * update test expectations * add documentation * add convertion script for DinkyTrain weights * update checkpoint repo Unfortunately the original checkpoints assumes a hacked roberta model * add to RoBERTa-PreLayerNorm docs to toc * run utils/check_copies.py * lint files * remove unused import * fix check_repo reporting wrongly a test is missing * fix import error, caused by rebase * run make fix-copies * add RobertaPreLayerNormConfig to ROBERTA_EMBEDDING_ADJUSMENT_CONFIGS * Fix documentation <Facebook> -> Facebook Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup: Fix documentation <Facebook> -> Facebook Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add missing Flax header Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * expected_slice -> EXPECTED_SLICE Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * update copies after rebase * add missing copied from statements * make fix-copies * make prelayernorm explicit in code * fix checkpoint path for the original implementation * add flax integration tests * improve docs * update utils/documentation_tests.txt * lint files * Remove Copyright notice Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make fix-copies * Remove EXPECTED_SLICE calculation comments Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-12-19 09:30:17 +01:00
Nicolas Patry	3ee958207a	Fix object detection2 (#20798 ) * Revert "Fixing object detection with `layoutlm` (#20776)" This reverts commit `fca66abe2a`. * Better fix for layoutlm object detection. * Style.	2022-12-16 13:25:36 +01:00
Younes Belkada	4341f4e224	[Pipeline] skip feature extraction test if in `IMAGE_PROCESSOR_MAPPING` (#20790 ) skip feature extraction test if in `IMAGE_PROCESSOR_MAPPING`	2022-12-16 12:46:58 +01:00
Nicolas Patry	fca66abe2a	Fixing object detection with `layoutlm` (#20776 ) * Fixing object detection with layoutlm. * Fixup.	2022-12-15 18:46:43 +01:00
Younes Belkada	8891193e83	[Pipeline] fix failing bloom `pipeline` test (#20778 ) fix failing `pipeline` test	2022-12-15 18:46:00 +01:00
Nicolas Patry	a9912d2fca	Even more validation. (#20762 ) * Even more validation. * Fixing order.	2022-12-15 10:05:54 +01:00
Yih-Dar	a12c5cbcd8	Change a logic in pipeline test regarding TF (#20710 ) * Fix the pipeline test regarding TF * Fix the pipeline test regarding TF * update comment Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-13 13:42:36 +01:00
Nicolas Patry	53357e8196	Adding ValueError when imcompatible parameters are used. (#20729 )	2022-12-12 15:39:13 +01:00
Nathan Raw	9e56aff58a	Add video classification pipeline (#20151 ) * 🚧 wip video classification pipeline * 🚧 wip - add is_decord_available check * 🐛 add missing import * ✅ add tests * 🔧 add decord to setup extras * 🚧 add is_decord_available * ✨ add video-classification pipeline * 📝 add video classification pipe to docs * 🐛 add missing VideoClassificationPipeline import * 📌 add decord install in test runner * ✅ fix url inputs to video-classification pipeline * ✨ updates from review * 📝 add video cls pipeline to docs * 📝 add docstring * 🔥 remove unused import * 🔥 remove some code * 📝 docfix	2022-12-08 16:22:43 -05:00
Yih-Dar	cec5f7abd1	Update summarization `run_pipeline_test` (#20623 ) * update summarization run_pipeline_test * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-07 15:46:12 +01:00
Yih-Dar	9b14c1b6bf	Fix `AutomaticSpeechRecognitionPipelineTests.run_pipeline_test` (#20597 ) * Remove assert exception not triggered * Fix wrong expected exception string * fix * use assertRaisesRegex Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-06 15:48:49 +01:00
Arthur	538e5248b0	Ci-whisper-asr (#20588 ) * Expected output for the test changed * fix failing asr test	2022-12-05 16:50:38 +01:00
Yih-Dar	cc8aec6740	Add `require_torch` to 2 pipeline tests (#20585 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-12-05 16:06:39 +01:00
NielsRogge	4973d2a04c	Add Audio Spectogram Transformer (#19981 ) * First draft * Make conversion script work * Add id2label mapping, run code quality * Fix copies * Add first draft of feature extractor * Update conversion script to use feature extractor * Make more tests pass * Add docs * update input_features to input_values + pad by default to max length * Fix doc tests * Add feature extractor tests * Add proper padding/truncation to feature extractor * Add support for conversion of all audioset checkpoints * Improve docs and extend conversion script * Fix README * Rename spectogram to spectrogram * Fix copies * Add integration test * Remove dummy conv * Update to ast * Update organization * Fix init * Rename model to AST * Add require_torchaudio annotator * Move import of ASTFeatureExtractor under a is_speech_available * Fix rebase * Add pipeline config * Update name of classifier head * Rename time_dimension and frequency_dimension for clarity * Remove print statement * Fix pipeline test * Fix pipeline test * Fix index table * Fix init * Fix conversion script * Rename to ForAudioClassification * Fix index table Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-11-21 18:58:54 +01:00
Nicolas Patry	8e777b3ba4	[Proposal] Breaking change `zero-shot-object-detection` for improved consistency. (#20280 ) * [Proposal] Breaking change `zero-shot-object-detection` for improved consistency. This is a proposal to modify the output of `zero-shot-object-detection` to provide better alignment with other pipelines. The output is now strictly the same as `object-detection` whereas before it would output lists of lists. The name `candidate_labels` is used throughout for consistency with other `zero-shot` pipelines. The pipeline is changed to `ChunkPipeline` to support batching cleanly. This removes all the lists and list of lists shenanigans, it's now a matter of the base pipeline handling all this not this specific one. Breaking change: It did remove complex calls potentials `pipe(images = [image1, image2], text_queries=[candidates1, candidates2])` to support only `pipe([{"image": image1, "candidate_labels": candidates1}, {"image": image2, "candidate_labels": candidates2}])` when dealing with lists and/or datasets. We could keep them, but it will add a lot of complexity to the code base, since the pipeline is rather young, I'd rather break to keep the code simpler, but we can revert this. Breaking change: The name of the argument is now `image` instead of `images` since it expects by default only 1 image. This is revertable like the previous one. Breaking change: The types is now simplified and flattened: `pipe(inputs) == [{object1}, {object2}]` instead of the previous `pipe(inputs) == [[{object1}, {object1}], [{object2}]]` Where the different instances would be grouped by candidate labels within lists. IMHO this is not really desirable, since it would output empty lists and is only adding superflous indirection compared to `zero-shot-object-detection`. It is relatively change free in terms of how the results, it does change computation however since now the batching is handled by the pipeline itself. 
It did change the results for the small models so there seems to be a real difference in how the models handle this. * Fixing the doctests. * Behind is_torch_available.	2022-11-18 15:57:28 +01:00
Younes Belkada	163ac3d3ee	Add Switch transformers (#19323 ) * first commit * add more comments * add router v1 * clean up - remove `tf` modeling files * clean up - remove `tf` modeling files * clean up * v0 routers * added more router - Implemented `ExpertsChooseMaskedRouter` - added tests - 2 more routers to implement * last router * improved docstring - completed the docstring in `router.py` - added more args in the config * v0 sparse mlp * replace wrong naming * forward pass run * update MOE layer * small router update * fixup * consistency * remove scatter router * remove abstract layer * update test and model for integration testing * v1 conversion * update * hardcode hack * all keys match * add gin conversion, without additional libraries * update conversion sctipy * delete router file * update tests wrt router deletion * fix router issues * update expert code * update, logits match, code needsREFACTORING * Refactor code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * add generate tests Co-authored-by: younesbelkada <younesbelkada@gmail.com> * add support for router loss Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fix forward error * refactor a bit * remove `FlaxSwitchTransformers` modules * more tests pass * Update code Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com> * fixup * fix tests * fix doc * fix doc + tokenization * fix tokenizer test * fix test * fix loss output * update code for backward pass * add loss support * update documentation * fix documentation, clean tokenizer * more doc fix, cleanup example_switch * fix failing test * fix test * fix test * fix loss issue * move layer * update doc and fix router capacity usage * fixup * add sparse mlp index for documentation on hub * fixup * test sparse mix architecture * Apply suggestions from code review * Update docs/source/en/model_doc/switch_transformers.mdx * fixup on update * fix tests * fix another test * attempt fix * 
Update src/transformers/models/switch_transformers/configuration_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * try * all tests pass * fix jitter noise * Apply suggestions from code review * doc tests pass * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * remove assert * change config order * fix readme japanese * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove parallelizable tests + add one liners * remove ONNX config * fix nits - add `T5Tokenizer` in auto mapping - remove `Switch Transformers` from ONNX supported models * remove `_get_router` * remove asserts * add check in test for `router_dtype` * add `SwitchTransformersConfig` in `run_pipeline_test` * Update tests/pipelines/test_pipelines_summarization.py * add huge model conversion script * fix slow tests - add better casting for `Linear8bitLt` - remove `torchscript` tests * add make dir * style on new script * fix nits - doctest - remove `_keys_to_ignore_on_load_unexpected` * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py * add google as authors * fix year * remove last `assert` statements * standardize vertical spaces * fix failing import * fix another failing test * Remove strange `authorized_keys` * removing todo and padding that is never used Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: ybelkada <younes@huggingface.co> Co-authored-by: Younes Belkada 
<younesbelkada@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur Zucker <arthur@huggingface.co>	2022-11-15 13:06:45 +01:00
Yih-Dar	f9909fbf85	Make `ImageSegmentationPipelineTests` less flaky (#20147 ) * Fix ImageSegmentationPipelineTests * Use 0.9 * no zip * links to show images * links to show images * rebase Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-15 09:14:55 +01:00
Nicolas Patry	25c451e5a0	Adding chunking for whisper (all seq2seq actually). Very crude matching algorithm. (#20104 ) * Very crude matching algorithm. * Fixing tests. * Removing comments * Adding warning + fix short matches. * Cleanup tests. * Quality. * Less noisy. * Fixup.	2022-11-14 22:32:50 +01:00
Bartosz Szmelczynski	78a471ff71	Fix tapas scatter (#20149 ) * First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-14 01:04:26 -05:00
Sylvain Gugger	9740a03f61	Skip broken test	2022-11-10 14:59:32 -05:00
Nicolas Patry	d066c3731b	Adding support for LayoutLMvX variants for `object-detection`. (#20143 ) * Adding support for LayoutLMvX variants for `object-detection`. * Revert bogs `layoutlm` feature extractor which does not exist (it was a V2 model) . * Updated condition. * Handling the comments.	2022-11-10 11:33:38 +01:00
Nicolas Patry	ec6878f6ca	Now supporting pathlike in pipelines too. (#20030 )	2022-11-03 09:14:45 +01:00
Nicolas Patry	5fd5990dce	Factored out some code in the `image-segmentation` pipeline. (#19727 ) * Factored out some code in the image-segmentation pipeline Re-enable `small_model_pt`. Re-enable `small_model_pt`. Enabling the current test with the current values. Debugging the values on the CI. More logs ? Printing doesn't work ? Using the CI values instead. Seems to be a Pillow sensitivity. Added a test showcasing that models not supporting some tasks get a clear error. Factored out code. Further factor out. Fixup. Bad rebase. Put `panoptic` before `instance` as it should be a superset. * Fixing tests. * Adding subtasks tests + Fixes `instance` segmentation which was broken due to default and non kwargs arguments. * Fix bad replace.	2022-10-26 10:44:36 +02:00
Rak Alexey	d3f4cef74d	fix image2test args forwarding (#19648 ) * fix image2test args forwarding * fix issues * Proposing the update to the PR. * Fixup. Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>	2022-10-24 09:49:24 -04:00
Alara Dirik	cca51aa151	Fix image segmentation pipeline errors, resolve backward compatibility issues (#19768 ) * Fix panoptic segmentation and pipeline * Update ImageSegmentationPipeline tests and reenable test_small_model_pt * Resolve backward compatibility issues	2022-10-21 18:09:58 +03:00
Yih-Dar	3aaabaa214	Update `ImageToTextPipelineTests.test_small_model_tf` (#19785 ) * update expected values for the correct TF checkpoint * Run test * Clean up * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-21 14:35:20 +02:00
Nicolas Patry	a40386669f	`image-segmentation` pipeline: re-enable `small_model_pt` test. (#19716 ) * Re-enable `small_model_pt`. Re-enable `small_model_pt`. Enabling the current test with the current values. Debugging the values on the CI. More logs ? Printing doesn't work ? Using the CI values instead. Seems to be a Pillow sensitivity. * Update src/transformers/pipelines/image_segmentation.py Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>	2022-10-20 11:57:11 +02:00
Yih-Dar	bed2edb99f	Specify TF framework explicitly in more pipeline tests (#19748 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-19 16:24:03 +02:00
David Yang	a23819ed6a	Clean up deprecation warnings (#19654 ) * Clean up deprecation warnings Notes: Changed some strings in tests to raw strings, which will change the literal content of the strings as they are fed into whatever machine handles them. Test cases for past in the past/past_key_values switch changed/removed due to warning of impending removal * Add PILImageResampling abstraction for PIL.Image.Resampling	2022-10-18 13:34:47 -04:00
Yih-Dar	06a82a49ae	Specify TF framework in TF-related pipeline tests (#19719 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-18 17:40:28 +02:00
Nicolas Patry	63d13d768b	Improving `image-segmentation` pipeline tests. (#19710 ) This PR (https://github.com/huggingface/transformers/pull/19367) introduced a few breaking changes: - Removed an argument `mask_threshold`. - Broke the default behavior (instance vs panoptic in the function call) https://github.com/huggingface/transformers/pull/19367/files#diff-60f846b86fb6a21d4caf60f5b3d593a04accb8f248de3029cccae2ff898c5bc3R119-R120 - Broke the actual masks: https://github.com/huggingface/transformers/pull/1961 This PR is the start of a handful that will aim at bringing back the old behavior(s). - tests should not have to specify `task` by default, unless we want to modify the behavior and have a lower form of segmentation running) - `test_small_model_pt` should be working. This specific PR starts with adding more information to the masks hash because missing the actual mask was actual easy to miss (the hashes do change, but it was easy to miss that one code path wasn't properly updated). So we go from a simple `hash` to ``` {"hash": #smaller hash, "shape": (h, w), "white_pixels": n} ``` The `shape` should help make sure the interpolation of the mask works correctly, the `white_pixels` hopefully helps detect big regressions in their amount when the hash gets modified.	2022-10-18 16:33:53 +02:00
Nicolas Patry	ee2a80ecc0	add return_tensors parameter for feature_extraction 2 (#19707 ) * add return_tensors parameter for feature_extraction w/ test add return_tensor parameter for feature extraction Revert "Merge branch 'feature-extraction-return-tensor' of https://github.com/ajsanjoaquin/transformers into feature-extraction-return-tensor" This reverts commit d559da743b87914e111a84a98ba6dbb70d08ad88, reversing changes made to bbef89278650c04c090beb65637a8e9572dba222. call parameter directly Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Fixup. Update src/transformers/pipelines/feature_extraction.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix the imports. * Fixing the test by not overflowing the model capacity. Co-authored-by: AJ San Joaquin <ajsanjoaquin@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-18 16:29:00 +02:00
Nicolas Patry	713eab45d3	🚨 🚨 🚨 [Breaking change] Deformable DETR intermediate representations (#19678 ) * [Breaking change] Deformable DETR intermediate representations - Fixes naturally the `object-detection` pipeline. - Moves from `[n_decoders, batch_size, ...]` to `[batch_size, n_decoders, ...]` instead. * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-18 09:00:39 -04:00
Arthur	d356b89f3c	fix test whisper with new max length (#19668 )	2022-10-18 08:56:37 +02:00
Sylvain Gugger	f2ecb9eec4	Revert "add return_tensor parameter for feature extraction (#19257 )" (#19680 ) This reverts commit `35bd089a24`.	2022-10-17 11:56:29 -04:00
Ayrton San Joaquin	35bd089a24	add return_tensor parameter for feature extraction (#19257 ) * add return_tensors parameter for feature_extraction w/ test add return_tensor parameter for feature extraction Revert "Merge branch 'feature-extraction-return-tensor' of https://github.com/ajsanjoaquin/transformers into feature-extraction-return-tensor" This reverts commit d559da743b87914e111a84a98ba6dbb70d08ad88, reversing changes made to bbef89278650c04c090beb65637a8e9572dba222. * call parameter directly Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * Fixup. * Update src/transformers/pipelines/feature_extraction.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-17 11:17:26 -04:00
Ankur Goyal	cbc1abc4af	A few CI fixes for `DocumentQuestionAnsweringPipeline` (#19584 ) * Fixes * update expected values * style * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-17 15:35:27 +02:00
Matt	3b3024da70	TF port of ESM (#19587 ) * Partial TF port for ESM model * Add ESM-TF tests * Add the various imports for TF-ESM * TF weight conversion almost ready * Stop ignoring the decoder weights in PT * Add tests and lots of fixes * fix-copies * Fix imports, add model docs * Add get_vocab() to tokenizer * Fix vocab links for pretrained files * Allow multiple inputs with a sep * Use EOS as SEP token because ESM vocab lacks SEP * Correctly return special tokens mask from ESM tokenizer * make fixup * Stop testing unsupported embedding resizing * Handle TF bias correctly * Skip all models with slow tokenizers in the token classification test * Fixing the batch/unbatcher of pipelines to accomodate the `None` being passed around. * Fixing pipeline bug caused by slow tokenizer being different. * Update src/transformers/models/esm/modeling_tf_esm.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/esm/modeling_tf_esm.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/esm/modeling_tf_esm.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update set_input_embeddings and the copyright notices Co-authored-by: Your Name <you@example.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2022-10-17 14:16:16 +01:00
Sivaudha	8aad4363d8	Fix pipeline predict transform methods (#19657 ) * Remove key word argument X from pipeline predict and transform methods As __call__ of pipeline clasees require one positional argument, passing the input as a keyword argument inside predict, transform methods, causing __call__ to fail. Hence in this commit the keyword argument is modified into positional argument. * Implement basic tests for scikitcompat pipeline interface * Seperate tests instead of running with parameterized based on framework as both frameworks will not be active at the same time	2022-10-17 09:06:20 -04:00
Nicolas Patry	463226e2ee	Improve error messaging for ASR pipeline. (#19570 ) * Improve error messaging for ASR pipeline. - Raise error early (in `_sanitize`) so users don't waste time trying to run queries with invalid params. - Fix the error was after using `config.inputs_to_logits_ratio` so our check was masked by the failing property does not exist. - Added some manual check on s2t for the error message. No non ctc model seems to be used by the default runner (they are all skipped). * Removing pdb. * Stop the early error it doesn't really work :(.	2022-10-14 17:12:21 +02:00
Yih-Dar	62f28bc152	Fix `ImageToTextPipelineTests.test_small_model_tf` (#19565 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-10-14 16:29:54 +02:00
amyeroberts	83a2e694f1	Cast masks to np.unit8 before converting to PIL.Image.Image (#19616 ) * Cast masks to np.unit8 before converting to PIL.Image.Image * Update tests * Fixup	2022-10-14 09:30:45 -04:00
Ritik Nandwal	e94384e4d8	Add depth estimation pipeline (#18618 ) * Add initial files for depth estimation pipelines * Add test file for depth estimation pipeline * Update model mapping names * Add updates for depth estimation output * Add generic test * Hopefully fixing the tests. * Check if test passes * Add make fixup and make fix-copies changes after rebase with main * Rebase with main * Fixing up depth pipeline. * This is not used anymore. * Fixing the test. `Image` is a module `Image.Image` is the type. * Update docs/source/en/main_classes/pipelines.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-12 08:54:20 -04:00
Quancore	70a058bc65	Added tokenize keyword arguments to feature extraction pipeline (#19382 ) * Added tokenize keyword arguments to feature extraction pipeline * Reverted truncation parameter * Import numpy moved to top	2022-10-11 12:54:41 -04:00
Ankur Goyal	a3008c5a6d	Implement multiple span support for DocumentQuestionAnswering (#19204 ) * Implement multiple span support * Address comments * Add tests + fix bugs	2022-10-11 10:47:55 -04:00
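
Commit `63d13d768b` above describes replacing a bare mask `hash` in the `image-segmentation` tests with a small record holding a shorter hash, the mask `shape`, and a `white_pixels` count. A minimal sketch of that idea follows. The helper name `summarize_mask`, the use of MD5, the 10-character truncation, and the list-of-rows mask representation are all assumptions made for illustration; this is not the helper used in the transformers test suite.

```python
import hashlib


def summarize_mask(mask):
    """Summarize a 2D binary mask given as rows of 0/255 integers.

    Hypothetical helper: the name, MD5, and the 10-character
    truncation are illustrative choices, not the library's own.
    """
    height = len(mask)
    width = len(mask[0]) if height else 0
    # Flatten the mask row by row into raw bytes so it can be hashed.
    flat = bytes(pixel for row in mask for pixel in row)
    return {
        "hash": hashlib.md5(flat).hexdigest()[:10],
        "shape": (height, width),
        "white_pixels": sum(1 for pixel in flat if pixel == 255),
    }


mask = [
    [0, 255, 255],
    [0, 0, 255],
]
summary = summarize_mask(mask)
print(summary["shape"], summary["white_pixels"])
```

As the commit message notes, `shape` helps confirm that mask interpolation produced the expected size, and `white_pixels` makes a large regression visible even when the hash alone would only show that something changed.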


110 Commits