transformers/docs/source/main_classes
Nicolas Patry be236361f1
Adding batch_size support for (almost) all pipelines (#13724)
* Tentative enabling of `batch_size` for pipelines.

* Add systematic test for pipeline batching.

* Enabling batch_size on almost all pipelines

- Not `zero-shot` (it's already passing stuff as batched so trickier)
- Not `QA` (preprocess uses squad features, we need to switch to real
tensors at this boundary.

* Adding `min_length_for_response` for conversational.

* Making CTC, speech mappings avaiable regardless of framework.

* Attempt at fixing automatic tests (ffmpeg not enabled for fast tests)

* Removing ffmpeg dependency in tests.

* Small fixes.

* Slight cleanup.

* Adding docs

and adressing comments.

* Quality.

* Update docs/source/main_classes/pipelines.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/question_answering.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/zero_shot_classification.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Improving docs.

* Update docs/source/main_classes/pipelines.rst

Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

* N -> oberved_batch_size

softmax trick.

* Follow `padding_side`.

* Supporting image pipeline batching (and padding).

* Rename `unbatch` -> `loader_batch`.

* unbatch_size forgot.

* Custom padding for offset mappings.

* Attempt to remove librosa.

* Adding require_audio.

* torchaudio.

* Back to using datasets librosa.

* Adding help to set a pad_token on the tokenizer.

* Update src/transformers/pipelines/base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Quality.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2021-10-29 11:34:18 +02:00
..
callback.rst Add example for registering callbacks with trainers (#10928) 2021-04-05 12:27:23 -04:00
configuration.rst Update model configs - Allow setters for common properties (#13026) 2021-09-06 16:30:13 +02:00
data_collator.rst TF/Numpy variants for all DataCollator classes (#13105) 2021-08-31 13:06:48 +01:00
deepspeed.rst Fix typo in deepspeed documentation (#13482) 2021-09-08 11:24:10 -07:00
feature_extractor.rst Add ImageFeatureExtractionMixin (#10905) 2021-03-26 11:23:56 -04:00
keras_callbacks.rst Keras callback to push to hub each epoch, or after N steps (#13773) 2021-09-29 12:47:35 +01:00
logging.rst Logging propagation (#10092) 2021-02-09 10:27:49 -05:00
model.rst Fix push_to_hub docstring and make it appear in doc (#12770) 2021-07-17 15:52:33 +02:00
optimizer_schedules.rst Seq2seq trainer (#9241) 2020-12-22 11:33:44 -05:00
output.rst Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222) 2021-10-13 00:10:34 +02:00
pipelines.rst Adding batch_size support for (almost) all pipelines (#13724) 2021-10-29 11:34:18 +02:00
processors.rst Examples reorg (#11350) 2021-04-21 11:11:20 -04:00
tokenizer.rst Fix push_to_hub docstring and make it appear in doc (#12770) 2021-07-17 15:52:33 +02:00
trainer.rst Push to hub when saving checkpoints (#13503) 2021-09-14 08:02:15 -04:00