transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Javier de la Rosa	9eb7e9ba1d	Fix ASR pipelines from local directories with wav2vec models that have language models attached (#15590 ) * Fix loading pipelines with wav2vec models with lm when in local paths * Adding tests * Fix test * Adding tests * Flake8 fixes * Removing conflict files :( * Adding task type to test * Remove unnecessary test and imports	2022-02-15 13:45:08 +01:00
Patrick von Platen	041fdc4a7e	[SpeechEncoderDecoder] Make sure no EOS is generated in test (#15655 )	2022-02-15 09:13:55 +01:00
Sylvain Gugger	2e11a04337	Register feature extractor (#15634 ) * Rework AutoFeatureExtractor.from_pretrained internal * Custom feature extractor * Add more tests * Add support for custom feature extractor code * Clean up * Add register API to AutoFeatureExtractor	2022-02-14 13:35:16 -05:00
Sylvain Gugger	52d2e6f6e9	Add push to hub to feature extractor (#15632 ) * Add push to hub to feature extractor * Quality * Clean up	2022-02-11 17:14:01 -05:00
Sylvain Gugger	7a32e4722f	Custom feature extractor (#15630 ) * Rework AutoFeatureExtractor.from_pretrained internal * Custom feature extractor * Add more tests * Add support for custom feature extractor code * Clean up	2022-02-11 16:43:54 -05:00
Sylvain Gugger	2dce350b33	Fix _configuration_file argument getting passed to model (#15629 )	2022-02-11 13:46:08 -05:00
Joao Gante	2f40c728c9	TF MT5 embeddings resize (#15567 ) * Fix TF MT5 vocab resize * more assertive testing	2022-02-11 17:35:10 +00:00
Yih-Dar	724e51c6e6	Compute loss independent from decoder for TF EncDec models (as #14139 ) (#15175 ) * Compute loss independent from decoder (as 14139) * fix expected seq_len + style * Apply the same change to TFVisionEncoderDecoderModel * fix style * Add case with labels in equivalence test * uncomment * Add case with labels in equivalence test * add decoder_token_labels * use hf_compute_loss * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Add copied from Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2022-02-10 15:47:02 +01:00
Alberto Bégué	cb7ed6e083	Add Tensorflow handling of ONNX conversion (#13831 ) * Add TensorFlow support for ONNX export * Change documentation to mention conversion with Tensorflow * Refactor export into export_pytorch and export_tensorflow * Check model's type instead of framework installation to choose between TF and Pytorch Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Alberto Bégué <alberto.begue@della.ai> Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2022-02-10 11:18:41 +01:00
Lysandre	e923917cd9	Reformat tokenization_fnet	2022-02-09 22:23:32 -05:00
Sylvain Gugger	644ec05233	Make slow tests slow	2022-02-09 19:10:22 -05:00
Sylvain Gugger	315e67404d	Fix tests hub failure (#15580 ) * Expose hub test problem * Fix tests	2022-02-09 12:27:59 -05:00
Sylvain Gugger	b1ba03e082	Fix quality	2022-02-09 12:06:59 -05:00
Chan Woo Kim	2b5603f6ac	Constrained Beam Search [without disjunctive decoding] (#15416 ) * added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-09 16:59:26 +01:00
Clara Meister	0113aae5b7	Add implementation of typical sampling (#15504 ) * typical decoding * changing arg name * add test config params * forgotten arg rename * fix edge case where scores are same * test for typical logits warper * code quality fixes	2022-02-09 16:48:41 +01:00
Suraj Patil	f588cf4050	[Flax tests/FlaxBert] make from_pretrained test faster (#15561 )	2022-02-09 16:48:08 +01:00
Sylvain Gugger	1f60bc46f3	Make sure custom configs work with Transformers (#15569 ) * Make sure custom configs work with Transformers * Apply code review suggestions	2022-02-09 10:04:44 -05:00
Lysandre Debut	7732d0fe7a	Upgrade black to version ~=22.0 (#15565 ) * Upgrade black to version ~=22.0 * Check copies * Fix code	2022-02-09 09:28:57 -05:00
Suraj Patil	a6885db912	[Flax tests] fix test_model_outputs_equivalence (#15571 ) * fix test_model_outputs_equivalence * fix tuple outputs for blenderbot	2022-02-09 12:26:48 +01:00
Joao Gante	8406fa6dd5	Add TFSpeech2Text (#15113 ) * Add wrapper classes * convert inner layers to tf * Add TF Encoder and Decoder layers * TFSpeech2Text models * Loadable model * TF model with same outputs as PT model * test skeleton * correct tests and run the fixup * correct attention expansion * TFSpeech2Text past_key_values with TF format	2022-02-08 16:27:23 +00:00
aaron	87d08afb16	electra is added to onnx supported model (#15084 ) * electra is added to onnx supported model * add google/electra-base-generator for test onnx module Co-authored-by: Lewis Tunstall <lewis.c.tunstall@gmail.com>	2022-02-08 15:47:49 +01:00
Michael Benayoun	0fe17f375a	FX tracing improvement (#14321 ) * Change the way tracing happens, enabling dynamic axes out of the box * Update the tests and modeling xlnet * Add the non recoding of leaf modules to avoid recording more values for the methods to record than what will be seen at tracing time (which would otherwise desynchronize the recorded values and the values that need to be given to the proxies during tracing, causing errors). * Comments and making tracing work for gpt-j and xlnet * Refactore things related to num_choices (and batch_size, sequence_length) * Update fx to work on PyTorch 1.10 * Postpone autowrap_function feature usage for later * Add copyrights * Remove unnecessary file * Fix issue with add_new_model_like * Apply suggestions	2022-02-07 22:25:33 +01:00
Yih-Dar	131e258411	Fix TF T5/LED missing cross attn in return values (#15511 ) * add cross attn to outputs * add cross attn to outputs for TFLED * add undo padding * remove unused import * fix style Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-07 17:41:48 +01:00
lewtun	6775b211b6	Remove Longformers from ONNX-supported models (#15273 )	2022-02-07 17:32:13 +01:00
François REMY	7a1412e12b	Wav2Vec2 models must either throw or deal with add_adapter (#15409 ) * Wav2Vec2 models must either throw or deal with add_adapter Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add pre-add_adapter backwards compatibility * Add pre-add_adapter backwards compatibility * Fix issue in tests/test_modeling_wav2vec2.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-02-07 17:03:12 +01:00
NielsRogge	84eec9e6ba	Add ConvNeXT (#15277 ) * First draft * Add conversion script * Improve conversion script * Improve docs and implement tests * Define model output class * Fix tests * Fix more tests * Add model to README * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply more suggestions from code review * Apply suggestions from code review * Rename dims to hidden_sizes * Fix equivalence test * Rename gamma to gamma_parameter * Clean up conversion script * Add ConvNextFeatureExtractor * Add corresponding tests * Implement feature extractor correctly * Make implementation cleaner * Add ConvNextStem class * Improve design * Update design to also include encoder * Fix gamma parameter * Use sample docstrings * Finish conversion, add center cropping * Replace nielsr by facebook, make feature extractor tests smaller * Fix integration test Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-02-07 16:11:37 +01:00
Patrick von Platen	5f1918a4a8	[ASR pipeline] correct asr pipeline for seq2seq models (#15541 )	2022-02-07 15:35:44 +01:00
Sylvain Gugger	ac6aa10f23	Standardize semantic segmentation models outputs (#15469 ) * Standardize instance segmentation models outputs * Rename output * Update src/transformers/modeling_outputs.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Add legacy argument to the config and model forward * Update src/transformers/models/beit/modeling_beit.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Copy fix in Segformer Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-02-04 14:52:07 -05:00
Yih-Dar	bbe9c6981b	Fix TFRemBertEncoder all_hidden_states (#15510 ) * fix * fix test * remove expected_num_hidden_layers Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-04 16:32:14 +00:00
davidleonfdez	f1a4c4ead5	[WIP] Add preprocess_logits_for_metrics Trainer param (#15473 ) * Add preprocess_logits_for_metrics Trainer param * Compute accuracy in LM examples * Improve comments	2022-02-03 12:07:20 -05:00
Stas Bekman	4f5faaf044	[deepspeed] fix a bug in a test (#15493 ) * [deepspeed] fix a bug in a test * consistency	2022-02-03 08:55:45 -08:00
Yih-Dar	f5d98da29e	fix load_weight_prefix (#15101 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-03 15:11:53 +00:00
CHI LIU	5ec368d79e	Correct eos_token_id settings in generate (#15403 ) * Correct eos_token_id set in generate * Set eos_token_id in test * Correct eos_token_id set in generate * Set eos_token_id in test	2022-02-03 00:24:40 +01:00
SaulLu	39b5d1a63a	fix set truncation attribute in `__init__` of `PreTrainedTokenizerBase` (#15456 ) * change truncation_side in init of `PreTrainedTokenizerBase` Co-authored-by: LSinev <LSinev@users.noreply.github.com> * add test * Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`" This reverts commit `7a98b87962`. * fix kwargs * Revert "fix kwargs" This reverts commit 67b0a5270e8cf1dbf70e6b0232e94c0452b6946f. * Update tests/test_tokenization_common.py Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * delete truncation_side variable * reorganize test * format * complete doc * Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`"" This reverts commit d5a10a7e2680539e5d9e98ae5d896c893d224b80. * fix typo * fix typos to render documentation * Revert "Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`""" This reverts commit 16cf58811943a08f43409a7c83eaa330686591d0. * format Co-authored-by: LSinev <LSinev@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>	2022-02-02 23:18:09 +01:00
Ayush Chaurasia	c74f3d4c48	Add W&B backend for hyperparameter sweep (#14582 ) # Add support for W&B hyperparameter sweep This PR: * allows using wandb for running hyperparameter search. * The runs are visualized on W&B sweeps dashboard * This supports runnning sweeps on parallel devices, all reporting to the same central dashboard. ### Usage To run new a hyperparameter search: ``` trainer.hyperparameter_search( backend="wandb", project="transformers_sweep", # name of the project n_trials=5, metric="eval/loss", # metric to be optimized, default 'eval/loss'. A warning is raised if the passed metric is not found ) ``` This outputs a sweep id. Eg. `my_project/sweep_id` To run sweeps on parallel devices: Just pass sweep id which you want to run parallel ``` trainer.hyperparameter_search( backend="wandb", sweep_id = "my_project/sweep_id" ) ```	2022-02-02 14:06:14 -05:00
Sylvain Gugger	44b21f117b	Save code of registered custom models (#15379 ) * Allow dynamic modules to use relative imports * Work for configs * Fix last merge conflict * Save code of registered custom objects * Map strings to strings * Fix test * Add tokenizer * Rework tests * Tests * Ignore fixtures py files for tests * Tokenizer test + fix collection * With full path * Rework integration * Fix typo * Remove changes in conftest * Test for tokenizers * Add documentation * Update docs/source/custom_models.mdx Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add file structure and file content * Add more doc * Style * Update docs/source/custom_models.mdx Co-authored-by: Suraj Patil <surajp815@gmail.com> * Address review comments Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-02-02 10:44:37 -05:00
Nicolas Patry	623d8cb475	Adding support for `microphone` streaming within pipeline. (#15046 ) * Adding support for `microphone` streaming within pipeline. - Uses `ffmpeg` to get microphone data. - Makes sure alignment is made to `size_of_sample`. - Works by sending `{"raw": ..data.., "stride": (n, left, right), "partial": bool}` directly to the pipeline enabling to stream partial results and still get inference. - Let's `partial` information flow through the pipeline to enable caller to get it back and choose to display text or not. - The striding reconstitution is bound to have errors since CTC does not keep previous state. Currently most of the errors are we don't know if there's a space or not between two chunks. Since we have some left striding info, we could use that during decoding to choose what to do with those spaces and even extra letters maybe (if the stride is long enough, it's bound to cover at least a few symbols) Fixing tests. Protecting with `require_torch`. `raw_ctc` support for nicer demo. Post rebase fixes. Revamp to split raw_mic_data from it's live chunking. - Requires a refactor to make everything a bit cleaner. Automatic resampling. Small fix. Small fix. * Post rebase fix (need to let super handle more logic, reorder args.) * Update docstrings * Docstring format. * Remove print. * Prevent flow of `input_values`. * Fixing `stride` too. * Fixing the PR by removing `raw_ctc`. * Better docstrings. * Fixing init. * Update src/transformers/pipelines/audio_utils.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update tests/test_pipelines_automatic_speech_recognition.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Quality. Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>	2022-02-02 15:12:12 +01:00
Patrick von Platen	d718c0c3a8	[Wav2Vec2ProcessorWithLM] add alpha & beta to batch decode & decode (#15465 )	2022-02-02 12:59:40 +01:00
NielsRogge	1d94d57546	Add option to resize like torchvision's Resize (#15419 ) * Add torchvision's resize * Rename torch_resize to default_to_square * Apply suggestions from code review * Add support for default_to_square and tuple of length 1	2022-02-02 09:44:22 +01:00
SaulLu	7b8bdd8601	fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319 ) * add new test * update test * remove `tokenizer_file` from `additional_files_names` in `tokenization_utils_base.py` * add `tokenizer_file` for the fast only tokenizer * change global variables layoutxml * remove `"tokenizer_file"` from DPR tokenizer's Global variables * remove `tokenizer_file` from herbert slow tokenizer init * `"tokenizer_file"` from LED tokenizer's Global variables * remove `tokenizer_file` from mbart slow tokenizer init * remove `tokenizer_file` from slow tokenizer template * adapt to versioning * adapt the `test_tokenizer_mismatch_warning` test * clean test * clarify `VOCAB_FILES_NAMES` in tokenization_utils_fast.py * Revert "remove `tokenizer_file` from mbart slow tokenizer init" This reverts commit `0dbb723fa9`. * Revert "`"tokenizer_file"` from LED tokenizer's Global variables" This reverts commit `5a3f879bdd`. * Revert "remove `tokenizer_file` from herbert slow tokenizer init" This reverts commit `f5e10007b7`. * Revert "remove `"tokenizer_file"` from DPR tokenizer's Global variables" This reverts commit `da0895330b`. * set `tokenizer_file` in super `__init__` of mbart	2022-02-01 16:48:25 +01:00
SaulLu	6d585fe0f0	replace assert with exception for padding_side arg in `PreTrainedTokenizerBase` `__init__` (#15454 ) * replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__` * add test * fix kwargs * reformat test * format * format * fix typo to render the documentation	2022-02-01 16:13:58 +01:00
Yih-Dar	dc05dd539f	Fix TF Causal LM models' returned logits (#15256 ) * Fix TF Causal LM models' returned logits * Fix expected shape in the tests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-01 11:04:07 +00:00
Yih-Dar	af5c3329d7	remove "inputs" in tf common test script (no longer required) (#15262 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-02-01 10:09:49 +00:00
Suraj Patil	d4f201b860	skip test for XGLM (#15445 )	2022-01-31 16:53:16 -05:00
peregilk	125a2882b4	Update modeling_wav2vec2.py (#15423 ) * Update modeling_wav2vec2.py With very tiny sound files (less than 0.1 seconds) the num_masked_span can be too long. The issue is described in issue #15366 and discussed with @patrickvonplaten. * correct errors with mask time indices * remove bogus file * make fix-copies Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-01-31 21:22:11 +01:00
Tavin Turner	d984b10335	Add 'with torch.no_grad()' to BEiT integration test forward passes (#14961 ) * Add 'with torch.no_grad()' to BEiT integration test forward pass * Fix inconsistent use of tabs and spaces in indentation	2022-01-31 15:12:10 -05:00
Yih-Dar	5a70987301	Fix TFLEDModel (#15356 ) * fix tf led * fix * fix * Add test_pt_tf_model_equivalence_extra for TFLED * add a (temporary) test Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-01-31 19:35:54 +01:00
Sylvain Gugger	3385ca2582	Change REALM checkpoint to new ones (#15439 ) * Change REALM checkpoint to new ones * Last checkpoint missing	2022-01-31 12:50:20 -05:00
NielsRogge	d4b3e56d64	[Hotfix] Fix Swin model outputs (#15414 ) * Fix Swin model outputs * Rename pooler	2022-01-31 16:32:14 +01:00
Yih-Dar	f380bf2b61	Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel (#15298 ) * Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel * overwrite test_loss_computation Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-01-29 15:08:35 +00:00

1567 Commits