transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Anton Lozhkov	ce01122a3b	[Tests] Fix DistilHubert path (#14245 ) * Add audio-classification benchmarking results * fix distilhubert path	2021-11-02 17:53:50 +03:00
Yih-Dar	4a394cf53f	Fix test_configuration_tie in FlaxEncoderDecoderModelTest (#14076 ) * check test_configuration_tie * Fix test_configuration_tie * make test slow again * Remove property and use model.module.bind * revert to slow test Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2021-11-02 15:32:41 +05:30
Li-Huai (Allan) Lin	a767276fdd	Fix generation docstring (#14216 ) * Fix generation docstring * Style	2021-11-02 09:22:45 +01:00
NielsRogge	e20faa6f03	Add BeitForSemanticSegmentation (#14096 ) * Add first draft * Make forward pass work * Improve conversion script * Add notebook that checks if it works * Add BeitForSemanticSegmentation to the tests * More improvements * Make BeitForSemanticSegmentation consistent with Segformer * Small bug fix * Add BeitForSemanticSegmentation to docs * Make sure model doesn't output hidden states when the user doesn't want to * Make it possible to convert the large model * Fix issue * Fix conversion script for large model * Add auxiliary_head option to semantic segmentation model * Apply suggestions from @sgugger's review * Apply suggestions from code review * Fix failing test Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2021-11-01 19:55:45 +01:00
Walter Martin	8b32578119	improving efficiency of mlflow metric logging (#14232 ) Signed-off-by: Walter Martin <wamartin@microsoft.com>	2021-11-01 13:46:11 -04:00
Suraj Patil	ce91bf9a34	[GPTJ] enable common tests and few fixes (#14190 ) * enable common tests, small fixes * don't tie word embeds * don't ignore lm_head	2021-11-01 22:38:52 +05:30
mathor	70d5711848	Fix a writing issue in the comments of trainer.py (#14202 )	2021-11-01 09:24:03 -04:00
Prabhudatta Das	33fb98338e	Raising exceptions instead of using assertions for few models (#14219 ) * raising exceptions instead of using assertions for few models * fixed formatting issues * fixing copy inconsistencies	2021-11-01 08:53:13 -04:00
Nicolas Patry	999540dfe0	Tensor location is already handled (#14224 ) in `base.py` not in subclasses.	2021-11-01 08:42:27 -04:00
Nicolas Patry	323f28dce2	Fixing `image-segmentation` tests. (#14223 )	2021-11-01 08:25:34 -04:00
NielsRogge	7396095af7	Update README of QA examples (#14172 )	2021-11-01 12:52:22 +01:00
Yih-Dar	9450bfcc6c	Add more missing models to models/__init__.py (#14177 ) * Add missing models to models/__init__.py * Fix issues previously undetected * Add UniSpeechSatForPreTraining to all_model_classes * fix unispeech sat * fix * Add check_model_list() to check_repo.py * Remove _ignore_models = ["bort"] Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>	2021-11-01 10:52:36 +00:00
Lysandre	9fc1951711	Docs for v4.12.2	2021-10-29 14:51:05 -04:00
Lysandre	513fa30a63	Docs for v4.12.1	2021-10-29 13:49:50 -04:00
Lysandre Debut	63d91f449c	Torch 1.10 (#14169 ) * Torch 1.10 * torch scatter for 1.10 * style * Skip tests ok	2021-10-29 13:43:43 -04:00
Haram Lee	e823d8198a	Add a condition for checking labels (#14211 )	2021-10-29 13:12:10 -04:00
Nicolas Patry	b338596346	Fixing image segmentation with inference mode. (#14204 ) * Fixing image segmentation for inference mode. * Update src/transformers/pipelines/base.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2021-10-29 11:24:09 -04:00
Sylvain Gugger	c28bc80bbb	Generalize problem_type to all sequence classification models (#14180 ) * Generalize problem_type to all classification models * Missing import * Deberta BC and fix tests * Fix template * Missing imports * Revert change to reformer test * Fix style	2021-10-29 10:32:56 -04:00
Sylvain Gugger	4ab6a4a086	Fix pipeline tests env and fetch (#14209 ) * Fix pipeline tests env and fetch * Fix quality	2021-10-29 09:35:05 -04:00
Nicolas Patry	dc540dd316	Adding `handle_long_generation` paramters for `text-generation` pipeline. (#14118 ) * Adding `handle_long_generation` paramters for `text-generation` pipeline. * More error handling * Fixing tests by dropping tf support on this functionality, it needs `max_new_tokens` to make it possible to understand user's intent. Otherwise, `max_length` == `tokenizer.model_max_length` < input_ids.shape[0]. * Fixing doc ? * Doc ? * Remove link from doc. * Catched an issue on roberta. * Damn doc. * Non BC proposal ? * Cleaning the fix ? * Finally using only a test override. * Don't need to modify this. * Bad print.	2021-10-29 15:29:28 +02:00
Daniel Stancl	d37f1fb8ba	Add `BlenderbotTokenizerFast` (#13720 ) * Add the support for the fast (rust) implementation of BlenbderbotTokenizer * Fix a converter and a typo in a doc * Apply the patil-suraj's suggestion * (Nitpick) Fast tokenization -> Fast Tokenization in doc * Apply the SaulLu's suggestion * Apply Narsil's suggestion to fix test pipelines * Add encoder_no_repeat_ngram_size according to the Narsil's suggestion * Revert the last (unnecessary) commit * Override pipeline config for Blenderbot to allow for larger pos. emb. * make fix-copies	2021-10-29 09:19:01 -04:00
Thomas Wang	5b45422b58	Remove n_ctx from configs (#14165 ) * Remove n_ctx from configs * Fix GPTJ and OpenAIGPT, both are acceptable breaking changes as there are no configs such that it breaks * Remove unecessary n_positions from TFOpenAIGPT	2021-10-29 11:50:25 +02:00
Nicolas Patry	be236361f1	Adding `batch_size` support for (almost) all pipelines (#13724 ) * Tentative enabling of `batch_size` for pipelines. * Add systematic test for pipeline batching. * Enabling batch_size on almost all pipelines - Not `zero-shot` (it's already passing stuff as batched so trickier) - Not `QA` (preprocess uses squad features, we need to switch to real tensors at this boundary. * Adding `min_length_for_response` for conversational. * Making CTC, speech mappings avaiable regardless of framework. * Attempt at fixing automatic tests (ffmpeg not enabled for fast tests) * Removing ffmpeg dependency in tests. * Small fixes. * Slight cleanup. * Adding docs and adressing comments. * Quality. * Update docs/source/main_classes/pipelines.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/question_answering.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/zero_shot_classification.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improving docs. * Update docs/source/main_classes/pipelines.rst Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com> * N -> oberved_batch_size softmax trick. * Follow `padding_side`. * Supporting image pipeline batching (and padding). * Rename `unbatch` -> `loader_batch`. * unbatch_size forgot. * Custom padding for offset mappings. * Attempt to remove librosa. * Adding require_audio. * torchaudio. * Back to using datasets librosa. * Adding help to set a pad_token on the tokenizer. * Update src/transformers/pipelines/base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Quality. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>	2021-10-29 11:34:18 +02:00
David del Río Medina	4469010c1b	Replace assertions with RuntimeError exceptions (#14186 )	2021-10-28 17:17:43 -04:00
Patrick von Platen	ba71f1b57f	Update README.md	2021-10-28 19:43:05 +02:00
Lysandre	b8fad022a0	v4.13.0.dev0	2021-10-28 12:56:46 -04:00
Lysandre	62bf536631	Release v4.12.0	2021-10-28 12:09:49 -04:00
NielsRogge	5f3bf65111	Fix EncoderDecoderModel docs (#14197 ) * Fix docs * Apply suggestions from review + fix bug	2021-10-28 18:01:00 +02:00
NielsRogge	ac12a5ae47	Fix EncoderDecoderModel classes to be more like BART and T5 (#14139 ) * First draft * Make tuple output more readable * Replace assertions by value errors * Make it possible to predict_with_generate for vision and speech models * Adapt Seq2SeqTrainer to work with VisionEncoderDecoder/SpeechEncoderDecoder * Add deprecation warning * Add copied from statements to vision and speech encoder decoders * Fix failing test * Apply @patrickvonplaten's suggestion * Use reshape instead of view for consistency	2021-10-28 15:29:04 +02:00
Anton Lozhkov	1251072f46	Fix SEW-D implementation differences (#14191 ) * Fix SEW-D * Update tests * isort	2021-10-28 16:22:18 +03:00
Anton Lozhkov	78b6a2ecbd	Add audio-classification benchmarking results (#14192 )	2021-10-28 15:59:18 +03:00
NielsRogge	1dc96a760d	Add SegFormer (#14019 ) * First draft * Make style & quality * Improve conversion script * Add print statement to see actual slice * Make absolute tolerance smaller * Fix image classification models * Add post_process_semantic method * Disable padding * Improve conversion script * Rename to ForSemanticSegmentation, add integration test, remove post_process methods * Improve docs * Fix code quality * Fix feature extractor tests * Fix tests for image classification model * Delete file * Add is_torch_available to feature extractor * Improve documentation of feature extractor methods * Apply suggestions from @sgugger's code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply some more suggestions of code review * Rebase with master * Fix rebase issues * Make sure model only outputs hidden states when the user wants to * Apply suggestions from code review * Add pad method * Support padding of 2d images * Add print statement * Add print statement * Move padding method to SegformerFeatureExtractor * Fix issue * Add casting of segmentation maps * Add test for padding * Add small note about padding Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-28 08:23:52 -04:00
Stas Bekman	123cce6ffc	[modeling_utils] respect original dtype in _get_resized_lm_head (#14181 ) * respect dtype in _get_resized_lm_head * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * consistency Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-27 19:01:50 -07:00
Patrick von Platen	88cd82e801	Update README.md	2021-10-28 02:35:01 +02:00
Patrick von Platen	e118db15d6	Update README.md	2021-10-28 01:59:27 +02:00
Patrick von Platen	01b1466983	[TPU tests] Enable first TPU examples pytorch (#14121 ) * up * up * fix * up * Update examples/pytorch/test_xla_examples.py * correct labels * up * up * up * up * up * up	2021-10-28 01:22:28 +02:00
Anton Lozhkov	232822f36d	Add DistilHuBERT (#14174 ) * Add conversion * Rename * Add an integration test and remove layer_norm * Remove layer_norm from the converter * wording * Fix imports	2021-10-27 20:17:31 +03:00
Lahfa Samy	e5b8ffb848	Replace assert of data/data_collator.py by ValueError (#14131 ) * Replace assert of data_collator.py by ValueError * Replace assert of data_collator.py by ValueError	2021-10-27 12:19:10 -04:00
Anton Lozhkov	25ceb81871	[Pipelines] Fix ASR model types check (#14178 )	2021-10-27 17:17:47 +03:00
Patrick von Platen	6200fd7bbc	[Gradient checkpointing] Enable for Deberta + DebertaV2 + SEW-D (#14175 ) * up * up * finish * up * final changes	2021-10-27 15:47:20 +02:00
Anton Lozhkov	e1dc5afd28	Add SEW CTC models (#14158 ) * Add SEW CTC models * Update paths * Update paths	2021-10-27 12:21:09 +03:00
Lysandre Debut	1e53faeb2e	Fix gelu test for torch 1.10 (#14167 )	2021-10-26 22:20:51 -04:00
Kamal Raj	8ddbfe9752	switch to inference_mode from no_gard (#13667 ) * switch to inference_mode from no_gard faster inference * added switch to support older version of pytorch	2021-10-26 18:02:58 -04:00
Emanuel Huber	ebd48c6de5	Replace assertions with ValueError exception (#14142 ) Updated masked-language modeling examples in pytorch with convention defined by #12789	2021-10-26 17:14:29 -04:00
Matthew Goldey	42bfb83d74	fix typos in error messages in speech recognition example and modelcard.py (#14166 ) * specify the text column name in the error message * pluralize the word fields	2021-10-26 16:36:26 -04:00
Jangwon Park	41dad89f70	chore: typo on ner accelerate example code (#14150 )	2021-10-26 16:23:41 -04:00
Lysandre	27c888db6c	Fix copies	2021-10-26 15:48:28 -04:00
Jay Zhang	3f23634a17	[ONNX] Add symbolic function for XSoftmax op for exporting to ONNX. (#14013 ) * Add symbolic function for XSoftmax op for exporting to ONNX. * Fix format issues. * Fix a CI issue relative to copies.	2021-10-26 15:25:02 -04:00
Patrick von Platen	9f3aa46f45	Add Unispeech & Unispeech-SAT (#13963 ) * unispeech * add copy from * remove hubert copy from * finish for today * add unispeech-sat * adapt more * up * up * up * up * add modeling * add tests * up * up * finish * up * Apply suggestions from code review * up * up * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * up * up Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-10-26 18:59:58 +02:00
Patrick von Platen	9799f4e150	Update README.md	2021-10-26 18:59:25 +02:00

1 2 3 4 5 ...

8253 Commits