transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-28 16:52:24 +06:00

Author	SHA1	Message	Date
Ian Castillo	ed70f24291	Add Spanish translation of converting_tensorflow_models.mdx (#18512 ) * Add file in spanish docs to be translated * Finish translation to Spanish * Improve Spanish wording * Add suggested changes from review	2022-08-08 15:53:43 -04:00
Rasmus Arpe Fogh Jensen	a765b68aa6	Update no_trainer.py scripts to include accelerate gradient accumulation wrapper (#18473 ) * Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script * make fixup changes * PR comments * changed input to Accelerator based on PR comment, ran make fixup * Added comment explaining the sync_gradients statement * Fixed lr scheduler max steps * Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper * Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper * Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script * make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py * removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script	2022-08-08 15:52:47 -04:00
Mishig Davaadorj	f1f5de31ed	Update perf_train_gpu_one.mdx (#18532 )	2022-08-08 20:33:34 +02:00
NielsRogge	82bb682643	[VideoMAE] Add model to doc tests (#18523 ) * Add videomae to doc tests * Add pip install decord Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-08 19:28:51 +02:00
Steven Liu	3632531ec6	Add example of multimodal usage to pipeline tutorial (#18498 ) * 📝 add example of multimodal usage to pipeline tutorial * 🖍 apply feedbacks * 🖍 apply niels feedback	2022-08-08 11:31:31 -05:00
Steven Liu	36b37990af	✨ update to use interlibrary links instead of Markdown (#18500 )	2022-08-08 10:53:52 -05:00
Yih-Dar	ec8d26248f	unpin resampy (#18527 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 17:44:10 +02:00
Sylvain Gugger	47e1676255	New cache fixes: add safeguard before looking in folders (#18522 )	2022-08-08 10:22:27 -04:00
Ankur Goyal	7495924007	Specify en in doc-builder README example (#18526 ) Co-authored-by: Ankur Goyal <ankur@impira.com>	2022-08-08 10:22:17 -04:00
Sylvain Gugger	aff5117f46	Remove debug statement	2022-08-08 09:54:10 -04:00
Sylvain Gugger	70b0d4e193	Fix compatibility with 1.12 (#17925 ) * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 09:53:08 -04:00
Sourab Mangrulkar	2fecde742d	update fsdp docs (#18521 ) * updating fsdp documentation * typo fix	2022-08-08 18:56:51 +05:30
Sylvain Gugger	377cdded7a	Clean up hub (#18497 ) * Clean up utils.hub * Remove imports * More fixes * Last fix	2022-08-08 08:48:10 -04:00
Nicolas Patry	a4562552eb	[DX fix] Fixing QA pipeline streaming a dataset. (#18516 ) * [DX fix] Fixing QA pipeline streaming a dataset. QuestionAnsweringArgumentHandler would iterate over the whole dataset effectively killing all properties of the pipeline. This restores nice properties when using `Dataset` or `Generator` since those are meant to be consumed lazily. * Handling TF better.	2022-08-08 14:25:56 +02:00
regisss	88a0ce57bb	Add seed setting to image classification example (#18519 )	2022-08-08 08:08:11 -04:00
Julien Chaumond	9129fd0377	`transformers-cli login` => `huggingface-cli login` (#18490 ) * zero chance anyone's using that constant no? * `transformers-cli login` => `huggingface-cli login` * `transformers-cli repo create` => `huggingface-cli repo create` * `make style`	2022-08-06 09:42:55 +02:00
Julien Chaumond	8d1f9039d0	Just re-reading the whole doc every couple of months 😬 (#18489 ) * Delete valohai.yaml * NLP => ML * typo * website supports https * datasets * 60k + modalities * unrelated link fixing for accelerate * Ok those links were actually broken * Fix link * Make `AutoTokenizer` auto-link * wording tweak * add at least one non-nlp task	2022-08-06 09:38:55 +02:00
Julien Chaumond	b8c247b6d0	Typo reported by Joel Grus on TWTR (#18493 )	2022-08-05 13:29:38 -04:00
Yih-Dar	38d656041b	disable Onnx test for google/long-t5-tglobal-base (#18454 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-05 19:27:19 +02:00
Sylvain Gugger	56a55d3ce4	Forgot one new_ for cache migration	2022-08-05 13:24:53 -04:00
Yih-Dar	9d64f7f00c	Update some expected values in `quicktour.mdx` for `resampy 0.3.0` (#18484 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-05 19:17:51 +02:00
Sylvain Gugger	faacdf007b	Move cache folder to huggingface/hub for consistency with hf_hub (#18492 ) * Move cache folder to just huggingface * Thank you VsCode for this needless import * Move to hub * Forgot one	2022-08-05 13:14:00 -04:00
Yih-Dar	280db2e39c	Fix `test_dbmdz_english` by updating expected values (#18482 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-05 16:49:54 +02:00
Sylvain Gugger	5cd4032368	Use new huggingface_hub tools for download models (#18438 ) * Draft new cached_file * Initial draft for config and model * Small fixes * Fix first batch of tests * Look in cache when internet is down * Fix last tests * Bad black, not fixing all quality errors * Make diff less * Implement change for TF and Flax models * Add tokenizer and feature extractor * For compatibility with main * Add utils to move the cache and auto-do it at first use. * Quality * Deal with empty commit shas * Deal with empty etag * Address review comments	2022-08-05 10:12:40 -04:00
Sylvain Gugger	70fa1a8d26	Fix pipeline tests (#18487 ) * Fix pipeline tests * Make sure all pipelines tests run with init changes	2022-08-05 09:14:51 -04:00
Sylvain Gugger	c7849d9efc	Remove py.typed (#18485 )	2022-08-05 09:12:19 -04:00
Yih-Dar	893122f666	Add TF prefix to TF-Res test class (#18481 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-05 13:59:55 +02:00
Seunghwan Hong	bf174f916b	Refactor `TFSwinLayer` to increase serving compatibility (#18352 ) * Refactor `TFSwinLayer` to increase serving compatibility Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr> * Fix missed parameters while refactoring Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr> * Fix window_reverse to calculate batch size Signed-off-by: Seunghwan Hong <harrydrippin@gmail.com> Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2022-08-05 07:40:14 -04:00
Seunghwan Hong	575aa6ef1a	Fix TFSwinSelfAttention to have relative position index as non-trainable weight (#18226 ) Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>	2022-08-05 07:39:40 -04:00
Nicolas Patry	586dcf6b21	Fixing issue where generic model types wouldn't load properly with the pipeline (#18392 ) * Adding a better error message when the model is improperly configured within transformers. * Update src/transformers/pipelines/__init__.py * Black version. * Overriding task aliases so that tokenizer+feature_extractor values are correct. * Fixing task aliases by overriding their names early * X. * Fixing feature-extraction. * black again. * Normalizing `translation` too. * Fixing last few corner cases. translation need to use its non normalized name (translation_XX_to_YY, so that the task_specific_params are correctly overloaded). This can be removed and cleaned up in a later PR. `speech-encode-decoder` actually REQUIRES to pass a `tokenizer` manually so the error needs to be discarded when the `tokenizer` is already there. * doc-builder fix. * Fixing the real issue. * Removing dead code. * Do not import the actual config classes.	2022-08-05 08:45:07 +02:00
Yih-Dar	14928921e2	Add `TF_MODEL_FOR_SEMANTIC_SEGMENTATION_MAPPING` (#18469 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-04 20:41:15 +02:00
Kian Sierra McGettigan	0bf1e1aca4	Update no trainer examples for QA and Semantic Segmentation (#18474 ) * swag_no_trainer updated with gather_metrics * Removed unused variable samples_seen * updated examples with gather_for_metrics	2022-08-04 13:22:19 -04:00
Yih-Dar	d2704c4143	Add machine type in the artifact of Examples directory job (#18459 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-04 18:52:01 +02:00
NielsRogge	f9a0008d2d	Add VideoMAE (#17821 ) * First draft * Add VideoMAEForVideoClassification * Improve conversion script * Add VideoMAEForPreTraining * Add VideoMAEFeatureExtractor * Improve VideoMAEFeatureExtractor * Improve docs * Add first draft of model tests * Improve VideoMAEForPreTraining * Fix base_model_prefix * Make model take pixel_values of shape (B, T, C, H, W) * Add loss computation of VideoMAEForPreTraining * Improve tests * Improve model tests * Make all tests pass * Add VideoMAE to main README * Add tests for VideoMAEFeatureExtractor * Add integration test * Improve conversion script * Rename patch embedding class * Remove VideoMAELayer from init * Update design of patch embeddings * Improve comments * Improve conversion script * Improve conversion script * Add conversion of pretrained model * Add loss verification of pretrained model * Add loss verification of unnormalized targets * Add integration test for pretraining model * Apply suggestions from code review * Fix bug to make feature extractor resize only shorter edge * Address more comments * Improve normalization of videos * Add doc examples * Move constants to dedicated script * Remove scripts * Transfer checkpoints, fix docs * Update script * Update image mean and std * Fix doc tests * Set return_tensors to NumPy by default * Revert the previous change Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-04 18:02:55 +02:00
Thomas Wang	672b66262a	Add FX support for torch.baddbmm and torch.Tensor.baddbmm (#18363 )	2022-08-04 16:02:16 +02:00
Sylvain Gugger	df28de0581	Fix load of model checkpoints in the Trainer (#18470 )	2022-08-04 08:22:25 -04:00
Kian Sierra McGettigan	330247ede2	Update no trainer scripts for multiple-choice (#18468 ) * swag_no_trainer updated with gather_metrics * Removed unused variable samples_seen	2022-08-04 07:29:32 -04:00
Michael Benayoun	c74befc9e3	HFTracer.trace can now take callables and torch.nn.Module (#18457 ) * Enable HFTracer to trace with custom dummy inputs instead of pre-computed ones * Add HFTracer.trace docstring, and make it possible to handle callable and torch.nn.Module in general * Remove pdb comment * Apply suggestions	2022-08-04 13:29:18 +02:00
nlpcat	fc1d841b2d	change shape to support dynamic batch input in tf.function XLA generate for tf serving (#18372 ) * change shape to support dynamic batch input in tf.generate * add tests Co-authored-by: nlpcatcode <nlpcodecat@gmail.com>	2022-08-04 11:26:11 +01:00
Thomas Wang	b69a62d579	[BLOOM] Clean modeling code (#18344 ) * Cleanup some code * Improve signatures * Try to reduce the number of reshape/copies * I don't think we actually need the layer_num scaling trick * No need for duplication * Try to fix beam_search * Fix beam search * Removing layer num normalization seems to be breaking * Not sure self.layer_number normalization actually matters * Try and be backward compatible * Try to fix beam_search * Revert attempt to be backward compatible * Improve documentation on past_key_values format * Optimize the device allocation in case of hidden_states in multiple devices * No need to manually cast the values to a specific device * Rename with long version of variables * Improve type hinting * Add comment that explains that some methods return views * Actually i think the attention casting only makes sense when we use torch.float16 * We don't actually need layer_number to be passed anymore * Fix FX test * Bypass torch.baddbmm * Apply suggestions from code review * Add comment about support for torchScript v1.11 * fix ONNX support for bloom (#18456) Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com> Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>	2022-08-04 11:08:03 +02:00
LSinev	02b176c4ce	Fix torch version comparisons (#18460 ) Comparisons like version.parse(torch.__version__) > version.parse("1.6") are True for torch==1.6.0+cu101 or torch==1.6.0+cpu. Comparisons using version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py).	2022-08-03 13:37:18 -04:00
Sayak Paul	be41eaf55f	fix: keras fit tests for segformer tf and minor refactors. (#18412 ) * fix: keras fit tests for segformer tf and minor refactors. * refactor: test_keras_fit to make it simpler using the existing one. * fix: styling issues.	2022-08-03 16:39:54 +01:00
Alara Dirik	fc546332d7	add zero-shot obj detection notebook to docs (#18453 )	2022-08-03 17:14:39 +03:00
Daniel Suess	8fb7c908c8	Fix failing tests for XLA generation in TF (#18298 ) * Fix failing test_xla_generate_slow tests * Fix failing speech-to-text xla_generate tests	2022-08-03 09:45:15 -04:00
Omar Sanseviero	a507908cd3	Update pinned hub version (#18448 ) * Update pinned hub version * Make style	2022-08-03 08:37:42 -04:00
Ritik Nandwal	3db4378bd7	Update no trainer scripts for language modeling and image classification examples (#18443 ) * Update no_trainer script for image-classification * Update no_trainer scripts for language-modeling examples * Remove unused variable * Removing truncation from losses array for language modeling examples	2022-08-03 08:33:18 -04:00
Ian Castillo	10e1ec9a8c	Add Spanish translation of run_scripts.mdx (#18415 ) * Add file in spanish docs to be translated * Translate first two sections to Spanish * Translate four additional sections to Spanish * Finish translation to Spanish * Improve writing style in Spanish * Add suggested changes from reviewer	2022-08-03 07:32:20 -04:00
Gary Miguel	9d7b70bcd7	support ONNX export of XDropout in deberta{,_v2} and sew_d (#17502 ) * support ONNX export of XDropout in deberta{,_v2} * black * copy to sew_d * add test * isort * use pytest.mark.filterwarnings * review comments	2022-08-03 06:33:44 -04:00
Steven Liu	92915ebec2	Update _toctree.yml (#18440 ) This PR moves GroupViT and LXMert to their correct sections. As pointed out by @NielsRogge and @LysandreJik, GroupViT and LXMert are both multimodal models.	2022-08-03 12:26:01 +02:00
Sourab Mangrulkar	22a0dd2ef7	fixing error when using sharded ddp (#18435 )	2022-08-03 08:39:58 +05:30
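
The version-comparison pitfall described in commit 02b176c4ce can be reproduced without installing torch. The sketch below is an illustration only: it assumes the `packaging` library is installed, uses a hard-coded string in place of `torch.__version__`, and the helper name `is_strictly_newer` is hypothetical (it is not the helper in pytorch_utils.py).

```python
from packaging import version

# Hypothetical stand-in for torch.__version__ on a CUDA build.
raw_version = "1.6.0+cu101"

# Naive comparison: the local segment "+cu101" makes 1.6.0+cu101 sort
# after plain 1.6, so a "newer than 1.6" check passes unexpectedly.
naive_is_newer = version.parse(raw_version) > version.parse("1.6")

# Comparing on base_version drops the local segment first.
base = version.parse(version.parse(raw_version).base_version)
base_is_newer = base > version.parse("1.6")


def is_strictly_newer(raw: str, minimum: str) -> bool:
    """Hypothetical helper: compare only the base version of `raw` to `minimum`."""
    return version.parse(version.parse(raw).base_version) > version.parse(minimum)


print(naive_is_newer, base_is_newer)
```

Note that `base_version` also discards pre-release, post-release, and dev segments, so a string such as "1.7.0rc1" is treated as 1.7.0 under this approach.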

15053 Commits