transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
statelesshz	9c875839c0	add ascend npu accelerator support (#24879 ) * Add Ascend NPU accelerator support * fix style warining	2023-07-18 08:20:32 -04:00
Yih-Dar	f14c7f999d	Fix CircleCI cache (#24880 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-18 13:45:00 +02:00
Younes Belkada	ca974aff0f	[`Docs`] Clarify 4bit docs (#24878 ) * clarify 4bit docs * Apply suggestions from code review Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> --------- Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2023-07-18 13:39:08 +02:00
Yih-Dar	2ab75add4b	Remove `tests/onnx` (#24868 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 22:37:28 +02:00
Sylvain Gugger	d561408cc3	Skip Add model like job (#24865 )	2023-07-17 15:52:04 -04:00
Yih-Dar	870dfc15b2	Skip failing `ZeroShotAudioClassificationPipelineTests::test_small_model_pt` for now (#24867 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 15:51:50 -04:00
Marc Sun	9dc965bb40	deprecate no_cuda (#24863 ) * deprecate no_cuda * style * remove doc * remove doc 2 * fix style	2023-07-17 14:52:28 -04:00
statelesshz	0f4502d335	Remove deprecated codes (#24837 ) * remove `xpu_backend` training argument * always call `contextlib.nullcontext()` since transformers updated to python3.8 * these codes will not be executed	2023-07-17 14:45:59 -04:00
Yih-Dar	eeaa9c016a	Make CLIP model could use new added tokens with meaningful pooling (#24777 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 20:35:20 +02:00
Syed Salman Habeeb Quadri	d0154015f7	Replace assert statements with exceptions (#24856 ) * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is value error. Fixed the same. * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is ValueError. Fixed the same. Note: While raising the ValueError args are passed to it, but later added again while handling the error (See the code snippet) * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is ValueError. Fixed the same. Note: While raising the ValueError args are passed to it, but later added again while handling the error (See the code snippet) * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed assert statement to ValueError based * Changed assert statement to ValueError based * Changed assert statement to ValueError based * Changed incorrect error handling from AssertionError to ValueError * Undoed change from AssertionError to ValueError as it is not needed * Reverted back to using AssertionError as it is not necessary to make it into ValueError * Fixed erraneous comparision Changed == to != * Fixed erraneous comparision Changed == to != * formatted the code * Ran make fix-copies	2023-07-17 14:32:44 -04:00
Sylvain Gugger	12b908c659	Fix the fetch of all example tests (#24864 )	2023-07-17 14:10:13 -04:00
Sylvain Gugger	e9ad51306f	4.32.0.dev0	2023-07-17 13:30:44 -04:00
Sylvain Gugger	49eb357564	Fix token pass (#24862 ) * Fix how token is passed along in from_pretrained for tokenizers * It's actually not necessary	2023-07-17 13:27:11 -04:00
Yoach Lacombe	f42a35e611	Add bark (#24086 ) * first raw version of the bark integration * working code on small models with single run * add converting script from suno weights 2 hf * many changes * correct past_kv output * working implementation for inference * update the converting script according to the architecture changes * add a working end-to-end inference code * remove some comments and make small changes * remove unecessary comment * add docstrings and ensure no unecessary intermediary output during audio generation * remove done TODOs * make style + add config docstrings * modification for batch inference support on the whole model * add details to .generation_audio method * add copyright * convert EncodecModel from original library to transformers implementation * add two class in order to facilitate model and sub-models loading from the hub * add support of loading the whole model * add BarkProcessor * correct modeling according to processor output * Add proper __init__ and auto support * Add up-to-date copyright/license message * add relative import instead of absolute * cleaner head_dim computation * small comment removal or changes * more verbose LayerNorm init method * specify eps for clearer comprehension * more verbose variable naming in the MLP module * remove unecessary BarkBlock parameter * clearer code in the forward pass of the BarkBlock * remove _initialize_modules method for cleaner code * Remove unnecessary methods from sub-models * move code to remove unnecessary function * rename a variable for clarity and change an assert * move code and change variable name for clarity * remove unnecessary asserts * correct small bug * correct a comment * change variable names for clarity * remove asserts * change import from absolute to relative * correct small error due to comma missing + correct import * Add attribute Bark config * add first version of tests * update attention_map * add tie_weights and resize_token_embeddings for fineModel * correct getting attention_mask in generate_text_semantic * remove Bark inference trick * leave more choices in barkProcessor * remove _no_split_modules * fixe error in forward of block and introduce clearer notations * correct converting script with last changes * make style + add draft bark.mdx * correct BarkModelTest::test_generate_text_semantic * add Bark in main README * add dummy_pt_objects for Bark * add missing models in the main init * correct test_decoder_model_past_with_large_inputs * disable torchscript test * change docstring of BarkProcessor * Add test_processor_bark * make style * correct copyrights * add bark.mdx + make style, quality and consistency * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Remove unnecessary test method * simply logic of a test * Only check first ids for slow audio generation * split full end-to-end generation tests * remove unneccessary comment * change submodel names for clearer naming * remove ModuleDict from modeling_bark * combine two if statements * ensure that an edge misued won't happen * modify variable name * move code snippet to the right place (coarse instead of semantic) * change BarkSemanticModule -> BarkSemanticModel * align BarkProcessor with transformers paradigm * correct BarkProcessor tests with last commit changes * change _validate_voice_preset to an instance method instead of a class method * tie_weights already called with post_init * add codec_model config to configuration * update bark modeling tests with recent BarkProcessor changes * remove SubModelPretrainedModel + change speakers embeddings prompt type in BarkModel * change absolute imports to relative * remove TODO * change docstrings * add examples to docs and docstrings * make style * uses BatchFeature in BarkProcessor insteads of dict * continue improving docstrings and docs + make style * correct docstrings examples * more comprehensible speaker_embeddings load/Save * rename speaker_embeddings_dict -> speaker_embeddings * correct bark.mdx + add bark to documentation_tests * correct docstrings configuration_bark * integrate last nit suggestions * integrate BarkGeneration configs * make style * remove bark tests from documentation_tests.txt because timeout - tested manually * add proper generation config initialization * small bark.mdx documentation changes * rename bark.mdx -> bark.md * add torch.no_grad behind BarkModel.generate_audio() * replace assert by ValueError in convert_suno_to_hf.py * integrate a series of short comments from reviewer * move SemanticLogitsProcessors and remove .detach() from Bark docs and docstrings * actually remove SemanticLogitsProcessor from modeling_bark.oy * BarkProcessor returns a single output instead of tuple + correct docstrings * make style + correct bug * add initializer_range to BarkConfig + correct slow modeling tests * add .clone() to history_prompt.coarse_prompt to avoid modifying input array * Making sure no extra "`" are present * remove extra characters in modeling_bark.py * Correct output if history_prompt is None * remove TODOs * remove ravel comment * completing generation_configuration_bark.py docstrings * change docstrings - number of audio codebooks instead of Encodec codebooks * change 'bias' docstrings in configuration_bark.py * format code * rename BarkModel.generate_audio -> BarkModel.generate_speech * modify AutoConfig instead of EncodecConfig in BarkConfig * correct AutoConfig wrong init * refactor BarkModel and sub-models generate_coarse, generate_fine, generate_text_semantic * remove SemanticLogitsProcessor and replace it with SuppressTokensLogitsProcessor * move nb_codebook related config arguments to BarkFineConfig * rename bark.mdx -> bark.md * correcting BarkModelConfig from_pretrained + remove keys_to_ignore * correct bark.md with correct hub path * correct code bug in bark.md * correct list tokens_to_suppress * modify Processor to load nested speaker embeddings in a safer way * correct batch sampling in BarkFineModel.generate_fine * Apply suggestions from code review Small docstrings correction and code improvements Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * give more details about num_layers in docstrings * correct indentation mistake * correct submodelconfig order of docstring variables * put audio models in alphabetical order in utils/check_repo.my * remove useless line from test_modeling_bark.py * makes BarkCoarseModelTest inherits from (ModelTesterMixin, GenerationTesterMixin, unittest.TestCase) instead of BarkSemanticModelTest * make a Tester class for each sub-model instead of inheriting * add test_resize_embeddings=True for Bark sub-models * add Copied from transformers.models.gpt_neo.modeling_gpt_neo.GPTNeoSelfAttention._split_heads * remove 'Copied fom Bark' comment * remove unneccessary comment * change np.min -> min in modeling_bark.py * refactored all custom layers to have Bark prefix * add attention_mask as an argument of generate_text_semantic * refactor sub-models start docstrings to have more precise config class definition * move _tied_weights_keys overriding * add docstrings to generate_xxx in modeling_bark.py * add loading whole BarkModel to convert_suno_to_hf * refactor attribute and variable names * make style convert_suno * update bark checkpoints * remove never entered if statement * move bark_modeling docstrings after BarkPretrainedModel class definition * refactor modeling_bark.py: kv -> key_values * small nits - code refactoring and removing unecessary lines from _init_weights * nits - replace inplace method by variable assigning * remove optional when necessary * remove some lines in generate_speech * add default value for optional parameter * Refactor preprocess_histories_before_coarse -> preprocess_histories Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * correct usage after refactoring * refactor Bark's generate_xxx -> generate and modify docstrings and tests accordingly * update docstrings python in configuration_bark.py * add bark files in utils/documentation_test.txt * correct docstrings python snippet * add the ability to use parameters in the form of e.g coarse_temperature * add semantic_max_new_tokens in python snippet in docstrings for quicker generation * Reformate sub-models kwargs in BakModel.generate Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * correct kwargs in BarkModel.generate * correct attention_mask kwarg in BarkModel.generate * add tests for sub-models args in BarkModel.generate and correct BarkFineModel.test_generate_fp16 * enrich BarkModel.generate docstrings with a description of how to use the kwargs --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-17 17:53:24 +01:00
Sylvain Gugger	c21c3737c1	Add TAPEX to the list of deprecated models (#24859 ) * Add TAPEX to the list of deprecated models * Add check * Fix typo * Fix import path for Van conversion	2023-07-17 12:53:03 -04:00
Younes Belkada	054e802914	fix broken links in READMEs (#24861 ) fix MRA in READMEs	2023-07-17 18:47:14 +02:00
bofeng huang	c965d30279	Fix comments for `_merge_heads` (#24855 ) * Fix comments * Fix comments	2023-07-17 11:07:16 -04:00
Yih-Dar	e4a52b6a15	Fix `is_vision_available` (#24853 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 16:58:51 +02:00
Samin Yasar	4f08887053	Add Multimodal heading and Document question answering in task_summary.mdx (#23318 ) * add multimodal heading and docqa * fix sentence * task_summary data type = modality clarification * change the multimodal example to a smaller model	2023-07-17 13:51:19 +01:00
dependabot[bot]	38dfb86958	Bump cryptography from 41.0.0 to 41.0.2 in /examples/research_projects/decision_transformer (#24833 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.0 to 41.0.2. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/41.0.0...41.0.2) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-07-17 07:17:17 -04:00
namespace-Pt	18d42bfd23	Remove unused code in GPT-Neo (#24826 ) 1	2023-07-17 07:07:47 -04:00
Sohyun Sim	9771ad33be	🌐 [i18n-KO] Translated `custom_tools.mdx` to Korean (#24580 ) * docs: ko: custom_tools.mdx * feat: deepl draft * fix: change .mdx to .md * fix: resolve suggestions * fix: resolve suggestions	2023-07-17 07:04:10 -04:00
statelesshz	8ba26c18cf	deprecate `sharded_ddp` training argument (#24825 ) * deprecate fairscale's ShardedDDP * fix code style * roll back * deprecate the `sharded_ddp` training argument --------- Co-authored-by: jihuazhong <jihuazhong1@huawei.com>	2023-07-17 06:57:42 -04:00
Kadir Nar	5bb4430edc	[🔗 Docs] Fixed Incorrect Migration Link (#24793 ) * [🔗 Docs] Fixed Incorrect Migration Link * Update README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-07-14 17:47:50 -04:00
Sylvain Gugger	1023705440	Check models used for common tests are small (#24824 ) * First models * Conditional DETR * Treat DETR models, skip others * Skip LayoutLMv2 as well * Fix last tests	2023-07-14 14:43:19 -04:00
Dario Sučić	a865b62e07	set correct model input names for gptsw3tokenizer (#24788 )	2023-07-14 18:13:45 +01:00
Nicolas Patry	50726f9ea7	Fixing double `use_auth_token.pop` (preventing private models from being visible). (#24812 ) Fixing double `use_auth_token.pop` (preventing private models from being visible). Should fix: https://github.com/huggingface/transformers/issues/14334#issuecomment-1634527833 Repro: Have a private repo, with `vocab.json` (spread out files for the tokenizer) and use `AutoTokenizer.from_pretrained(..., use_auth_token="token")`.	2023-07-14 15:20:02 +02:00
Sylvain Gugger	91d7df58b6	Copy code when using local trust remote code (#24785 ) * Copy code when using local trust remote code * Remote upgrade strategy * Revert "Remote upgrade strategy" This reverts commit `4f0392f5d7`.	2023-07-13 16:57:20 -04:00
Sylvain Gugger	f32303d519	Run hub tests (#24807 ) * Run hub tests * [all-test] Run tests please! * [all-test] Add vision dep for hub tests * Fix tests	2023-07-13 15:25:45 -04:00
Fady Nakhla	9d7a0871e2	Use _BaseAutoModelClass's register method (#24810 ) Switching _BaseAutoModelClass from_pretrained and from_config to use the register classmethod that it defines rather than using the _LazyAutoMapping register method directly. This makes use of the additional consistency check within the base model's register.	2023-07-13 15:24:51 -04:00
Georgie Mathews	0866705022	Update setup.py to be compatible with pipenv (#24789 )	2023-07-13 12:56:43 -04:00
Matt	c0ca73dc98	Remove Falcon docs for the release until TGI is ready (#24808 ) * Remove Falcon docs for the release until TGI is ready * Update toctree	2023-07-13 17:27:58 +01:00
dymil	f9a711df4a	Fix typo 'submosules' (#24809 )	2023-07-13 16:56:53 +01:00
amyeroberts	eebce4470c	Add accelerate version in transformers-cli env (#24806 ) * Add accelerate version in transformers-cli env * Add accelerate config	2023-07-13 16:50:19 +01:00
Joao Gante	34d9409427	Llama/GPTNeoX: add RoPE scaling (#24653 ) * add rope_scaling * tmp commit * add gptneox * add tests * GPTNeoX can now handle long inputs, so the pipeline test was wrong * Update src/transformers/models/open_llama/configuration_open_llama.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove ntk * remove redundant validation --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-07-13 16:47:30 +01:00
Sylvain Gugger	9342c8fb82	Deprecate models (#24787 ) * Deprecate some models * Fix imports * Fix inits too * Remove tests * Add deprecated banner to documentation * Remove from init * Fix auto classes * Style * Remote upgrade strategy 1 * Remove site package cache * Revert this part * Fix typo... * Update utils * Update docs/source/en/model_doc/bort.md Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * With all files saved --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2023-07-13 11:46:54 -04:00
Yih-Dar	717dadc6f3	Skip torchscript tests for `MusicgenForConditionalGeneration` (#24782 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-13 15:54:18 +02:00
amyeroberts	e367a9770f	Fix MobileVitV2 doctest checkpoint (#24805 ) * Fix doctest checkpoint * Add import torch for mobilevit	2023-07-13 14:47:59 +01:00
Yih-Dar	e538189931	Upgrade jax/jaxlib/flax pin versions (#24791 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-13 13:57:30 +02:00
Bram Vanroy	6ba4d5de3a	[DOC] Clarify relationshi load_best_model_at_end and save_total_limit (#24614 ) * Update training_args.py Clarify the relationship between `load_best_model_at_end` and `save_total_limit`. * fix: faulty quotes * make quality * Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * DOCS: add explicit `True` * DOCS: make style/quality --------- Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-13 07:36:16 -04:00
SeongBeomLEE	21946a8cf4	[fix] Change the condition of ValueError in "convert_checkpoint_from_transformers_to_megatron" (#24769 ) * fix: half inference error norm_factor is still torch.float32 after using model.half So I changed it to register_buffer so I can change it to torch.float16 after using model.half * fix: Added a variable "persistent=False" * run make style * [fix] Change the condition of ValueError convert_checkpoint_from_transformers_to_megatron * [fix] error wording layers -> attention heads	2023-07-13 11:57:56 +01:00
Liyang90	1f6f32c243	Removing unnecessary `device=device` in modeling_llama.py (#24696 ) * Update modeling_llama.py Removing unnecessary `device=device` * fix in all occurrences of _make_causal_mask	2023-07-13 10:30:22 +01:00
Yih-Dar	906afa1d5c	Revert "Unpin protobuf in docker file (for daily CI)" (#24800 ) Revert "Unpin protobuf in docker file (for daily CI) (#24761)" This reverts commit `45025d92f8`.	2023-07-13 04:19:45 +02:00
Zach Mueller	f1732e1374	Rm duplicate pad_across_processes (#24780 ) Rm duplicate	2023-07-12 11:47:21 -04:00
Lysandre Debut	cfc8a05305	Remove WWT from README (#24672 )	2023-07-12 10:58:08 -04:00
Pedro Cuenca	395e566a42	gpt-bigcode: avoid `zero_` to support Core ML (#24755 ) gpt-bigcode: avoid `zeros_` to support Core ML. In-place `zeros_` is not supported by the Core ML conversion process. This PR replaces it with `zeros_like` so conversion can proceed. The change only affects a workaround for a PyTorch bug on the `cpu` device.	2023-07-12 16:38:25 +02:00
Zach Mueller	0284285501	Fix pad across processes dim in trainer and not being able to set the timeout (#24775 ) * dim, and rm copy * Don't rm copy for now * Oops * pad index * Should be a working test * Tickle down ddp timeout * Put fix back in now that testing locally is done * Better comment specifying timeout Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-12 10:01:51 -04:00
Yih-Dar	4f85aaa6c9	Update default values of bos/eos token ids in `CLIPTextConfig` (#24773 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-12 13:50:26 +02:00
Bauke Brenninkmeijer	fc9e387dc0	Replacement of 20 asserts with exceptions (#24757 ) * initial replacements of asserts with errors/exceptions * replace assert with exception in generation, align and bart * reset formatting change * reset another formatting issue * Apply suggestion Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * don't touch this file * change to 'is not False' * fix type --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-12 07:45:09 -04:00
Joao Gante	430a04a75a	Docs: Update logit processors __call__ docs (#24729 ) * tmp commit * __call__ docs * kwargs documented; shorter input_ids doc * nit * Update src/transformers/generation/logits_process.py	2023-07-12 12:21:30 +01:00

1 2 3 4 5 ...

13436 Commits