transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 02:28:24 +06:00

Author	SHA1	Message	Date
Stas Bekman	6112b1c644	[doc] `image_processing_vilt.py` wrong default documented (#24931 ) [doc] image_processing_vilt.py wrong default	2023-07-19 13:57:40 -07:00
Younes Belkada	ee4250a35f	[`Llama2`] replace `self.pretraining_tp` with `self.config.pretraining_tp` (#24906 ) * add possibility to disable TP * fixup * adapt from offline discussions	2023-07-19 14:26:27 +02:00
Travis Cline	3a43794dd6	Fix minor llama2.md model doc typos (#24909 ) Update llama2.md Fix typos in the llama2 model doc	2023-07-19 08:13:14 -04:00
lee1jun	99c1268e0a	fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST (#24902 ) fix typo in BARK_PRETRAINED_MODEL_ARCHIVE_LIST suno/barh should be suno/bark	2023-07-19 07:35:04 -04:00
Madhava Jay	aa4afa67f3	Fixed issue where ACCELERATE_USE_CPU="False" results in bool(True) (#24907 ) - This results in cpu mode on Apple Silicon mps	2023-07-19 07:30:01 -04:00
Yih-Dar	243b2ea3fd	Fix `test_model_parallelism` for `FalconModel` (#24914 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-19 13:18:16 +02:00
Eliah Kagan	c035970212	Update tested versions in READMEs (#24895 ) * Update supported Python and PyTorch versions in readme * Update Python, etc. versions in non-English readmes These were more out of date than in the English readme. This updates all the versions the readmes claim the repository is tested with to the same versions stated in the English readme. Those versions are current at least in the case of the Python and PyTorch versions (and less out of date for the others). * Propagate trailing whitespace fix to model list This runs "make fix-copies". The only change is the removal of whitespace. No actual information or wording is changed. * Update tested TensorFlow to 2.6 in all readmes Per pinning in setup.py Unlike Python and PyTorch, the minimum supported TensorFlow version has not very recently changed, but old versions were listed in all READMEs.	2023-07-19 07:17:34 -04:00
Yih-Dar	129cb6d523	Avoid some pipeline tasks to use `use_cache=True` (#24893 ) * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-19 09:49:52 +02:00
Zach Mueller	476be08c4a	Check for accelerate env var when doing CPU only (#24890 ) Check for use-cpu	2023-07-18 18:40:37 -04:00
Zach Mueller	a982c0225e	Disable ipex env var if false (#24885 ) Disable ipex if in use	2023-07-18 16:07:02 -04:00
Arthur	07360b6c9c	[`Llama2`] Add support for Llama 2 (#24891 ) * add llama * add other readmes * update padding id in readme * add link to paper * fix paths and tokenizer * more nits * styling * fit operation in 2 lines when possible * nits * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add form * update reademe * update readme, we don't have a default pad token * update test and tokenization * LLaMA instead of Llama * nits * add expected text * add greeedy output * styling * Update src/transformers/models/llama/modeling_llama.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * sequential device map * skip relevant changes --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-18 15:18:31 -04:00
Yih-Dar	30c172fc20	Separate CircleCI cache between `main` and `pull` (or other branches) (#24886 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-18 21:05:26 +02:00
Hwijeen Ahn	dd49404a89	check if eval dataset is dict (#24877 ) * check if eval dataset is dict * formatting	2023-07-18 13:33:41 -04:00
Younes Belkada	5c5cb4eeb2	[`Blip`] Fix blip output name (#24889 ) * fix blip output name * add property * oops * fix failing test	2023-07-18 19:30:27 +02:00
Younes Belkada	a9e067a45c	[`InstructBlip`] Fix int8/fp4 issues (#24888 ) * fix dtype issue * revert `.float()` * fix copies	2023-07-18 19:24:36 +02:00
NielsRogge	3ec10e6c76	Add DINOv2 (#24016 ) * First draft * More improvements * Convert patch embedding layer * Convert all weights * Make conversion work * Improve conversion script * Fix style * Make all tests pass * Add image processor to auto mapping * Add swiglu ffn * Add image processor to conversion script * Fix conversion of giant model * Fix documentation * Fix style * Fix tests * Address comments * Address more comments * Remove unused arguments * Remove more arguments * Rename parameters * Include mask token * Address comments * Add docstring * Transfer checkpoints * Empty commit	2023-07-18 15:34:06 +01:00
Yih-Dar	57da42ad05	Enable `ZeroShotAudioClassificationPipelineTests::test_small_model_pt` (#24882 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-18 15:08:53 +02:00
statelesshz	9c875839c0	add ascend npu accelerator support (#24879 ) * Add Ascend NPU accelerator support * fix style warining	2023-07-18 08:20:32 -04:00
Yih-Dar	f14c7f999d	Fix CircleCI cache (#24880 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-18 13:45:00 +02:00
Younes Belkada	ca974aff0f	[`Docs`] Clarify 4bit docs (#24878 ) * clarify 4bit docs * Apply suggestions from code review Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> --------- Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2023-07-18 13:39:08 +02:00
Yih-Dar	2ab75add4b	Remove `tests/onnx` (#24868 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 22:37:28 +02:00
Sylvain Gugger	d561408cc3	Skip Add model like job (#24865 )	2023-07-17 15:52:04 -04:00
Yih-Dar	870dfc15b2	Skip failing `ZeroShotAudioClassificationPipelineTests::test_small_model_pt` for now (#24867 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 15:51:50 -04:00
Marc Sun	9dc965bb40	deprecate no_cuda (#24863 ) * deprecate no_cuda * style * remove doc * remove doc 2 * fix style	2023-07-17 14:52:28 -04:00
statelesshz	0f4502d335	Remove deprecated codes (#24837 ) * remove `xpu_backend` training argument * always call `contextlib.nullcontext()` since transformers updated to python3.8 * these codes will not be executed	2023-07-17 14:45:59 -04:00
Yih-Dar	eeaa9c016a	Make CLIP model could use new added tokens with meaningful pooling (#24777 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 20:35:20 +02:00
Syed Salman Habeeb Quadri	d0154015f7	Replace assert statements with exceptions (#24856 ) * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is value error. Fixed the same. * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is ValueError. Fixed the same. Note: While raising the ValueError args are passed to it, but later added again while handling the error (See the code snippet) * Changed AssertionError to ValueError try-except block was using AssesrtionError in except statement while the expected error is ValueError. Fixed the same. Note: While raising the ValueError args are passed to it, but later added again while handling the error (See the code snippet) * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed AssertionError to ValueError * Changed assert statement to ValueError based * Changed assert statement to ValueError based * Changed assert statement to ValueError based * Changed incorrect error handling from AssertionError to ValueError * Undoed change from AssertionError to ValueError as it is not needed * Reverted back to using AssertionError as it is not necessary to make it into ValueError * Fixed erraneous comparision Changed == to != * Fixed erraneous comparision Changed == to != * formatted the code * Ran make fix-copies	2023-07-17 14:32:44 -04:00
Sylvain Gugger	12b908c659	Fix the fetch of all example tests (#24864 )	2023-07-17 14:10:13 -04:00
Sylvain Gugger	e9ad51306f	4.32.0.dev0	2023-07-17 13:30:44 -04:00
Sylvain Gugger	49eb357564	Fix token pass (#24862 ) * Fix how token is passed along in from_pretrained for tokenizers * It's actually not necessary	2023-07-17 13:27:11 -04:00
Yoach Lacombe	f42a35e611	Add bark (#24086 ) * first raw version of the bark integration * working code on small models with single run * add converting script from suno weights 2 hf * many changes * correct past_kv output * working implementation for inference * update the converting script according to the architecture changes * add a working end-to-end inference code * remove some comments and make small changes * remove unecessary comment * add docstrings and ensure no unecessary intermediary output during audio generation * remove done TODOs * make style + add config docstrings * modification for batch inference support on the whole model * add details to .generation_audio method * add copyright * convert EncodecModel from original library to transformers implementation * add two class in order to facilitate model and sub-models loading from the hub * add support of loading the whole model * add BarkProcessor * correct modeling according to processor output * Add proper __init__ and auto support * Add up-to-date copyright/license message * add relative import instead of absolute * cleaner head_dim computation * small comment removal or changes * more verbose LayerNorm init method * specify eps for clearer comprehension * more verbose variable naming in the MLP module * remove unecessary BarkBlock parameter * clearer code in the forward pass of the BarkBlock * remove _initialize_modules method for cleaner code * Remove unnecessary methods from sub-models * move code to remove unnecessary function * rename a variable for clarity and change an assert * move code and change variable name for clarity * remove unnecessary asserts * correct small bug * correct a comment * change variable names for clarity * remove asserts * change import from absolute to relative * correct small error due to comma missing + correct import * Add attribute Bark config * add first version of tests * update attention_map * add tie_weights and resize_token_embeddings for fineModel * correct getting attention_mask in generate_text_semantic * remove Bark inference trick * leave more choices in barkProcessor * remove _no_split_modules * fixe error in forward of block and introduce clearer notations * correct converting script with last changes * make style + add draft bark.mdx * correct BarkModelTest::test_generate_text_semantic * add Bark in main README * add dummy_pt_objects for Bark * add missing models in the main init * correct test_decoder_model_past_with_large_inputs * disable torchscript test * change docstring of BarkProcessor * Add test_processor_bark * make style * correct copyrights * add bark.mdx + make style, quality and consistency * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Remove unnecessary test method * simply logic of a test * Only check first ids for slow audio generation * split full end-to-end generation tests * remove unneccessary comment * change submodel names for clearer naming * remove ModuleDict from modeling_bark * combine two if statements * ensure that an edge misued won't happen * modify variable name * move code snippet to the right place (coarse instead of semantic) * change BarkSemanticModule -> BarkSemanticModel * align BarkProcessor with transformers paradigm * correct BarkProcessor tests with last commit changes * change _validate_voice_preset to an instance method instead of a class method * tie_weights already called with post_init * add codec_model config to configuration * update bark modeling tests with recent BarkProcessor changes * remove SubModelPretrainedModel + change speakers embeddings prompt type in BarkModel * change absolute imports to relative * remove TODO * change docstrings * add examples to docs and docstrings * make style * uses BatchFeature in BarkProcessor insteads of dict * continue improving docstrings and docs + make style * correct docstrings examples * more comprehensible speaker_embeddings load/Save * rename speaker_embeddings_dict -> speaker_embeddings * correct bark.mdx + add bark to documentation_tests * correct docstrings configuration_bark * integrate last nit suggestions * integrate BarkGeneration configs * make style * remove bark tests from documentation_tests.txt because timeout - tested manually * add proper generation config initialization * small bark.mdx documentation changes * rename bark.mdx -> bark.md * add torch.no_grad behind BarkModel.generate_audio() * replace assert by ValueError in convert_suno_to_hf.py * integrate a series of short comments from reviewer * move SemanticLogitsProcessors and remove .detach() from Bark docs and docstrings * actually remove SemanticLogitsProcessor from modeling_bark.oy * BarkProcessor returns a single output instead of tuple + correct docstrings * make style + correct bug * add initializer_range to BarkConfig + correct slow modeling tests * add .clone() to history_prompt.coarse_prompt to avoid modifying input array * Making sure no extra "`" are present * remove extra characters in modeling_bark.py * Correct output if history_prompt is None * remove TODOs * remove ravel comment * completing generation_configuration_bark.py docstrings * change docstrings - number of audio codebooks instead of Encodec codebooks * change 'bias' docstrings in configuration_bark.py * format code * rename BarkModel.generate_audio -> BarkModel.generate_speech * modify AutoConfig instead of EncodecConfig in BarkConfig * correct AutoConfig wrong init * refactor BarkModel and sub-models generate_coarse, generate_fine, generate_text_semantic * remove SemanticLogitsProcessor and replace it with SuppressTokensLogitsProcessor * move nb_codebook related config arguments to BarkFineConfig * rename bark.mdx -> bark.md * correcting BarkModelConfig from_pretrained + remove keys_to_ignore * correct bark.md with correct hub path * correct code bug in bark.md * correct list tokens_to_suppress * modify Processor to load nested speaker embeddings in a safer way * correct batch sampling in BarkFineModel.generate_fine * Apply suggestions from code review Small docstrings correction and code improvements Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * give more details about num_layers in docstrings * correct indentation mistake * correct submodelconfig order of docstring variables * put audio models in alphabetical order in utils/check_repo.my * remove useless line from test_modeling_bark.py * makes BarkCoarseModelTest inherits from (ModelTesterMixin, GenerationTesterMixin, unittest.TestCase) instead of BarkSemanticModelTest * make a Tester class for each sub-model instead of inheriting * add test_resize_embeddings=True for Bark sub-models * add Copied from transformers.models.gpt_neo.modeling_gpt_neo.GPTNeoSelfAttention._split_heads * remove 'Copied fom Bark' comment * remove unneccessary comment * change np.min -> min in modeling_bark.py * refactored all custom layers to have Bark prefix * add attention_mask as an argument of generate_text_semantic * refactor sub-models start docstrings to have more precise config class definition * move _tied_weights_keys overriding * add docstrings to generate_xxx in modeling_bark.py * add loading whole BarkModel to convert_suno_to_hf * refactor attribute and variable names * make style convert_suno * update bark checkpoints * remove never entered if statement * move bark_modeling docstrings after BarkPretrainedModel class definition * refactor modeling_bark.py: kv -> key_values * small nits - code refactoring and removing unecessary lines from _init_weights * nits - replace inplace method by variable assigning * remove optional when necessary * remove some lines in generate_speech * add default value for optional parameter * Refactor preprocess_histories_before_coarse -> preprocess_histories Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * correct usage after refactoring * refactor Bark's generate_xxx -> generate and modify docstrings and tests accordingly * update docstrings python in configuration_bark.py * add bark files in utils/documentation_test.txt * correct docstrings python snippet * add the ability to use parameters in the form of e.g coarse_temperature * add semantic_max_new_tokens in python snippet in docstrings for quicker generation * Reformate sub-models kwargs in BakModel.generate Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * correct kwargs in BarkModel.generate * correct attention_mask kwarg in BarkModel.generate * add tests for sub-models args in BarkModel.generate and correct BarkFineModel.test_generate_fp16 * enrich BarkModel.generate docstrings with a description of how to use the kwargs --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-07-17 17:53:24 +01:00
Sylvain Gugger	c21c3737c1	Add TAPEX to the list of deprecated models (#24859 ) * Add TAPEX to the list of deprecated models * Add check * Fix typo * Fix import path for Van conversion	2023-07-17 12:53:03 -04:00
Younes Belkada	054e802914	fix broken links in READMEs (#24861 ) fix MRA in READMEs	2023-07-17 18:47:14 +02:00
bofeng huang	c965d30279	Fix comments for `_merge_heads` (#24855 ) * Fix comments * Fix comments	2023-07-17 11:07:16 -04:00
Yih-Dar	e4a52b6a15	Fix `is_vision_available` (#24853 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-07-17 16:58:51 +02:00
Samin Yasar	4f08887053	Add Multimodal heading and Document question answering in task_summary.mdx (#23318 ) * add multimodal heading and docqa * fix sentence * task_summary data type = modality clarification * change the multimodal example to a smaller model	2023-07-17 13:51:19 +01:00
dependabot[bot]	38dfb86958	Bump cryptography from 41.0.0 to 41.0.2 in /examples/research_projects/decision_transformer (#24833 ) Bump cryptography in /examples/research_projects/decision_transformer Bumps [cryptography](https://github.com/pyca/cryptography) from 41.0.0 to 41.0.2. - [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst) - [Commits](https://github.com/pyca/cryptography/compare/41.0.0...41.0.2) --- updated-dependencies: - dependency-name: cryptography dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-07-17 07:17:17 -04:00
namespace-Pt	18d42bfd23	Remove unused code in GPT-Neo (#24826 ) 1	2023-07-17 07:07:47 -04:00
Sohyun Sim	9771ad33be	🌐 [i18n-KO] Translated `custom_tools.mdx` to Korean (#24580 ) * docs: ko: custom_tools.mdx * feat: deepl draft * fix: change .mdx to .md * fix: resolve suggestions * fix: resolve suggestions	2023-07-17 07:04:10 -04:00
statelesshz	8ba26c18cf	deprecate `sharded_ddp` training argument (#24825 ) * deprecate fairscale's ShardedDDP * fix code style * roll back * deprecate the `sharded_ddp` training argument --------- Co-authored-by: jihuazhong <jihuazhong1@huawei.com>	2023-07-17 06:57:42 -04:00
Kadir Nar	5bb4430edc	[🔗 Docs] Fixed Incorrect Migration Link (#24793 ) * [🔗 Docs] Fixed Incorrect Migration Link * Update README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-07-14 17:47:50 -04:00
Sylvain Gugger	1023705440	Check models used for common tests are small (#24824 ) * First models * Conditional DETR * Treat DETR models, skip others * Skip LayoutLMv2 as well * Fix last tests	2023-07-14 14:43:19 -04:00
Dario Sučić	a865b62e07	set correct model input names for gptsw3tokenizer (#24788 )	2023-07-14 18:13:45 +01:00
Nicolas Patry	50726f9ea7	Fixing double `use_auth_token.pop` (preventing private models from being visible). (#24812 ) Fixing double `use_auth_token.pop` (preventing private models from being visible). Should fix: https://github.com/huggingface/transformers/issues/14334#issuecomment-1634527833 Repro: Have a private repo, with `vocab.json` (spread out files for the tokenizer) and use `AutoTokenizer.from_pretrained(..., use_auth_token="token")`.	2023-07-14 15:20:02 +02:00
Sylvain Gugger	91d7df58b6	Copy code when using local trust remote code (#24785 ) * Copy code when using local trust remote code * Remote upgrade strategy * Revert "Remote upgrade strategy" This reverts commit `4f0392f5d7`.	2023-07-13 16:57:20 -04:00
Sylvain Gugger	f32303d519	Run hub tests (#24807 ) * Run hub tests * [all-test] Run tests please! * [all-test] Add vision dep for hub tests * Fix tests	2023-07-13 15:25:45 -04:00
Fady Nakhla	9d7a0871e2	Use _BaseAutoModelClass's register method (#24810 ) Switching _BaseAutoModelClass from_pretrained and from_config to use the register classmethod that it defines rather than using the _LazyAutoMapping register method directly. This makes use of the additional consistency check within the base model's register.	2023-07-13 15:24:51 -04:00
Georgie Mathews	0866705022	Update setup.py to be compatible with pipenv (#24789 )	2023-07-13 12:56:43 -04:00
Matt	c0ca73dc98	Remove Falcon docs for the release until TGI is ready (#24808 ) * Remove Falcon docs for the release until TGI is ready * Update toctree	2023-07-13 17:27:58 +01:00
dymil	f9a711df4a	Fix typo 'submosules' (#24809 )	2023-07-13 16:56:53 +01:00

... 31 32 33 34 35 ...

15053 Commits