transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 11:11:05 +06:00

History

Yoach Lacombe d2cdefb9ec Add new meta w2v2-conformer BERT-like model (#28165 ) * first commit * correct default value non causal * update config and modeling code * update converting checkpoint * clean modeling and fix tests * make style * add new config parameters to docstring * fix copied from statements * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * make position_embeddings_type docstrings clearer * clean converting script * remove function not used * clean modeling file * apply suggestion for test file + add convert script to not_doctested * modify tests according to review - cleaner logic and more tests * Apply nit suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add checker of valid position embeddings type * instantiate new layer norm layer with the right eps * fix freeze_feature_encoder since it can be None in some cases * add test same output in convert script * restore wav2vec2conformer and add new model * create processor and FE + clean * add new model code * fix convert script and set default config parameters * correct model id paths * make style * make fix-copies and cleaning files * fix copied from statements * complete .md and fixe copies * clean convert script argument defaults * fix config parameters docstrings * fix config docstring * add copied from and enrich FE tests * fix copied from and repo-consistency * add autotokenizer * make test input length shorter and change docstring code * fix docstrings and copied from * add add_adapter to ASR training example * make testing of adapters more robust * adapt to multi adapter layers * refactor input_values->input_features and remove w2v2-bert feature extractor * remove pretraining model * remove depreciated features and useless lines * add copied from and ignore statements to modeling tests * remove pretraining model #2 * change import in convert script * change default in convert script * update readme and remove useless line * Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * refactor BERT to Bert for consistency * remove useless ignore copy statement * add persistent to buffer in rotary * add eps in LayerNorm init and remove copied from * add adapter activation parameters and add copied from statements * Fix copied statements and add unitest.skip reasons * add copied statement in test_processor * refactor processor * make style * replace numpy random by torch rand * remove expected output CTC * improve converting script with processor class * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove gumbel class * remove tests related to previously deleted class * Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * correct typos * remove uused parameters * update processor to takes both text and audio * update checkpoints * update expected output and add ctc expected output * add label_attention_mask * replace pt with np in processor tests * fix typo * revert to behaviour with labels_attention_mask --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>		2024-01-18 13:37:34 +00:00
..
benchmark
bettertransformer	Fixed malapropism error (#26660 )	2023-10-09 11:04:57 +02:00
deepspeed	Fix initialization for missing parameters in `from_pretrained` under ZeRO-3 (#28245 )	2024-01-09 14:58:21 +00:00
extended	Device agnostic trainer testing (#27131 )	2023-10-30 18:16:40 +00:00
fixtures	[WIP] add SpeechT5 model (#18922 )	2023-02-03 12:43:46 -05:00
fsdp	fix resuming from ckpt when using FSDP with FULL_STATE_DICT (#27891 )	2023-12-16 19:41:43 +05:30
generation	Config: warning when saving generation kwargs in the model config (#28514 )	2024-01-16 18:31:01 +00:00
models	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
optimization	Make schedulers picklable by making lr_lambda fns global (#21768 )	2023-03-02 12:08:43 -05:00
peft_integration	[`Peft`] `modules_to_save` support for peft integration (#27466 )	2023-11-14 10:32:57 +01:00
pipelines	Tokenizer kwargs in textgeneration pipe (#28362 )	2024-01-15 16:52:18 +01:00
quantization	[GPTQ] Fix test (#28018 )	2024-01-15 11:22:54 -05:00
repo_utils	Allow `# Ignore copy` (#27328 )	2023-12-07 10:00:08 +01:00
sagemaker	Broken links fixed related to datasets docs (#27569 )	2023-11-17 13:44:09 -08:00
tokenization	[`Styling`] stylify using ruff (#27144 )	2023-11-16 17:43:19 +01:00
tools	Add support for for loops in python interpreter (#24429 )	2023-06-26 09:58:14 -04:00
trainer	Support `DeepSpeed` when using auto find batch size (#28088 )	2024-01-10 06:03:13 -05:00
utils	improve dev setup comments and hints (#28495 )	2024-01-15 18:36:40 +00:00
__init__.py
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_cache_utils.py	Generate: SinkCache can handle iterative prompts (#27907 )	2023-12-08 20:02:20 +00:00
test_configuration_common.py	[ `PretrainedConfig`] Improve messaging (#27438 )	2023-11-15 14:10:39 +01:00
test_configuration_utils.py	Config: warning when saving generation kwargs in the model config (#28514 )	2024-01-16 18:31:01 +00:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
test_image_processing_common.py	Fix a couple of typos and add an illustrative test (#26941 )	2023-12-11 15:51:51 +00:00
test_image_processing_utils.py	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00
test_image_transforms.py	Normalize floating point cast (#27249 )	2023-11-10 15:35:27 +00:00
test_modeling_common.py	Fix SDPA tests (#28552 )	2024-01-17 17:29:18 +01:00
test_modeling_flax_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_modeling_flax_utils.py	Default to msgpack for safetensors (#27460 )	2023-11-13 15:17:01 +01:00
test_modeling_tf_common.py	Replace build() with build_in_name_scope() for some TF tests (#28046 )	2023-12-14 17:42:25 +00:00
test_modeling_tf_utils.py	Replace build() with build_in_name_scope() for some TF tests (#28046 )	2023-12-14 17:42:25 +00:00
test_modeling_utils.py	Config: warning when saving generation kwargs in the model config (#28514 )	2024-01-16 18:31:01 +00:00
test_pipeline_mixin.py	Shorten the conversation tests for speed + fixing position overflows (#26960 )	2023-10-31 14:20:04 +00:00
test_processing_common.py	Save `Processor` (#27761 )	2024-01-18 10:21:45 +00:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	[ `TokenizationUtils`] Fix `add_special_tokens` when the token is already there (#28520 )	2024-01-16 16:36:29 +01:00
test_tokenization_utils.py	Remove-auth-token (#27060 )	2023-11-13 14:20:54 +01:00