transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-13 09:40:06 +06:00

Author	SHA1	Message	Date
Yih-Dar	cf7b98b807	Fix `run_clip.py` (#20234 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-11-15 15:45:21 +01:00
Ming Liu	36b063ed4f	Update README.md (#20188 ) There is typo in the original hyperlink. Below is the original version: Based on the script [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/run_translationn_no_trainer.py).	2022-11-14 12:53:02 -05:00
Sanchit Gandhi	af1a7c8ca3	[Examples] Generalise Seq2Seq ASR to handle Whisper (#19519 ) * merge conflicts * bos and eos in datacollator * (temp) hardcode removal of attention mask * freeze encoder * actually freeze encoder * set max length / num beams according to gen kwargs * (temp) fix tests * don't pop attn mask * override return attention mask config from Hub * Hub configs updated 🤗 * final fixes * update type annotations * backward comp	2022-11-14 17:45:46 +00:00
bhuang	3502c202f9	Update README.md (#20063 )	2022-11-04 08:56:54 -04:00
Sylvain Gugger	06886d5a68	Only resize embeddings when necessary (#20043 ) * Only resize embeddings when necessary * Add comment	2022-11-03 12:05:04 -04:00
amyeroberts	a6b7759880	Add Image Processors (#19796 ) * Add CLIP image processor * Crop size as dict too * Update warning * Actually use logger this time * Normalize doesn't change dtype of input * Add perceiver image processor * Tidy up * Add DPT image processor * Add Vilt image processor * Tidy up * Add poolformer image processor * Tidy up * Add LayoutLM v2 and v3 imsge processors * Tidy up * Add Flava image processor * Tidy up * Add deit image processor * Tidy up * Add ConvNext image processor * Tidy up * Add levit image processor * Add segformer image processor * Add in post processing * Fix up * Add ImageGPT image processor * Fixup * Add mobilevit image processor * Tidy up * Add postprocessing * Fixup * Add VideoMAE image processor * Tidy up * Add ImageGPT image processor * Fixup * Add ViT image processor * Tidy up * Add beit image processor * Add mobilevit image processor * Tidy up * Add postprocessing * Fixup * Fix up * Fix flava and remove tree module * Fix image classification pipeline failing tests * Update feature extractor in trainer scripts * Update pad_if_smaller to accept tuple and int size * Update for image segmentation pipeline * Update src/transformers/models/perceiver/image_processing_perceiver.py Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> * Update src/transformers/image_processing_utils.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/beit/image_processing_beit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * PR comments - docstrings; remove accidentally added resize; var names * Update docstrings * Add exception if size is not in the right format * Fix exception check * Fix up * Use shortest_edge in tuple in script Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2022-11-02 11:57:36 +00:00
Sylvain Gugger	c3a93d8d82	v4.25.0.dev0	2022-10-31 21:48:40 -04:00
Sanchit Gandhi	f38a145418	[ASR] Update 'tasks' for model card (#19986 )	2022-10-31 16:50:17 +00:00
regisss	5d2d51a0fb	Fix LR (#19875 )	2022-10-26 08:35:53 -04:00
GMFTBY	71786b10c5	Adding the state-of-the-art contrastive search decoding methods for the codebase of generation_utils.py (#19477 ) * add: the contrastive search for generaton_utils * add: testing scripts for contrastive search under examples/text-generation * update the quality of codes * revise the docstring; make the generation_contrastive_search.py scripts; * revise the examples/pytorch/text-generation/run_generation_contrastive_search.py to the auto-APIs format * revise the necessary documents * fix: revise the docstring of generation_contrastive_search.py * Fix the code indentation * fix: revise the nits and examples in contrastive_search docstring. * fix the copyright * delete generation_contrastive_search.py * revise the logic in contrastive_search * update the intergration test and the docstring * run the tests over * add the slow decorate to the contrastive_search intergrate test * add more test * do the style, quality, consistency checks	2022-10-19 10:17:46 +01:00
amyeroberts	31ec424b3d	Add decorator to flaky test (#19674 )	2022-10-18 18:51:37 +01:00
Yifan Yang	94d7c3ba44	[Examples] make default preprocessing_num_workers=1 (#19684 ) * [Examples] make default preprocessing_num_workers=1 * [Examples] revert changes in research projects	2022-10-17 14:17:01 -04:00
Sanchit Gandhi	eefcecaa35	[Examples] Fix typos in run speech recognition seq2seq (#19514 )	2022-10-12 15:33:22 +01:00
FilipposVentirozos	4ed0fa3676	Fix pytorch seq2seq qa (#19258 ) * fixed typo for SQuAD * Fixed the preprocess_validation_function function for the labels to reflect the remaining truncated instances * Rolled back the trainer_seq2seq_qa.py for UnboundLocalError: local variable 'metrics' referenced before assignment Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-10-12 08:33:44 -04:00
regisss	bb2cfd1824	Add multi-node conditions in trainer_qa.py and trainer_seq2seq.py (#19502 ) * Add multi-node conditions in trainer_qa.py and trainer_seq2seq.py * Code improvement	2022-10-11 22:48:56 -04:00
Lysandre	10100979ed	Dev version	2022-10-10 17:25:40 -04:00
wei zhao	7d5ce6802e	Fix typo in image-classification/README.md (#19424 ) Fix link typo of the following content. PyTorch version, Trainer PyTorch version, no Trainer	2022-10-10 09:16:58 -04:00
ddobokki	fa4bcd5274	edit: cast attention_mask to long in DataCollatorCTCWithPadding (#19369 ) * edit: casting attention_mask to long in DataCollatorCTCWithPadding * edit: casting attention_mask to long in DataCollatorCTCWithPadding	2022-10-07 10:05:48 -04:00
Zachary Mueller	ad98642a82	Fix gather for metrics (#19360 )	2022-10-05 14:52:01 -04:00
Divyanshu Kumar	c28d04e9e2	Update no_trainer script for summarization (#19277 ) * Update no_trainer script for summarization * removed unnecessary import * fixes notation mistake * removed: unused variable	2022-10-03 09:21:51 -04:00
Sylvain Gugger	0fc68a7e14	Fix seq2seq QA example	2022-09-28 15:45:49 -04:00
Tatsuki Okada	4a0b958d61	Fix trainer seq2seq qa.py evaluate log and ft script (#19208 ) * fix args option * fix trainer eval log * fix out of memory qa script * do isort, black, flake * fix tokenize target * take it back. * fix: comment	2022-09-28 10:55:46 -04:00
Sylvain Gugger	c20b2c7e18	Use repo_type instead of deprecated datasets repo IDs (#19202 ) * Use repo_type instead of deprecated datasets repo IDs * Add missing one in doc	2022-09-26 09:50:48 -04:00
Enze	5da6afdd8d	Update run_clip.py (#19130 ) The overwrite_cache parameter is declared twice.	2022-09-23 20:48:41 +02:00
Leandro von Werra	ef6741fe65	Fix GLUE MNLI when using `max_eval_samples` (#18722 )	2022-09-21 09:33:22 +02:00
Santiago Castro	06f341de4f	Add a missing space in a script arg documentation (#19113 )	2022-09-20 21:43:32 +02:00
Lysandre	16913b3c92	Dev version	2022-09-14 14:58:20 -04:00
Rahul A R	00fc9217d1	Fixed bug which caused overwrite_cache to always be True (#19000 ) * fixed bug which caused overwrite_cache to always be True (#18967). * reformatting changes	2022-09-13 11:29:48 -04:00
Rafał Jankowski	85125fcffd	Neptune.ai integration improvements (#18934 ) * NeptuneCallback improvements * After review suggestions and deduplication of initial run * Added volatile checkpoints support due to missing post-rebase commit * Update README per review comments - Remove list formatting - Correct Neptune docs link Co-authored-by: Sabine <sabine.nyholm@neptune.ai>	2022-09-09 11:37:34 -04:00
Nicholas Broad	4f299b2446	Accelerator end training (#18910 ) * add accelerator.end_training() Some trackers need this to end their runs. * fixup and quality * add space * add space again ?!?	2022-09-07 07:46:26 -04:00
arun99481	3b19c0317b	updating gather function with gather_for_metrics in run_wav2vec2_pretraining (#18877 ) Co-authored-by: Arun Rajaram <arunrajaram@Aruns-MacBook-Pro.local>	2022-09-06 07:36:37 -04:00
Sylvain Gugger	c61f116b63	Tie weights after preparing the model in run_clm (#18855 )	2022-09-01 12:06:56 -04:00
Rahul A R	e9442440fc	streamlining 'checkpointing_steps' parsing (#18755 )	2022-08-25 11:00:38 -04:00
Rahul A R	c55d6e4e10	examples/run_summarization_no_trainer: fixed incorrect param to hasattr (#18720 ) * fixed incorrect param to hasattr * simplified condition checks * code cleanup	2022-08-24 12:12:42 -04:00
Atharva Ingle	d90a36d192	remove check for main process for trackers initialization (#18706 )	2022-08-22 11:16:27 -04:00
Atharva Ingle	e54a1b49aa	`model.tie_weights()` should be applied after `accelerator.prepare()` (#18676 ) * `model.tie_weights()` should be applied after `accelerator.prepare` Weight tying should be done after the model has been moved to XLA device as mentioned on PyTorch/XLA Troubleshooting guide [here](https://github.com/pytorch/xla/blob/master/TROUBLESHOOTING.md#xla-tensor-quirks) * format code	2022-08-18 13:46:57 -04:00
Zachary Mueller	358fc18613	Add evaluate to examples requirements (#18666 )	2022-08-18 10:57:39 -04:00
Stefan Schweter	358478e729	Examples: add Bloom support for token classification (#18632 ) * examples: add Bloom support for token classification (FLAX, PyTorch and TensorFlow) * examples: remove support for Bloom in token classication (FLAX and TensorFlow currently have no support for it)	2022-08-17 09:50:57 +02:00
zhoutang776	25e651a2de	Update run_translation_no_trainer.py (#18637 ) * Update run_translation_no_trainer.py found an error in selecting `no_decay` parameters and some small modifications when the user continues to train from a checkpoint * fixs `no_decay` and `resume_step` issue 1. change `no_decay` list 2. if use continue to train their model from provided checkpoint, the `resume_step` will not be initialized properly if `args.gradient_accumulation_steps != 1`	2022-08-16 13:25:57 -04:00
Rasmus Arpe Fogh Jensen	a765b68aa6	Update no_trainer.py scripts to include accelerate gradient accumulation wrapper (#18473 ) * Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script * make fixup changes * PR comments * changed input to Acceletor based on PR comment, ran make fixup * Added comment explaining the sync_gradients statement * Fixed lr scheduler max steps * Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper * Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper * Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script * make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py * removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script	2022-08-08 15:52:47 -04:00
Sylvain Gugger	70b0d4e193	Fix compatibility with 1.12 (#17925 ) * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-08 09:53:08 -04:00
regisss	88a0ce57bb	Add seed setting to image classification example (#18519 )	2022-08-08 08:08:11 -04:00
Julien Chaumond	9129fd0377	`transformers-cli login` => `huggingface-cli login` (#18490 ) * zero chance anyone's using that constant no? * `transformers-cli login` => `huggingface-cli login` * `transformers-cli repo create` => `huggingface-cli repo create` * `make style`	2022-08-06 09:42:55 +02:00
Julien Chaumond	8d1f9039d0	Just re-reading the whole doc every couple of months 😬 (#18489 ) * Delete valohai.yaml * NLP => ML * typo * website supports https * datasets * 60k + modalities * unrelated link fixing for accelerate * Ok those links were actually broken * Fix link * Make `AutoTokenizer` auto-link * wording tweak * add at least one non-nlp task	2022-08-06 09:38:55 +02:00
Kian Sierra McGettigan	0bf1e1aca4	Update no trainer examples for QA and Semantic Segmentation (#18474 ) * swag_no_trainer updated for with gather_metrics * Removed unused variable samples_seen * updated examples with gather_for_metrics	2022-08-04 13:22:19 -04:00
Kian Sierra McGettigan	330247ede2	Update no trainer scripts for multiple-choice (#18468 ) * swag_no_trainer updated for with gather_metrics * Removed unused variable samples_seen	2022-08-04 07:29:32 -04:00
Ritik Nandwal	3db4378bd7	Update no trainer scripts for language modeling and image classification examples (#18443 ) * Update no_trainer script for image-classification * Update no_trainer scripts for language-modeling examples * Remove unused variable * Removing truncation from losses array for language modeling examples	2022-08-03 08:33:18 -04:00
Yih-Dar	5546fb61ab	fix run_clip README (#18332 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-02 19:14:46 +02:00
Sylvain Gugger	941d233153	Fix ROUGE add example check and update README (#18398 ) * Fix ROUGE add example check and update README * Stay consistent in values	2022-08-01 11:14:49 -04:00
Ogundepo Odunayo	679d68a11b	Correct the spelling of bleu metric (#18375 )	2022-08-01 07:51:27 -04:00

1 2 3 4 5 ...

301 Commits