transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-28 08:42:23 +06:00

Author	SHA1	Message	Date
kmckiern	9b3eb81014	if learning rate is a tensor, get item (float) (#18861 )	2022-09-02 07:46:31 -04:00
Steven Liu	142e12afb4	Split docs on modality (#18205 ) * update * 🖍 add missing files * 📝 add nested sections * 🖍 align titles with tasks * oops * remove quotes from titles	2022-09-01 15:19:11 -05:00
Ankur Goyal	23fab60b67	Pin revision for LayoutLMForQuestionAnswering and TFLayoutLMForQuestionAnswering tests (#18854 ) * Pin revision for tests * Fixup * Update revision in models * Shorten revisions Co-authored-by: Ankur Goyal <ankur@impira.com>	2022-09-01 12:52:33 -04:00
OlivierDehaene	ddb69e5af8	Add Image To Text Generation pipeline (#18821 ) * Add Image2TextGenerationPipeline to supported pipelines * Add Flax and Tensorflow support * Add Flax and Tensorflow small tests * Add default model for Tensorflow * Add docstring * Fix doc style * Add tiny models for pytorch and flax * Remove flax from pipeline. Fix tests * Use ydshieh/vit-gpt2-coco-en as a default for both PyTorch and Tensorflow * Fix Tensorflow support Co-authored-by: Olivier Dehaene <olivier@huggingface.co>	2022-09-01 12:07:14 -04:00
Sylvain Gugger	c61f116b63	Tie weights after preparing the model in run_clm (#18855 )	2022-09-01 12:06:56 -04:00
Cody Yu	1c381f3600	Cache results of is_torch_tpu_available() (#18777 ) * Cache results of is_torch_tpu_available() * Update src/transformers/utils/import_utils.py * Update src/transformers/utils/import_utils.py	2022-09-01 11:45:33 -04:00
Sayak Paul	954e18ab97	TensorFlow MobileViT (#18555 ) * initial implementation. * add: working model till image classification. * add: initial implementation that passes intg tests. Co-authored-by: Amy <aeroberts4444@gmail.com> * chore: formatting. * add: tests (still breaking because of config mismatch). Coo-authored-by: Yih <2521628+ydshieh@users.noreply.github.com> * add: corrected tests and remaning changes. * fix code style and repo consistency. * address PR comments. * address Amy's comments. * chore: remove from_pt argument. * chore: add full-stop. * fix: TFLite model conversion in the doc. * Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/mobilevit/modeling_tf_mobilevit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply formatting. * chore: remove comments from the example block. * remove identation in the example. Co-authored-by: Amy <aeroberts4444@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-09-01 10:35:15 -04:00
Gustavo de Rosa	fe58929ad6	Adds timeout argument to training_args to avoid socket timeouts in DDP (#18562 ) * chore(training_args): Adds support for timeout argument. * fix(training_args): Passes make style through changes. * fix(training_args): Removes wrong docstring sentence. * fix(training_args): Fixes timeout not being JSON serializable. * fix(training_args_sm): Also updates timeout to timeout_delta. * fix(training_args): Fixes PR according to suggestions.	2022-09-01 10:33:53 -04:00
kumapo	ab663b2274	reflect max_new_tokens in `Seq2SeqTrainer` (#18786 ) * reflect max_new_tokens in gen_kwargs to `trainer.generate()` * reflect max_new_tokens in `Seq2SeqTrainer` * remove unnecessary variable * Trigger CI * fix style	2022-09-01 09:12:38 -04:00
Pedro Cuenca	f719c0377f	Minor typo in prose of model outputs documentation. (#18848 )	2022-09-01 12:05:40 +02:00
Albert Villanova del Moral	fafbb57df1	Pin rouge_score (#18247 ) * Pin rouge_score * Pin also in dependency_versions_table * Update excluded versions * Revert "Update excluded versions" This reverts commit `0d0362df30`. * Revert "Revert "Update excluded versions"" This reverts commit `66c47af8a6`.	2022-09-01 12:04:49 +02:00
Yih-Dar	e7da38f5dc	add a script to get time info. from GA workflow jobs (#18822 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-01 12:02:52 +02:00
Joao Gante	6e016634f1	Generate: smaller TF serving test (#18840 )	2022-09-01 10:53:39 +01:00
Yih-Dar	563a8d58db	Delete `state_dict` to release memory as early as possible (#18832 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-09-01 10:55:30 +02:00
Albert Villanova del Moral	a26c752353	Unpin fsspec (#18846 )	2022-09-01 10:20:15 +02:00
flozi00	359f7b4b8d	Create pipeline_tutorial.mdx german docs (#18625 ) * Create pipeline_tutorial.mdx * Update _toctree.yml	2022-09-01 09:57:59 +02:00
Alara Dirik	5d81a56833	Owlvit memory leak fix (#18734 ) * fix memory leak * fix typos * use singular last hidden state variable names * eliminate double call to self.owlvit to return last hidden states * eliminate 2nd call to self.vision_model in OwlViTModel	2022-09-01 10:31:08 +03:00
lewtun	80367cd1fb	Add security warning about the from_pretrained() method (#18801 ) * Add security warning about from_pretrained() method * Add sentence about malware scanner Co-authored-by: Julien Chaumond <julien@huggingface.co>	2022-08-31 21:48:40 +02:00
NielsRogge	7e7f743481	Add SegFormer ONNX support (#18006 ) * Add ONNX support * Make height and width dynamic axes Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-31 20:58:44 +02:00
Ekagra Ranjan	89514f0541	Improve Text Generation doc (#18788 ) * fix args for bram search decoding in generation utils * fix missing PAD token in gpt2 * add PAD EOS change to TF * Update src/transformers/generation_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/generation_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/generation_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-08-31 20:30:29 +02:00
Felix Schneider	86387fe87f	Add an option to `HfArgumentParser.parse_{dict,json_file}` to raise an Exception when there extra keys (#18692 ) * Update parser to track unneeded keys, off by default * Fix formatting * Fix docstrings and defaults in HfArgparser * Fix formatting	2022-08-31 20:26:45 +02:00
Ekagra Ranjan	f210e2a414	Improve GPT2 doc (#18787 ) * Minor typo in GPT2 doc * improve gpt2 label doc * update dim of label in GPT2ForTokenClassification * add change to tf	2022-08-31 19:26:39 +02:00
Sylvain Gugger	74690b62a1	Pin ffspec (#18837 ) * Pin ffspec * Typo	2022-08-31 19:04:04 +02:00
NielsRogge	3b6943e7a3	[DETR] Add num_channels attribute (#18714 ) * Add num_channels attribute * Fix code quality Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-31 18:04:42 +02:00
Shu Takayama	811c4c9f79	fix bug: register_for_auto_class should be defined on TFPreTrainedModel instead of TFSequenceSummary (#18607 )	2022-08-31 16:37:18 +02:00
Lysandre Debut	ee407024c4	Update location identification (#18834 )	2022-08-31 15:10:25 +02:00
Zachary Mueller	e4910213be	Warn on TPUs when the custom optimizer and model device are not the same (#18668 ) * Check optimizer for device on TPU * Typo	2022-08-31 08:46:31 -04:00
Wang, Yi	cdde85a0a0	oob performance improvement for cpu DDP (#18595 ) * oob performance improvement for cpu DDP Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * add is_psutil_available check Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2022-08-31 14:35:10 +02:00
Peter Jung	c3be98ebab	Fix cost condition in DetrHungarianMatcher and YolosHungarianMatcher to allow zero-cost (#18647 ) * Fix loss condition in DetrHungarianMatcher * Fix costs condition in YolosHungarianMatcher	2022-08-31 14:28:58 +02:00
Joao Gante	fea4636cfa	Pin max tf version (#18818 )	2022-08-31 10:07:53 +02:00
Ankur Goyal	5c4c869014	Add LayoutLMForQuestionAnswering model (#18407 ) * Add LayoutLMForQuestionAnswering model * Fix output * Remove TF TODOs * Add test cases * Add docs * TF implementation * Fix PT/TF equivalence * Fix loss * make fixup * Fix up documentation code examples * Fix up documentation examples + test them * Remove LayoutLMForQuestionAnswering from the auto mapping * Docstrings * Add better docstrings * Undo whitespace changes * Update tokenizers in comments * Fixup code and remove `from_pt=True` * Fix tests * Revert some unexpected docstring changes * Fix tests by overriding _prepare_for_class Co-authored-by: Ankur Goyal <ankur@impira.com>	2022-08-31 10:05:33 +02:00
Yih-Dar	e88e9ff045	Disable nightly CI temporarily (#18820 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-30 18:33:09 +02:00
Nicolas Patry	73c6273d48	Improving the documentation for "word", within the pipeline. (#18763 ) * Improving the documentation for "word", within the pipeline. * Quality.	2022-08-30 15:29:48 +02:00
Dan Tegzes	5727dfcebe	Added Docstrings for Deberta and DebertaV2 [PyTorch] (#18610 ) * Added Doctest for Deberta Pytorch * Added path in documentation test file * Added docstrings for DebertaV2 * Revert "Added docstrings for DebertaV2" This reverts commit `307185e62a`. * Added DebertaV2 Docstrings	2022-08-30 14:46:21 +02:00
anthony2261	a98f6a1da0	LayoutXLMProcessor: ensure 1-to-1 mapping between samples and images, and add test for it (#18774 )	2022-08-30 14:43:14 +02:00
Dhruv Karan	220da3b8a1	Adds GroupViT to models exportable with ONNX (#18628 ) * groupvit to onnx * dynamic shape for pixel values dim	2022-08-30 14:31:35 +02:00
Dhruv Karan	46d0e26a27	Adds OWLViT to models exportable with ONNX (#18588 ) * onnx conversion for owlvit * .T to .t() * dynamic shapes for pixel values	2022-08-30 14:30:59 +02:00
NielsRogge	b83796ded7	Remove ViltForQuestionAnswering from check_repo (#18762 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-30 14:15:36 +02:00
amyeroberts	ef91a2d135	Run tests if skip condition not met (#18764 ) * Run tests if skip condition not met * Update comment - remove outdated ref to TF 2.8	2022-08-30 14:03:28 +02:00
Christoffer Koo Øhrstrøm	de8548ebf3	[LayoutLMv3] Add TensorFlow implementation (#18678 ) Co-authored-by: Esben Toke Christensen <esben.christensen@visma.com> Co-authored-by: Lasse Reedtz <lasse.reedtz@visma.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2022-08-30 11:48:11 +01:00
NielsRogge	7320d95d98	[Swin, Swinv2] Fix attn_mask dtype (#18803 ) * Add dtype * Fix Swinv2 as well Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-30 12:31:34 +02:00
Li-Huai (Allan) Lin	5c702175eb	up (#18805 )	2022-08-30 12:30:46 +02:00
Ekagra Ranjan	da02b4035c	Add docstring for BartForCausalLM (#18795 ) * add docstring for BartForCausalLM * doc-style fic	2022-08-30 12:19:03 +02:00
amyeroberts	8c4a11493f	Revert to and safely handle flag in owlvit config (#18750 )	2022-08-29 18:48:24 +02:00
Yih-Dar	da5bb29219	send model to the correct device (#18800 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-29 18:46:30 +02:00
NielsRogge	f1fd460694	Add SegFormer and ViLT links (#18808 ) Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>	2022-08-29 18:46:07 +02:00
Lucain	169b8cde47	Fix mock in `test_cached_files_are_used_when_internet_is_down` (#18804 )	2022-08-29 15:56:08 +02:00
Yih-Dar	8b67f20935	Fix memory leak issue in `torch_fx` tests (#18547 ) Co-authored-by: Lysandre Debut <hi@lysand.re> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-29 11:43:20 +02:00
fatih	b10a3b3760	fix a possible typo in auto feature extraction (#18779 )	2022-08-29 11:24:53 +02:00
Yih-Dar	5f06a09b9f	fix missing block when there is no failure (#18775 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-08-29 09:10:13 +02:00

... 89 90 91 92 93 ...

15053 Commits