transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
IMvision12	ebb649a4e3	Add Missing tokenization test [electra] (#22997 ) * Create test_tokenization_electra.py * Update tests/models/electra/test_tokenization_electra.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-05-17 10:45:15 -04:00
cyy	a2789adddf	[Reland] search model buffers for dtype as the last resort (#23319 ) search model buffers for dtype as the last resort	2023-05-17 09:05:07 -04:00
Taras Tsugrii	3d764fe860	Return early once stop token is found. (#23421 ) Previously even after finding a stop token, other stop tokens were considered, which is unnecessary and slows down processing. Currently, this unnecessary overhead is negligible since there are usually 2 stop tokens considered and they are fairly short, but in future it may become more expensive.	2023-05-17 09:00:08 -04:00
Younes Belkada	3d3c7d4213	[`SAM`] fix sam slow test (#23376 ) * fix sam slow test * oops * fix error message	2023-05-17 14:27:43 +02:00
Yih-Dar	22a0769933	Update 3 docker files to use cu118 (#23406 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-17 14:26:50 +02:00
Taras Tsugrii	a6c9643ce7	Use dict.items to avoid unnecessary lookups. (#23415 ) It's more efficient to iterate over key, value dict pairs instead of iterating over keys and performing value lookups on each iteration. It's also more idiomatic.	2023-05-17 11:25:29 +01:00
Taras Tsugrii	43f146208e	Fix a typo in HfAgent docstring. (#23420 )	2023-05-17 09:43:02 +01:00
Yih-Dar	46d2468695	Update `ConvNextV2ModelIntegrationTest::test_inference_image_classification_head` (#23402 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-16 23:35:11 +02:00
Yih-Dar	ca3df9f0cf	Run doctest (in PRs) only when some doc example(s) are modified (#23387 ) * fix * fix * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-16 23:29:02 +02:00
ropoctl	17d0290e57	Why crash the whole run when HFHub gives a 50x error? (#23320 ) Logging an error and continuing is probably following the principle of least surprise.	2023-05-16 15:46:53 -04:00
Sylvain Gugger	d712ebd86d	Fix smdistributed check (#23414 )	2023-05-16 15:18:31 -04:00
Taras Tsugrii	4e244b8817	Replace appends with list comprehension. (#23359 ) It's more idiomatic and significantly more efficient because 1) it avoids repeated `append` call that Python has to resolve on each iteration 2) can preallocate the size of the final list avoiding resizing	2023-05-16 20:14:11 +01:00
Joao Gante	918a06e25d	Generate: add test to check KV format (#23403 ) Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-16 19:28:19 +01:00
Sylvain Gugger	9cf4a8b456	Build with non Python files (#23405 ) * Add a test of the built release * Polish everything * Trigger CI	2023-05-16 14:23:10 -04:00
Joao Gante	5b1ad0eb73	Docs: add link to assisted generation blog post (#23397 )	2023-05-16 18:54:34 +01:00
Stas Bekman	bbbc5c15d4	[AutoModel] fix `torch_dtype=auto` in `from_pretrained` (#23379 ) * [automodel] fix torch_dtype=auto in from_pretrained * add test * fix logic * Update src/transformers/models/auto/auto_factory.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-05-16 10:21:42 -07:00
Zachary Mueller	8a58809312	Fix translation no_trainer (#23407 ) * Fix translation	2023-05-16 13:10:42 -04:00
Joao Gante	130e154291	Generate: faster `can_generate` check on TF and Flax (#23398 )	2023-05-16 15:12:21 +01:00
Younes Belkada	2922e394e3	[`Pix2Struct`] Add conditional generation on docstring example (#23399 ) add conditional generation on docstring	2023-05-16 15:59:18 +02:00
Lucain	52d516c3a9	Minor fixes in transformers-tools (#23364 ) * Few fixes in new Tools implementation * code quality	2023-05-16 15:55:44 +02:00
Sohyun Sim	728c5e82cc	🌐 [i18n-KO] Translated `asr.mdx` to Korean (#23106 ) * docs: ko: task/asr.mdx * feat: manual draft * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-05-16 09:22:56 -04:00
Ivan Sedykh	770a1275d3	Fix chat prompt in HFAgent (#23335 ) fix chat prompts	2023-05-16 09:18:58 -04:00
Joao Gante	466af1a356	OPT/BioGPT: Improved attention mask shape exception (#23270 )	2023-05-16 13:59:53 +01:00
Yih-Dar	21741e8c7e	Update `test_batched_inference_image_captioning_conditioned` (#23391 ) * fix * fix * fix test + add more docs --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-05-16 14:49:24 +02:00
Yih-Dar	d765717c76	Fix `RwkvModel` (#23392 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-16 12:14:54 +02:00
ready-research	80ca924709	Use `mkstemp` to replace deprecated `mktemp` (#23372 ) * Use `mkstemp` to replace deprecated `mktemp` The `tempfile.mktemp` function is [deprecated](https://docs.python.org/3/library/tempfile.html#tempfile.mktemp) due to [security issues](https://cwe.mitre.org/data/definitions/377.html). * Update src/transformers/utils/hub.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-16 11:10:54 +01:00
Koki Tanaka	ba6815e824	Replace NumPy Operations with JAX NumPy Equivalents for JIT Compilation Compatibility (#23356 ) * Replace numpy operations with jax.numpy for JIT compatibility Replaced numpy operations with their jax.numpy equivalents in the transformer library. This change was necessary to prevent errors during JIT compilation. Specifically, the modifications involve changing numpy's in-place assignments to jax.numpy's immutable update methods. * rm numpy import * rm numpy import and fix np->jnp * fixed slices bug * fixed decoder_start_tokens -> decoder_start_token_id * fixed jnp in modleing mt5 * doc fix * rm numpy import * make	2023-05-16 10:54:19 +01:00
dewa	c2393cad08	Added type hints for `Graphormer` pytorch version (#23073 ) * Added type hints for `Graphormer` pytorch version added type hints for graphormers pytorch , checked formating issues . * made the code less bloated	2023-05-15 18:27:41 +01:00
LWprogramming	ee3be05310	Fix test typos - audio feature extractors (#23310 )	2023-05-15 17:22:10 +01:00
Yih-Dar	8f76dc8e5a	Skip failing `AlignModelTest::test_multi_gpu_data_parallel_forward` (#23374 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 16:46:58 +02:00
AinL	41d47db90f	[Bugfix] `OPTDecoderLayer` does not return attentions when `gradient_checkpointing` and `training` is enabled. (#23367 ) Update modeling_opt.py	2023-05-15 13:31:53 +01:00
Yih-Dar	569a97adb2	Revert "Only add files with modification outside doc blocks" (#23371 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 14:28:36 +02:00
Yih-Dar	c94f7a1cce	Fix `OwlViTForObjectDetection.image_guided_detection` doc example (#23370 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 14:17:09 +02:00
Yih-Dar	380280d994	Fix `BigBirdForMaskedLM` doctest (#23369 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 14:15:43 +02:00
Yih-Dar	96ae83a0d2	Fix some `is_xxx_available` (#23365 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 14:08:45 +02:00
richardachen	65b885027a	Typo suggestion (#23360 ) Update graphormer.mdx Typo suggestion	2023-05-15 12:04:16 +01:00
Yih-Dar	81a73fa638	Fix issue introduced in PR #23163 (#23363 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-15 11:38:44 +02:00
Gregory	2958b55fe5	Removing one of the twice defined position_embeddings in LongFormer (#23343 ) Removing twice defined position_embeddings The self.position_embeddings in LongFormerEmbeddings is defined twice. Removing the first with padding_idx	2023-05-15 10:35:55 +01:00
Yih-Dar	cf11493dce	Use cu118 with cudnn >= 8.6 in docker file (#23339 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 21:58:15 +02:00
Susnato Dhar	79743cedab	replaced assert with raise ValueError for t5, switch_transformers, pix2struct, mt5, longt5, gptsan_japanese. (#23273 ) * replaced assert with raise ValueError * one liner * reverse one liner and cache-decoder check	2023-05-12 19:29:50 +01:00
Alisamar Husain	291c5e9b25	Handle padding warning in generation when using `inputs_embeds` (#23131 ) * Handle padding warning in generation when using `inputs_embeds` * Simpler condition * Black formatter * Changed warning logic	2023-05-12 17:06:15 +01:00
hwuebben	65d7b21b77	OR am I crazy? (#23295 ) or or and	2023-05-12 16:47:40 +01:00
Steven Liu	ef3e25ce4e	[docs] Fix Agents and Tools docstring (#23313 ) fix kwargs	2023-05-12 08:29:13 -07:00
Yih-Dar	a3975f94f3	Only add files with modification outside doc blocks (#23327 ) * min. version for pytest * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 16:35:15 +02:00
Mario Lezcano Casado	7f8b909189	Compute the mask in-place, with less memory reads, and on CUDA on `XLNetLMHeadModel` (#23332 ) When working on TorchInductor, I realised that there was a part from `XLNetLMHeadModel` that was being compiled to CPU code. This PR should allow to fuse this operation with other CUDA operations in `torch.compile`. It also should be faster on eager mode, as it has a this implementation has a lower foot-print. If in-place operations are not allowed even in non-grad context, I still believe that doing ones + tril rather than a ones + tril + zeros + cat should be faster simply due to the number of memory reads/writes. I tested that this code produces the same results for `0 <= qlen,mlen < 10` and `same_length in (True, False)`.	2023-05-12 14:35:37 +01:00
Yih-Dar	8c8744a94a	Fix docker image (caused by `tensorflow_text`) (#23321 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 13:37:37 +02:00
Shehan Munasinghe	c045249049	Add swiftformer (#22686 ) * Commit the automatically generated code using add-new-model-like * Update description at swiftformer.mdx file * remove autogenerated code for MaskedImageModeling * update weight conversion scripts * Update modeling_swiftformer.py * update configuration_swiftformer.py * Update test_modeling_swiftformer.py * update modeling code - remove einops dependency * Update _toctree.yml * update modeling code - remove copied from comments * update docs * Revert "update docs" This reverts commit `c2e05e2998`. * update docs * remove unused reference SwiftFormerImageProcessor * update dependency_versions_table.py * update swiftformer.mdx * update swiftformer.mdx * change model output type - no attentions * update model org name * Fix typo * fix copies * Update tests/models/swiftformer/test_modeling_swiftformer.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/auto/image_processing_auto.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/auto/feature_extraction_auto.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/swiftformer.mdx Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/swiftformer/configuration_swiftformer.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update modeling_swiftformer.py fix-copies * make style, make quality, fix-copies * Apply suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * make style Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * make fix-copies * Update modeling_swiftformer.py * Update modeling_swiftformer.py * Add suggestions from code review Co-Authored-By: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-12 11:52:31 +01:00
Yih-Dar	364ced6893	Remove `LanguageIdentificationTool` in `__init__.py` as we don't have it yet (#23326 ) remove LanguageIdentificationTool Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-12 12:11:20 +02:00
Sylvain Gugger	273f5ba026	Revert "search buffers for dtype" (#23308 ) Revert "search buffers for dtype (#23159)" This reverts commit `ef42c2c487`.	2023-05-11 15:31:59 -04:00
Yih-Dar	ba71d9e94c	unpin tf prob (#23293 ) * unpin tf prob --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-05-11 21:28:08 +02:00

1 2 3 4 5 ...

12893 Commits