transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 04:40:06 +06:00

Author	SHA1	Message	Date
Rémi Ouazan	493cf1554b	[seamless_m4t] Skip some tests when speech is not available (#38430 ) * Added the require_speech decorator * Added require_speecj to some seamless_m4t tests * Changed skip message	2025-06-02 09:17:28 +00:00
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟	64d14ef28d	Fix setting FLASH_ATTENTION_DETERMINISTIC after importing (#37185 ) transformers.enable_full_determinism enables deterministic flash attention using `FLASH_ATTENTION_DETERMINISTIC` `800510c67b/src/transformers/trainer_utils.py (L79)` However, current checks use a global variable `deterministic_g`, which will do the environment variable check as soon as importing, this will cause issues as users can call `transformers.enable_full_determinism` after `transformers.modeling_flash_attention_utils` is imported. This behavior is introduced in https://github.com/huggingface/transformers/pull/33932/files#r1806668579 to fix the graph break. As a result, this PR implement fixes by delaying the environment variable check to the first time when `_flash_attention_forward` is executed, so that we can fix this issue and we won't introduce a graph break. Signed-off-by: Hollow Man <hollowman@opensuse.org>	2025-06-02 11:08:20 +02:00
Yuanyuan Chen	fde1120b6c	Remove deprecated use_flash_attention_2 parameter (#37131 ) Signed-off-by: cyy <cyyever@outlook.com>	2025-06-02 11:06:25 +02:00
Fanli Lin	51d732709e	[docs] add xpu environment variable for gpu selection (#38194 ) * squash commits * rename gpu * rename accelerator * change _toctree.yml * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: sdp <sdp@a4bf01943ff7.jf.intel.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2025-05-30 16:05:07 +00:00
Marc Sun	c7f2b79dd8	protect dtensor import (#38496 ) protect	2025-05-30 17:36:00 +02:00
Marc Sun	051a8acc9a	Align TP check (#38328 ) align tp check	2025-05-30 17:15:39 +02:00
M Saqlain	e0545ef0b8	[Tests] Reduced model size for albert-test model (#38480 ) * Reduced model size for albert-test model * Run checks * Removed test_save_load * Removed test skipping functions	2025-05-30 14:22:32 +00:00
dependabot[bot]	f962c862ff	Bump torch from 2.2.0 to 2.6.0 in /examples/flax/vision (#37618 ) Bumps [torch](https://github.com/pytorch/pytorch) from 2.2.0 to 2.6.0. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v2.2.0...v2.6.0) --- updated-dependencies: - dependency-name: torch dependency-version: 2.6.0 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-05-30 14:04:52 +01:00
islemyakoubi	98568d1e25	Fix incorrect bbox_embed initialization when decoder_bbox_embed_share=False in GroundingDINO (#38238 ) * A shallow copy in groundingdino Fixes #37333 * Remove an empty line in the GroundingDinoForObjectDetection class * Translate comments in the GroundingDinoForObjectDetection class from French to English	2025-05-30 15:02:18 +02:00
Winston Castorp	d0fccbf7ef	Fix convert_internvl_weights_to_hf.py to support local paths (#38264 ) fix(internvl): add local path support to convert_internvl_weights_to_hf.py	2025-05-30 14:56:32 +02:00
Arthur	858ce6879a	make it go brrrr (#38409 ) * make it go brrrr * date time * update * fix * up * uppp * up * no number i * udpate * fix * [paligemma] fix processor with suffix (#38365) fix pg processor * [video utils] group and reorder by number of frames (#38374) fix * Fix convert to original state dict for VLMs (#38385) * fix convert to original state dict * fix * lint * Update modeling_utils.py * update * warn * no verbose * fginal * ouft * style --------- Co-authored-by: Raushan Turganbay <raushan@huggingface.co> Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>	2025-05-30 11:19:42 +02:00
Luc Georges	ab5067e7fd	fix: handle no scheduler passed by user (#38407 )	2025-05-30 11:00:44 +02:00
XING, Zhenghao	42ef218b58	[Qwen2.5-Omni] Fix dtype of cos,sin when used with flash attention (#38453 ) * Fix dtype of cos,sin when used with flash attention * Fix dtype of cos,sin when used with flash attention	2025-05-29 18:24:40 +00:00
Yih-Dar	81cff7ad34	Fix `Gemma3IntegrationTest` (#38471 ) * check * check * check * check * check * check * check * test style bot * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-29 16:51:12 +02:00
Lukas Geiger	e508965df7	Cleanup `BatchFeature` and `BatchEncoding` (#38459 ) * Use dict comprehension to create dict * Fix type annotation Union[Any] doesn't really make any sense * Remove methods that are already implemented in the `UserDict` parent class	2025-05-29 14:13:43 +00:00
Rahul	8e5cefcb1e	Fix TypeError in save_pretrained error handling (fixes #38422 ) (#38449 )	2025-05-29 13:58:16 +00:00
Raushan Turganbay	ad9dd3d17b	🔴 [VLM] modeling updates (#38317 ) * updates * fixup * fix tests * fix test * fix * let it be here for now, till monday * two more fixes * persimmon * fixup * fix * fixup * make sure fuyu runs now that LM has new attn API * fixup + tests * qwen vl uses new mask interface as well * qwen image features format * update * remove image_sizes * address comments * i am dumb...	2025-05-29 11:08:23 +00:00
Yaswanth Gali	a6f7acb603	[Tests] Clean up test cases for few models (#38315 ) * Update tests * revert aria change * too slow hence revert	2025-05-29 08:21:28 +00:00
Luc Georges	8010f3cf61	feat: add cache retention for requests (#38446 ) * feat: add cache retention for requests * fix: propagate `manual_eviction` param & refactor `finish_request` `finish_request` now only takes `request_id: str` as an input rather than the full `RequestState`, which was not needed and simplifies calling from `ContinuousBatchingManager::evict_request_from_cache` * refactor: pop req from `active_requests` * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-28 18:15:10 +00:00
Yih-Dar	66da700145	Fix GLM4 checkpoints (#38412 ) * fix * fix * fix * fix * fix * fix * test style bot * Apply style fixes --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-28 16:40:08 +00:00
Avasam	2872e8bac5	Merge type hints from `microsoft/python-type-stubs` (post dropping support for Python 3.8) (#38335 ) * Merge type hints from microsoft/python-type-stubs (post Python 3.8) * Remove mention of pylance * Resolved conflict * Merge type hints from microsoft/python-type-stubs (post Python 3.8) * Remove mention of pylance * Resolved conflict * Update src/transformers/models/auto/configuration_auto.py Co-authored-by: Avasam <samuel.06@hotmail.com> --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2025-05-28 16:21:40 +00:00
Yuanzhou Cai	942c60956f	Model card for mobilenet v1 and v2 (#37948 ) * doc: #36979 * doc: update hfoptions * add model checkpoints links * add model checkpoints links * update example output * update style #36979 * add pipeline tags * improve comments * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * apply suggested changes * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-28 09:20:19 -07:00
Jiwook Han	9a8510572b	Updated the model card for ViTMAE (#38302 ) * Update vit_mae.md * badge float:right * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/vit_mae.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update model_doc/vit_mae.md * fix --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-28 09:19:43 -07:00
Vanshu	c9fcbd5bf9	Updated the Model docs - for the ALIGN model (#38072 ) * Updated the Model docs - for the ALIGN model * Update docs/source/en/model_doc/align.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/align.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Updated align.md * Update docs/source/en/model_doc/align.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/align.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update align.md * fix --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-28 09:19:09 -07:00
Yoni Gozlan	cba94e9272	Fix handling of slow/fast image processors in image_processing_auto.py (#38161 ) Fix wrong error when torchvision is not installed	2025-05-28 16:00:23 +00:00
Yoni Gozlan	21b10d9aa4	Fix `from_args_and_dict` ProcessorMixin (#38296 ) * fix-from-args-and-dict-processormixin * change used_kwargs to valid_kwargs * remove manual valid_kwargs * fix copies * fix modular aria	2025-05-28 11:46:33 -04:00
Matt	f844733568	Fix MoE gradient test (#38438 )	2025-05-28 16:44:20 +01:00
Matt	0ed6f7e6b4	Remove redundant test_sdpa_equivalence test (#38436 ) * Remove redundant test * make fixup	2025-05-28 17:22:25 +02:00
Yih-Dar	51e0fac29f	Trigger doc-builder job after style bot (#38398 ) * update * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-28 17:15:34 +02:00
Yoni Gozlan	c24d18bbae	Fix convert weights for InternVL (#38233 ) Fix internvl convert weights	2025-05-28 11:14:56 -04:00
Matthew Ngan	8850427242	Fix typo in tokenization_utils_base.py docstring (#38418 ) Fix typo in tokenization_utils_base.py	2025-05-28 14:52:10 +00:00
Peter St. John	bab40c6838	[core] support tensor-valued _extra_state values in `from_pretrained` (#38155 ) Support tensor-valued _extra_state values TransformerEngine uses the pytorch get/set_extra_state API to store FP8 layer config information as bytes Tensor in the _extra_state entry in the state dict. With recent changes to from_pretrained, this functionality has broken and loading a model that uses this API doesn't appear to work. This PR fixes the save/load pretrained functions for extra state entries that use a pytorch tensor, and adds a (currently x-failing) test for a dictionary extra state. Signed-off-by: Peter St. John <pstjohn@nvidia.com>	2025-05-28 15:38:42 +02:00
Anton Vlasjuk	badc71b9f6	🔴[`Attention`] Attention refactor for Whisper-based models (#38235 ) * start refactoring whisper * revert for now * first step * carry over attn fixes * check if this works * whisper has an off by one somewhere - cutting mask in any interface * make it based on interface * remove some tests that were skipped but now work * some fixes for whisper tests * interface changes * change the order of fix * some attention adjustments for eager + TP * fix scaling * mask changes * why does whisper contain those extra seq lens? * fix from config for fa2 as input_ids is invalid * fix another test * another fix * disable flex attn due to compile issues * copies and refactor for qwen audio since it somewhat relies on whisper * fix scaling and smaller things * retrigger * new new interface version + more fixups * adjust qwen * add comment * forgot this one * change copies as whisper cuts on the mask * add guard * add flex attention * switch to new mask function + add skips for torchscript * remove old api with cache position * last changes? * trigger ci	2025-05-28 13:32:38 +02:00
JJJYmmm	565a0052ed	make Llama4TextMoe forward more readable (#37529 ) * update forward of Llama4TextMoe * remove redudant transpose * fix formatting --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-28 11:54:45 +02:00
Yih-Dar	defeb04299	Fix CircleCI not triggered when PR is opened from a branch of `huggingface/transformers` (#38413 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-28 11:25:43 +02:00
Cyril Vallez	593276fe1e	Update error when using additional and/or masks (#38429 ) update error	2025-05-28 11:08:49 +02:00
ivarflakstad	3aab6e95cb	Disable mi210 scheduled CI (#38411 )	2025-05-28 10:35:41 +02:00
Yao Matrix	fb82a98717	enable large_gpu and torchao cases on XPU (#38355 ) * cohere2 done Signed-off-by: Matrix Yao <matrix.yao@intel.com> * enable torchao cases on XPU Signed-off-by: Matrix YAO <matrix.yao@intel.com> * fix Signed-off-by: Matrix YAO <matrix.yao@intel.com> * fix Signed-off-by: Matrix YAO <matrix.yao@intel.com> * fix Signed-off-by: Matrix YAO <matrix.yao@intel.com> * rename Signed-off-by: Matrix YAO <matrix.yao@intel.com> * fix Signed-off-by: Matrix YAO <matrix.yao@intel.com> * fix comments Signed-off-by: Matrix YAO <matrix.yao@intel.com> --------- Signed-off-by: Matrix Yao <matrix.yao@intel.com> Signed-off-by: Matrix YAO <matrix.yao@intel.com>	2025-05-28 10:30:16 +02:00
Yih-Dar	cea254c909	Update `CsmForConditionalGenerationIntegrationTest` (#38424 ) * require_read_token * ruff --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-28 10:20:43 +02:00
Raushan Turganbay	baddbdd24b	[qwen-vl] Look for vocab size in text config (#38372 ) fix qwen	2025-05-28 09:32:26 +02:00
Koki Ryu	a974e3b4e1	Fix an error in verify_tp_plan for keys without '.' (#38420 )	2025-05-28 09:30:43 +02:00
ivarflakstad	b1eae943a2	Change slack channel for mi250 CI (#38410 )	2025-05-28 09:20:34 +02:00
ivarflakstad	5f49e180a6	Add mi300 to amd daily ci workflows definition (#38415 )	2025-05-28 09:17:41 +02:00
Andy Vu	3b3ebcec40	Updated model card for OLMo2 (#38394 ) * Updated OLMo2 model card * added command line * Add suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Added suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Indented code block as per suggestions --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 16:24:36 -07:00
Yoni Gozlan	f5307272f5	Falcon-H1 - Fix auto_docstring and add can_return_tuple decorator (#38260 ) Fix auto_docstring and add can_return_tuple	2025-05-27 16:18:05 -04:00
Tanuj Rai	a092f6babf	Update granite.md (#37791 ) * Update granite.md * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update granite.md * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/granite.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * minor fixes --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 12:55:15 -07:00
RogerSinghChugh	be7aa3210b	New bart model card (#37858 ) * Modified BART documentation wrt to issue #36979. * Modified BART documentation wrt to issue #36979. * fixed a typo. * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bart.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * blank commit. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:51:41 -07:00
RogerSinghChugh	587c1b0ed1	Updated BERTweet model card. (#37981 ) * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). * Updated BERTweet model card. * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/bertweet.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * updated toctree (EN). --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:51:22 -07:00
RogerSinghChugh	b73faef52f	Updated BigBird Model card as per #36979 . (#37959 ) * Updated BigBird Model card as per #36979. * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/big_bird.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-27 11:24:28 -07:00
Madhav Kumar	538e847c06	Updated Zoedepth model card (#37898 ) * Edited zoedepth model card according to specifications. * Edited Zoedepth model file * made suggested changes.	2025-05-27 10:06:53 -07:00
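
Note on commit 64d14ef28d (#37185): the message describes moving the `FLASH_ATTENTION_DETERMINISTIC` check from import time to the first execution of `_flash_attention_forward`. The following is a minimal sketch of that lazy-check pattern only, not the actual transformers code. The names `_deterministic`, `is_deterministic` and `enable_determinism` are hypothetical, and the assumption that the variable is set to `"1"` is ours.

```python
import os

# Hypothetical module-level cache; None means "not checked yet".
# Reading os.environ here at import time would freeze the value too early.
_deterministic = None


def is_deterministic() -> bool:
    """Read FLASH_ATTENTION_DETERMINISTIC on first use, then reuse the cached result."""
    global _deterministic
    if _deterministic is None:
        # Assumption: "1" means enabled; anything else means disabled.
        _deterministic = os.environ.get("FLASH_ATTENTION_DETERMINISTIC", "0") == "1"
    return _deterministic


def enable_determinism() -> None:
    """Hypothetical stand-in for a user call made after the module is imported."""
    os.environ["FLASH_ATTENTION_DETERMINISTIC"] = "1"
```

Because the environment is read only when `is_deterministic()` is first called, a call to `enable_determinism()` made after import but before the first forward pass still takes effect. Later changes to the variable are ignored once the value is cached.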


19358 Commits