Yijun Lee
c15d01fa1d
🌐 [i18n-KO] Translated file_utils.md
to Korean ( #33803 )
...
* docs: ko: file_utils.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
---------
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
2024-10-08 17:57:17 -07:00
Jiwook Han
f0f8077025
🌐 [i18n-KO] Translated swin.md
to Korean ( #33510 )
...
* ko: doc: model_doc/swin.md
* feat: nmt draft
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: manual edits
* Update docs/source/ko/model_doc/swin.md
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
* resolve conflicts
* resolve conflicts - 2
---------
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
2024-10-08 17:57:03 -07:00
Yijun Lee
0d0ec1dbfb
🌐 [i18n-KO] Translated tokenization_utils.md
to Korean ( #33813 )
...
* docs: ko: tokenization_utils.md
* feat: nmt draft
* fix: manual edits
2024-10-08 17:56:30 -07:00
Sungmin Oh
386401eca0
🌐 [i18n-KO] Translated main_classes/onnx.md
to Korean ( #33601 )
...
* docs: ko: main_classes/onnx.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
2024-10-08 17:15:46 -07:00
Sungmin Oh
db5f117b8a
🌐 [i18n-KO] Translated model_doc/deberta-v2.md
to Korean ( #33968 )
...
* docs: ko: model_doc/deberta-v2.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
* fix: resolve suggestions
* fix: resolve suggestions
---------
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
2024-10-08 17:15:33 -07:00
Sungmin Oh
cd9a3c49b8
🌐 [i18n-KO] Translated model_doc/dbrx.md
to Korean ( #33951 )
...
* docs: ko: model_doc/dbrx.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestions
* fix: resolve suggestions
---------
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
2024-10-08 17:14:42 -07:00
Sungmin Oh
d6d07f9c77
🌐 [i18n-KO] Translated model_doc/cohere.md
to Korean ( #33885 )
...
* docs: ko: model_doc/cohere.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: resolve suggestions
---------
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
2024-10-08 17:14:25 -07:00
Sungmin Oh
48e80284fa
🌐 [i18n-KO] Translated model_doc/mistral.md
to Korean ( #33648 )
...
* docs: ko: model_doc/mistral.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
2024-10-08 17:14:12 -07:00
Sungmin Oh
adb14b93f4
🌐 [i18n-KO] Translated model_doc/llama3.md
to Korean ( #33635 )
...
* docs: ko: model_doc/llama3.md
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
---------
Co-authored-by: Chaewon Song <chaewon1019@ewhain.net>
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
2024-10-08 17:13:57 -07:00
Sungmin Oh
291e707868
🌐 [i18n-KO] Translated model_doc/paligemma.md
to Korean ( #33612 )
...
* docs: ko: model_doc/paligemma.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
* fix: resolve suggestions
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
2024-10-08 17:13:25 -07:00
Sungmin Oh
dd43dafa39
🌐 [i18n-KO] Translated model_doc/clip.md
to Korean ( #33610 )
...
* docs: ko: model_doc/clip.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* fix: resolve suggestions
* fix: resolve suggestions
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
2024-10-08 17:13:07 -07:00
Sungmin Oh
acde6c7d9d
🌐 [i18n-KO] Translated model_doc/patchtsmixer.md
to Korean ( #33587 )
...
* docs: ko: model_doc/patchtsmixer.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* fix: resolve suggestions
---------
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
2024-10-08 17:11:48 -07:00
Sungmin Oh
bb825dde73
🌐 [i18n-KO] Translated model_doc/autoformer.md
to Korean ( #33574 )
...
* docs: ko: model_doc/autoformer.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
2024-10-08 17:11:19 -07:00
Sungmin Oh
1d458437dd
🌐 [i18n-KO] Translated model_doc/mamba.md
to Korean ( #33626 )
...
* docs: ko: model_doc/mamba.md
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
* fix: resolve suggestions
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
2024-10-08 17:11:11 -07:00
Sungmin Oh
47da2c528b
🌐 [i18n-KO] Translated main_classes/configuration.md
to Korean ( #33952 )
...
* docs: ko: main_classes/configuration.md
* feat: nmt draft
2024-10-08 17:11:02 -07:00
Sungmin Oh
2e8de976bd
🌐 [i18n-KO] Translated main_classes/quantization.md
to Korean ( #33959 )
...
* docs: ko: main_classes/quantization.md
* feat: nmt draft
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
* fix: resolve suggestions
---------
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
2024-10-08 17:10:41 -07:00
Chaewon Song
2fe77783c3
🌐 [i18n-KO] Translated rag.md
to Korean ( #33989 )
...
* fix: toctree edits
* feat: nmt-draft
* fix: edit Inline TOC
2024-10-08 17:10:26 -07:00
Ahnjj_DEV
1ed98773e5
🌐 [i18n-KO] Translated gpt_neox_japanese.md
to Korean ( #33894 )
...
* docs: ko: gpt_neox_japanese.md
* Update _toctree.yml
* fix: manual edits
* Update docs/source/ko/model_doc/gpt_neox_japanese.md
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
* Update docs/source/ko/model_doc/gpt_neox_japanese.md
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
* Update docs/source/ko/model_doc/gpt_neox_japanese.md
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
---------
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
2024-10-08 17:08:06 -07:00
Ahnjj_DEV
79af52ad9a
🌐 [i18n-KO] Translated bertweet.md
to Korean ( #33891 )
...
* docs: ko: bertweet.md
* Update _toctree.yml
* fix: manual edits
* Update docs/source/ko/model_doc/bertweet.md
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
---------
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
2024-10-08 17:07:13 -07:00
Yijun Lee
d49999ce11
🌐 [i18n-KO] Translated feature_extractor.md
to Korean ( #33775 )
...
* docs: ko: feature_extractor.md
* feat: nmt draft
* fix: manual edits
2024-10-08 17:06:56 -07:00
Cyril Vallez
17806d11ba
Improve modular converter ( #33991 )
...
* improve modular
* style
* Update modular_model_converter.py
* pretty print warning
* style
* Support to remove unused classes as part of added dependencies as well
* nits
* correct bug
* add example
* style
* Add documentation
2024-10-08 14:53:58 +02:00
Yoni Gozlan
e2001c3413
Add auto model for image-text-to-text ( #32472 )
...
* Add Auto model for image-text-to-text
* Remove donut from processing auto, add chameleon ti image text to text models
* add qwen2_vl and llava_onevision
* add pixtral to auto model for image-text-to-text
* add mllama and idefics3
* remove models in IGNORE_NON_AUTO_CONFIGURED
* add AutoModelForImageTextToText to tests and doc
2024-10-08 14:26:43 +02:00
Arthur
a3add29097
Add support for __all__ and potentilly deleting functions ( #33859 )
...
* Add support for __all__ and potentailly deleting functions
* updates
* update
* nits
* remove dummies
* fix warning
* fixup
* style
* update
* fixup
* skip copied from when # skip
* remove log
* bring dummies back
* fixup
* remove copied from
* fixup
* remove warnings from `make fix-copies`
* fix doc issues
* nits
* Better error message !
* add support for more flexible naming!
* style
* breaking style?
* fix super() renaming issues
* del not needed when you don't call super().__init__()
* style
* no more fmt on :)
* properly remove `self`
* fixup
* fix
* doc nits
* add some doc 🫡
2024-10-08 10:19:17 +02:00
Yijun Lee
d6ba1ac041
🌐 [i18n-KO] Translated gemma.md
to Korean ( #33936 )
...
* docs: ko: gemma.md
* feat: nmt draft
* fix: manual edits
2024-10-07 15:59:14 -07:00
Jiwook Han
46f146a2b5
🌐 [i18n-KO] Translated vit.md
to Korean ( #33884 )
...
* docs: ko: model_doc/vit.md
* feat: nmt draft
* fix: manual edits
* fix: manual edits
* Update docs/source/ko/model_doc/vit.md
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
* Update docs/source/ko/model_doc/vit.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
---------
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
2024-10-07 15:35:11 -07:00
Jiwook Han
1ecca92f03
🌐 [i18n-KO] Translated swin2sr.md
to Korean ( #33795 )
...
* ko: doc: model_doc/swin2sr.md
* feat: nmt draft
* Update docs/source/ko/model_doc/swin2sr.md
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
---------
Co-authored-by: Yijun Lee <119404328+yijun-lee@users.noreply.github.com>
2024-10-07 15:34:56 -07:00
boyunJang
8258219c4c
🌐 [i18n-KO] Translated auto.md
to Korean ( #33590 )
...
* docs: ko: model_doc/auto.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: wony617 <49024958+Jwaminju@users.noreply.github.com>
Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>
* fix: resolve suggestions
---------
Co-authored-by: wony617 <49024958+Jwaminju@users.noreply.github.com>
Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>
2024-10-07 15:34:45 -07:00
Chaewon Song
253a9a9d6f
🌐 [i18n-KO] Translated logging.md
to Korean ( #33543 )
...
* docs: ko: main_classes/logging.md
* feat: nmt-draft
* fix: update toctree.yml
* Update docs/source/ko/main_classes/logging.md
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
* Update docs/source/ko/main_classes/logging.md
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
* Apply suggestions from code review
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
---------
Co-authored-by: Sungmin Oh <fabxoe.kor@gmail.com>
Co-authored-by: HyeokJun SHIN <96534680+jun048098@users.noreply.github.com>
Co-authored-by: Ahnjj_DEV <ahnjj.dev@gmail.com>
2024-10-07 15:34:34 -07:00
Yijun Lee
178d707b7e
🌐 [i18n-KO] Translated chameleon.md
to Korean ( #33799 )
...
* docs: ko: chameleon.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
---------
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
2024-10-07 15:06:13 -07:00
Yijun Lee
13432f8409
🌐 [i18n-KO] Translated trainer.md
to Korean ( #33797 )
...
* docs: ko: trainer.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
---------
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
2024-10-07 15:05:57 -07:00
Yijun Lee
e9fbe62965
🌐 [i18n-KO] Translated pipelines_utils.md
to Korean ( #33809 )
...
* docs: ko: pipelines_utils.md
* feat: nmt draft
* fix: manual edits
2024-10-07 15:05:17 -07:00
Yijun Lee
9c61ba2f25
🌐 [i18n-KO] Translated time_series_utils.md
to Korean ( #33806 )
...
* docs: ko: time_series_utils.md
* feat: nmt draft
* fix: manual edits
2024-10-07 15:05:00 -07:00
Yijun Lee
9c8bd3fc1b
🌐 [i18n-KO] Translated esm.md
to Korean ( #33796 )
...
* docs: ko: esm.md
* feat: nmt draft
* fix: manual edits
2024-10-07 13:39:22 -07:00
Yijun Lee
6996f2186a
🌐 [i18n-KO] Translated audio_utils.md
to Korean ( #33802 )
...
* docs: ko: audio_utils.md
* feat: nmt draft
* fix: manual edits
2024-10-07 13:39:10 -07:00
Jiwook Han
410c73af1d
🌐 [i18n-KO] Translated swinv2.md
to Korean ( #33566 )
...
* docs: ko: model_doc/swinv2.md
* feat: nmt draft
* fix: manual edits
* fix: manual edits
2024-10-07 12:50:43 -07:00
Yijun Lee
6c18cefed0
🌐 [i18n-KO] Translated gguf.md
to Korean ( #33764 )
...
* docs: ko: gguf.md
* feat nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
---------
Co-authored-by: Jiwook Han <33192762+mreraser@users.noreply.github.com>
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr>
2024-10-07 12:49:08 -07:00
Magnus
ad1a250719
[Docs] Add Developer Guide: How to Hack Any Transformers Model ( #33979 )
...
* docs: add example for separating q, k, v projections in SAM
* docs: How to Hack Any Transformers Model
* docs: remove changes from sam model docs
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-10-07 10:08:20 +02:00
NielsRogge
f5aeb7c1a5
[Docs] Improve VLM docs ( #33393 )
...
* Improve docs
* Update docs/source/en/model_doc/llava.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update docs/source/en/model_doc/llava.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Address comment
* Address comment
* Improve pixtral docs
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-10-07 09:54:07 +02:00
TomLim
1bd604d11c
[WIP] Add Tokenizer for MyT5 Model ( #31286 )
...
* Initial commit for MyT5 model
* custom implementation of MyT5 tokenizer, unused files deleted
* unittest for myt5 tokenizer
* upadate of import structure and style
* removed remmanents of MyT5Config
* fixed docstrings
* Updates after review: filled documentaion file, new docstrings and tests added
* Fixed code style issues
* fixed copied from to refer to function
* updated loading myt5 tokenizer in tests, added sample byte map file to fixtures
* changes after review
* removed redundant copied from
* removed redundant copied from
* optimalization and loading model from hf
* [run_slow] myt5
* [run-slow] myt5
* Updated en documentation for myt5
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-10-06 10:33:16 +02:00
pglorio
f319ba16fa
Add Zamba ( #30950 )
...
* Update index.md
* Rebase
* Rebase
* Updates from make fixup
* Update zamba.md
* Batched inference
* Update
* Fix tests
* Fix tests
* Fix tests
* Fix tests
* Update docs/source/en/model_doc/zamba.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update docs/source/en/model_doc/zamba.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update configuration_zamba.py
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update modeling_zamba.py
* Update modeling_zamba.py
* Update modeling_zamba.py
* Update configuration_zamba.py
* Update modeling_zamba.py
* Update modeling_zamba.py
* Merge branch 'main' of https://github.com/Zyphra/transformers_zamba
* Update ZambaForCausalLM
* Update ZambaForCausalLM
* Describe diffs with original mamba layer
* Moved mamba init into `_init_weights`
* Update index.md
* Rebase
* Rebase
* Updates from make fixup
* Update zamba.md
* Batched inference
* Update
* Fix tests
* Fix tests
* Fix tests
* Fix tests
* Update docs/source/en/model_doc/zamba.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update docs/source/en/model_doc/zamba.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update configuration_zamba.py
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update modeling_zamba.py
* Update modeling_zamba.py
* Update modeling_zamba.py
* Update configuration_zamba.py
* Update modeling_zamba.py
* Update modeling_zamba.py
* Merge branch 'main' of https://github.com/Zyphra/transformers_zamba
* Update ZambaForCausalLM
* Moved mamba init into `_init_weights`
* Update ZambaForCausalLM
* Describe diffs with original mamba layer
* make fixup fixes
* quality test fixes
* Fix Zamba model path
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* circleci fixes
* Update
* circleci fixes
* fix zamba test from merge
* fix ValueError for disabling mamba kernels
* add HF copyright
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* shared_transf --> shared_transformer
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Fixes
* Move attention head dim to config
* Fix circle/ci tests
* Update modeling_zamba.py
* apply GenerationMixin inheritance change from upstream
* apply import ordering
* update needed transformers version for zamba
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* add contribution author
* add @slow to avoid CI
* Update src/transformers/models/zamba/modeling_zamba.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Define attention_hidden_size
* Added doc for attention_head_size
* trigger CI
* Fix doc of attention_hidden_size
* [run-slow] zamba
* Fixed shared layer logic, swapped up<->gate in mlp
* shared_transformer -> shared_transf
* reformat HybridLayer __init__
* fix docstrings in zamba config
* added definition of _get_input_ids_and_config
* fixed formatting of _get_input_ids_and_config
---------
Co-authored-by: root <root@node-4.us-southcentral1-a.compute.internal>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: root <root@node-1.us-southcentral1-a.compute.internal>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
2024-10-04 22:28:05 +02:00
Amit Garg
e3775539c8
PhiMoE ( #33363 )
...
* onboard phimoe model
* removed debug code
* added unit tests
* updated docs
* formatted
* fixed unit tests
* fixed test case
* fixed format
* refactored code
* fixed expected outputs in the integration tests
* Added a warning msg
* Addressed comments
* Addressed comments
* fixed test cases
* added paper link
* Addressed comments
* Refactored PhimoeForCausalLM forward fn
* Refactored PhimoeRotaryEmbedding class
* fixed test cases
* fixed testcase
* fixed test case
* Addressed comments
* fixed test cases
* fixed testcases
* Used cache position instead to get the seq len
2024-10-04 21:39:45 +02:00
jiqing-feng
b916efcb3c
Enables CPU AWQ model with IPEX version. ( #33460 )
...
* enable cpu awq ipex linear
* add doc for cpu awq with ipex kernel
* add tests for cpu awq
* fix code style
* fix doc and tests
* Update docs/source/en/quantization/awq.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/autoawq/test_awq.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* fix comments
* fix log
* fix log
* fix style
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2024-10-04 16:25:10 +02:00
Matt
de4112e4d2
Add a section on writing tool templates to the chat template docs ( #33924 )
...
* Add a section on writing tool templates to the chat template docs
* Small cleanups
2024-10-04 14:40:44 +01:00
Deepak Saldanha
b6a01df6e9
[Doc]: Broken link in Kubernetes doc ( #33879 )
...
* add relative path in .md and redirects to conf.py
* add redirects to conf.py and update .md
* modify links in .md
2024-10-04 11:20:56 +02:00
amyeroberts
b7474f211d
Trainer - deprecate tokenizer for processing_class ( #32385 )
...
* Trainer - deprecate tokenizer for processing_class
* Extend chage across Seq2Seq trainer and docs
* Add tests
* Update to FutureWarning and add deprecation version
2024-10-02 14:08:46 +01:00
Omar Salman
e7c8af7f33
Add sdpa for DistilBert ( #33724 )
...
* Add sdpa for DistilBert
* [run_slow] distilbert
* [run_slow] distilbert
* [run_slow] distilbert
* Try without slow tests
* [run_slow] distilbert
* [run_slow] distilbert
2024-10-02 13:55:19 +01:00
g-prz
fe484726aa
Add falcon gguf ( #33437 )
...
* feat(gguf): add falcon q2 k
* fix(gguf): remove useless renaming
* feat(gguf): seperate falcon 7b and 40b
* feat(gguf): apply fixup
* fix(test): error rebase
* feat(gguf): add fp16 weight comparison for falcon
* feat(gguf): test weight of all layers
* test(gguf): add falcon 40b under skip decorator
* feat(gguf): quick example for extracting model size
2024-10-02 14:10:39 +02:00
TrickEye
2292be6c1b
Fix: typo ( #33880 )
...
Update llm_tutorial.md: typo
2024-10-02 09:12:21 +01:00
pogpog
b77846a6e6
Fix link in gguf.md ( #33768 )
...
Change hyphen to underscore for URL in link to convert_hf_to_gguf.py
2024-09-30 20:17:33 +02:00
mobicham
f5247aca01
Hqq serialization ( #33141 )
...
* HQQ model serialization attempt
* fix hqq dispatch and unexpected keys
* style
* remove check_old_param
* revert to check HQQLinear in quantizer_hqq.py
* revert to check HQQLinear in quantizer_hqq.py
* update HqqConfig default params
* make ci happy
* make ci happy
* revert to HQQLinear check in quantizer_hqq.py
* check hqq_min version 0.2.0
* set axis=1 as default in quantization_config.py
* validate_env with hqq>=0.2.0 version message
* deprecated hqq kwargs message
* make ci happy
* remove run_expected_keys_check hack + bump to 0.2.1 min hqq version
* fix unexpected_keys hqq update
* add pre_quantized check
* add update_expected_keys to base quantizerr
* ci base.py fix?
* ci base.py fix?
* fix "quantization typo" src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix post merge
---------
Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-09-30 14:47:18 +02:00