Marc Sun
06a1d75bd5
fix gptq nits ( #25500 )
...
* fix nits
* fix docstring
* fix doc
* fix damp_percent
* fix doc
2023-08-14 11:43:38 -04:00
Erfan Zekri Esfahani
892f9ea0db
import required torch and numpy libraries ( #25483 )
2023-08-13 19:26:40 +02:00
Marc Sun
55db70c63d
GPTQ integration ( #25062 )
...
* GTPQ integration
* Add tests for gptq
* support for more quantization model
* fix style
* typo
* fix method
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add dataclass and fix quantization_method
* fix doc
* Update tests/quantization/gptq/test_gptq.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* modify dataclass
* add gtpqconfig import
* fix typo
* fix tests
* remove dataset as req arg
* remove tokenizer import
* add offload cpu quantization test
* fix check dataset
* modify dockerfile
* protect trainer
* style
* test for config
* add more log
* overwrite torch_dtype
* draft doc
* modify quantization_config docstring
* fix class name in docstring
* Apply suggestions from code review
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* more warning
* fix 8bit kwargs tests
* peft compatibility
* remove var
* fix is_gptq_quantized
* remove is_gptq_quantized
* fix wrap
* Update src/transformers/modeling_utils.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* add exllama
* skip test
* overwrite float16
* style
* fix skip test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix docsting formatting
* add doc
* better test
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-08-10 16:06:29 -04:00
Merve Noyan
e7b001db4f
Fix rendering for torch.compile()
docs ( #25432 )
...
fix rendering
2023-08-10 13:25:00 +02:00
TaeYupNoh
b14d4641f6
🌐 [i18n-KO] Translated philosophy.md
to Korean ( #25010 )
...
* docs: ko: philosophy.md
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
2023-08-10 09:50:51 +02:00
Hyeonseo Yun
cf84738d2e
🌐 [i18n-KO] Translated model_summary.md
to Korean ( #24625 )
...
* docs: ko: model_summary.md
* feat: nmt and manual edit model_summary.mdx
* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* fix: resolve suggestions2
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-08-09 18:27:27 +02:00
MinJae Kang
133aac09b0
🌐 [i18n-KO] Translated add_new_model.md
to Korean ( #24957 )
...
* docs: ko: add_new_model.md
* feat: chatgpt draft
* fix: manual edits
* fix: change document title
* fix: edit with reviewers
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
* fix: edit with reviewers
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* fix: add anchor to header
* Update docs/source/ko/add_new_model.md
Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com>
* Update docs/source/ko/add_new_model.md
Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com>
* Update docs/source/ko/add_new_model.md
Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com>
* fix: edit with reviews
* feat: edit toctree
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>
Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com>
2023-08-09 18:24:29 +02:00
Maria Khalusova
f2a43c7383
VQA task guide ( #25244 )
...
* initial commit
* semi-finished task guide draft
* image link
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/tasks/visual_question_answering.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* feedback addressed
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* nits addressed
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-08-09 08:29:06 -04:00
Joao Gante
f456b4d10b
Generate: generation config validation fixes in docs ( #25405 )
2023-08-09 13:07:11 +01:00
Joao Gante
d59b872c9e
Docs: introduction to generation with LLMs ( #25240 )
...
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-08-09 11:09:20 +01:00
SeongWooChoi
1367142afd
🌐 [i18n-KO] Translated perf_train_cpu_many.md
to Korean ( #24923 )
...
* docs: ko: perf_train_cpu_many.md
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
---------
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-08-09 08:15:31 +02:00
나건주
6247d1b2b6
🌐 [i18n-KO] Translated add_tensorflow_model.md
to Korean ( #25017 )
...
* docs: ko: add_tensorflow_model.md
* feat: chatgpt draft
* fix: manual edits
* fix: manual edits
* fix: resolve suggestions
* fix: manual edits
2023-08-08 13:56:34 +02:00
Merve Noyan
5ee9693a1c
Docs: Added benchmarks for torch.compile()
for vision models ( #24748 )
...
* added benchmarks for compile
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* added more models
* added more models fr
* added visualizations
* minor fix
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update docs/source/en/perf_torch_compile.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Added links to models and put charts side by side
* Added batch comparisons
* Added more comparisons
* Fix table
* Added link to wheel
* Update perf_torch_compile.md
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-08-07 17:18:43 +01:00
Injin Paek
b9da44bd3e
🌐 [i18n-KO] Translated perf_infer_gpu_one.md
to Korean ( #24978 )
...
* docs: ko: perf_infer_gpu_one
* feat: chatgpt draft
* fix: manual edits
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: TaeYupNoh <107118671+TaeYupNoh@users.noreply.github.com>
* fix: resolve suggestions
* fix: resolve suggestions
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: TaeYupNoh <107118671+TaeYupNoh@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-08-07 08:37:29 +02:00
Sylvain Gugger
f0fd73a2de
Document check copies ( #25291 )
...
* Document check copies better and add tests
* Include header in check for copies
* Manual fixes
* Try autofix
* Fixes
* Clean tests
* Finalize doc
* Remove debug print
* More fixes
2023-08-04 14:56:29 +02:00
Victor Geislinger
641adca558
Fix typo: Roberta -> RoBERTa ( #25302 )
2023-08-03 14:17:30 -07:00
Howard Huang
33da2db5ea
[small] llama2.md typo ( #25295 )
...
`groupe` -> `grouped`
2023-08-03 14:17:06 -07:00
Yoach Lacombe
6d3f9c1e2e
add generate method to SpeechT5ForTextToSpeech ( #25233 )
...
* add generate method to SpeechT5ForTextToSpeech
* update speecht5forTTS docstrings
* Remove defaults to None in generate docstrings
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-08-03 14:12:07 +01:00
Yoach Lacombe
8455346c5c
Update bark doc ( #25234 )
...
* add mention to optimization in Bark docs
* add offload mention in docs
* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* Update bark docs.
* Update bark.md
---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
2023-08-03 14:08:39 +01:00
Joao Gante
a8817371c9
Docs: separate generate section ( #25235 )
...
Separate generate doc section
2023-08-03 13:51:56 +01:00
Kevin Lloyd Bernal
ad8321512d
recommend DeepSpeed's Argument Parsing documentation ( #25268 )
2023-08-02 11:48:39 -04:00
heuristicwave
bef02fd6b9
🌐 [i18n-KO] Translated perf_infer_gpu_many.md
to Korean ( #24943 )
...
* doc: ko: perf_infer_gpu_many.mdx
* feat: chatgpt draft
* fix: manual edits
* Update docs/source/ko/perf_infer_gpu_many.md
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
---------
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-08-02 16:06:35 +02:00
Younes Belkada
972fdcc778
[Docs
/quantization
] Clearer explanation on how things works under the hood. + remove outdated info ( #25216 )
...
* clearer explanation on how things works under the hood.
* Update docs/source/en/main_classes/quantization.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/main_classes/quantization.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add `load_in_4bit` in `from_pretrained`
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-08-01 10:56:52 +02:00
Stas Bekman
5220606607
[quantization.md] fix ( #25190 )
...
Update quantization.md
2023-07-31 09:37:29 -07:00
Sohyun Sim
31f137c04f
🌐 [i18n-KO] Translated transformers_agents.md
to Korean ( #24881 )
...
* docs: ko: transformers_agents.md
* docs: ko: transformers_agents.md
* feat: deepl draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
---------
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
2023-07-28 13:06:37 -04:00
Sanchit Gandhi
e93103632b
Add bloom flax ( #25094 )
...
* First commit
* step 1 working
* add alibi
* placeholder for `scan`
* add matrix mult alibi
* beta scaling factor for bmm
* working v1 - simple forward pass
* move layer_number from attribute to arg in call
* partial functioning scan
* hacky working scan
* add more modifs
* add test
* update scan for new kwarg order
* fix position_ids problem
* fix bug in attention layer
* small fix
- do the alibi broadcasting only once
* prelim refactor
* finish refactor
* alibi shifting
* incorporate dropout_add to attention module
* make style
* make padding work again
* update
* remove bogus file
* up
* get generation to work
* clean code a bit
* added small tests
* adding albii test
* make CI tests pass:
- change init weight
- add correct tuple for output attention
- add scan test
- make CI tests work
* fix few nits
* fix nit onnx
* fix onnx nit
* add missing dtype args to nn.Modules
* remove debugging statements
* fix scan generate
* Update modeling_flax_bloom.py
* Update test_modeling_flax_bloom.py
* Update test_modeling_flax_bloom.py
* Update test_modeling_flax_bloom.py
* fix small test issue + make style
* clean up
* Update tests/models/bloom/test_modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* fix function name
* small fix test
* forward contrib credits from PR17761
* Fix failing test
* fix small typo documentation
* fix non passing test
- remove device from build alibi
* refactor call
- refactor `FlaxBloomBlockCollection` module
* make style
* upcast to fp32
* cleaner way to upcast
* remove unused args
* remove layer number
* fix scan test
* make style
* fix i4 casting
* fix slow test
* Update src/transformers/models/bloom/modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* remove `layer_past`
* refactor a bit
* fix `scan` slow test
* remove useless import
* major changes
- remove unused code
- refactor a bit
- revert import `torch`
* major refactoring
- change build alibi
* remove scan
* fix tests
* make style
* clean-up alibi
* add integration tests
* up
* fix batch norm conversion
* style
* style
* update pt-fx cross tests
* update copyright
* Update src/transformers/modeling_flax_pytorch_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* per-weight check
* style
* line formats
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-07-27 18:24:56 +01:00
Kihoon Son
ee63520a7b
🌐 [i18n-KO] Translated pipeline_webserver.md to Korean ( #24828 )
...
* translated pipeline_webserver.md
Co-Authored-By: Hyeonseo Yun <0525yhs@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* Update pipeline_webserver.md
* Apply suggestions from code review
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
2023-07-26 08:40:37 -04:00
Yih-Dar
da5ff18a4a
Fix doctest ( #25031 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-07-25 22:10:06 +02:00
Sebastian Husch Lee
8f36ab3e22
[T5
, MT5
, UMT5
] Add [T5, MT5, UMT5]ForSequenceClassification ( #24726 )
...
* Initial addition of t5forsequenceclassification
* Adding imports and adding tests
* Formatting
* Running make fix-copies
* Adding mt5forseq
* Formatting
* run make fix-copies
* Adding to docs
* Add model_parallel
* Fix bug
* Fix
* Remove TODO
* Fixing tests for T5ForSequenceClassification
* Undo changes to dependency_versions_table.py
* Change classification head to work with T5Config directly
* Change seq length to let tests pass
* PR comments for formatting
* Formatting
* Initial addition of UMT5ForSequenceClassification
* Adding to inits and formatting
* run make fix-copies
* Add doc for UMT5ForSeqClass
* Update UMT5 config
* Fix docs
* Skip torch fx test for SequenceClassification
* Formatting
* Add skip to UMT5 tests as well
* Fix umt5 tests
* Running make fix-copies
* PR comments
* Fix for change to sentence_representation
* Rename seq_len to hidden_size since that's what it is
* Use base_model to follow format of the rest of the library
* Update docs
* Extract the decoder_input_ids changes and make one liner
* Make one-liner
2023-07-25 21:02:49 +02:00
김준재_T3056
b51312e24d
🌐 [i18n-KO] Translated perf_infer_cpu.md
to Korean ( #24920 )
...
* docs: ko: perf_infer_cpu.md
* feat: chatgpt draft
* fix: manual edits
* Update docs/source/ko/_toctree.yml
* Update docs/source/ko/perf_infer_cpu.md
* Update docs/source/ko/perf_infer_cpu.md
이 부분은 저도 걸리적거렸던 부분입니다. 반영하겠습니다!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
동의합니다! 제가 원본에 너무 얽매여 있었네요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
말씀하신대로 원문에 너무 집착했던것 같습니다
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
더 나은 어휘 사용에 감사드립니다!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
이 당시 '주기'란 용어를 생각해내질 못했네요...
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
좀 더 자연스러운 문맥이 됐네요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
굳이 원본 형식에 얽매일 필요가 없군요!
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* Update docs/source/ko/perf_infer_cpu.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-07-25 16:04:14 +02:00
Arthur
dcb183f4bd
[MPT
] Add MosaicML's MPT
model to transformers ( #24629 )
...
* draft add new model like
* some cleaning of the config
* nits
* add nested configs
* nits
* update
* update
* added layer norms + triton kernels
* consider only LPLayerNorm for now.
* update
* all keys match.
* Update
* fixing nits here and there
* working forward pass.
* removed einops dependency
* nits
* format
* add alibi
* byebye head mask
* refactor attention
* nits.
* format
* fix nits.
* nuke ande updates
* nuke tokenizer test
* don't reshape query with kv heads
* added a bit of documentation.
* remove unneeded things
* nuke more stuff
* nit
* logits match - same generations
* rm unneeded methods
* 1 remaining failing CI test
* nit
* fix nits
* fix docs
* fix docs
* rm tokenizer
* fixup
* fixup
* fixup and fix tests
* fixed configuration object.
* use correct activation
* few minor fixes
* clarify docs a bit
* logits match à 1e-12
* skip and unskip a test
* added some slow tests.
* fix readme
* add more details
* Update docs/source/en/model_doc/mpt.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix configuration issues
* more fixes in config
* added more models
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* remove unneeded position ids
* fix some comments
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* revert suggestion
* mpt alibi + added batched generation
* Update src/transformers/models/mpt/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* remove init config
* Update src/transformers/models/mpt/configuration_mpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix nit
* add another slow test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fits in one line
* some refactor because make fixup doesn't pass
* add ft notebook
* update md
* correct doc path
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-07-25 14:32:40 +02:00
Harheem Kim
cb8abee511
🌐 [i18n-KO] Translated hpo_train.md
to Korean ( #24968 )
...
* dos: ko: hpo_train.mdx
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
2023-07-25 08:28:20 -04:00
Xuehai Pan
6bc61aa7af
Set TF32
flag for PyTorch cuDNN backend ( #25075 )
2023-07-25 08:04:48 -04:00
Injin Paek
5dba88b2d2
fix: add TOC anchor link ( #25066 )
2023-07-25 08:02:33 -04:00
Sangam Lee
ee1eb3b325
🌐 [i18n-KO] Translated perf_hardware.md
to Korean ( #24966 )
...
* docs: ko: perf_hardware.md
* feat: nmt draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>
* Fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: fix rendering error of perf_hardware.md
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>
2023-07-25 07:44:24 -04:00
Haewon Kim
f6fe1d5514
🌐 [i18n-KO] Translated <tf_xla>.md
to Korean ( #24904 )
...
* docs: ko: tf_xla.md
* feat: chatgpt draft
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: resolve suggestions
2023-07-25 07:43:22 -04:00
Arthur
c53a6eae74
[RWKV
] Add note in doc on RwkvStoppingCriteria
( #25055 )
...
* Add note in doc on `RwkvStoppingCriteria`
* give some breathing space to the code
2023-07-25 10:15:00 +02:00
seank021
c0d1c33022
🌐 [i18n-KO] Translated perf_train_cpu.md
to Korean ( #24911 )
...
* dos: ko: perf_train_cpu.md
* feat: chatgpt draft
* fix: manual edits
* fix: resolve suggestions
* fix: manual edits
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>
---------
Co-authored-by: Haewon Kim <ehdvkf02@naver.com>
2023-07-24 17:54:13 +02:00
Rinat
a03d13c83d
Pvt model ( #24720 )
...
* pull and push updates
* add docs
* fix modeling
* Add and run test
* make copies
* add task
* fix tests and fix small issues
* Checks on a Pull Request
* fix docs
* add desc pvt.md
2023-07-24 15:34:19 +01:00
Sunmin Cho
9d2b983ed0
🌐 [i18n-KO] Translated testing.md
to Korean ( #24900 )
...
* docs: ko: testing.md
* feat: draft
* fix: manual edits
* fix: edit ko/_toctree.yml
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: manual edits
* fix: resolve suggestions
2023-07-24 09:24:11 -04:00
Sangam Lee
383be1b763
🌐 [i18n-KO] Translated performance.md to Korean ( #24883 )
...
* dos: ko: performance.md
* feat: chatgpt draft
* fix: manual edits
* fix: manual edits
* Update docs/source/ko/performance.md
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
* Update docs/source/ko/performance.md
---------
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
2023-07-24 09:23:34 -04:00
Maria Khalusova
75317aefb3
[docs] Performance docs tidy up, part 1 ( #23963 )
...
* first pass at the single gpu doc
* overview: improved clarity and navigation
* WIP
* updated intro and deepspeed sections
* improved torch.compile section
* more improvements
* minor improvements
* make style
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* feedback addressed
* mdx -> md
* link fix
* feedback addressed
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-07-24 08:57:24 -04:00
Wonhyeong Seo
b257c46a07
🌐 [i18n-KO] Updated Korean serialization.md
( #24686 )
...
fix: update ko/serialization.md
* chatgpt draft
2023-07-21 19:23:59 -04:00
Sylvain Gugger
640e1b6c6f
Remove tokenizers from the doc table ( #24963 )
2023-07-21 09:41:36 -04:00
Sourab Mangrulkar
f4eb459ef2
fsdp fixes and enhancements ( #24980 )
...
* fix fsdp prepare to remove the warnings and fix excess memory usage
* Update training_args.py
* parity for FSDP+XLA
* Update trainer.py
2023-07-21 17:52:48 +05:30
Wonhyeong Seo
ec3dfe5e24
🌐 [i18n-KO] Fixed Korean and English quicktour.md
( #24664 )
...
* fix: english/korean quicktour.md
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
* fix: follow glossary
* 파인튜닝 -> 미세조정
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
2023-07-21 08:19:28 -04:00
Tom Aarsen
79444f370f
Deprecate unused OpenLlama architecture ( #24922 )
...
* Resolve typo in check_repo.py
* Specify encoding when opening modeling files
* Deprecate the OpenLlama architecture
* Add disclaimer pointing to Llama
I'm open to different wordings here
* Match the capitalisation of LLaMA
2023-07-20 07:03:24 -04:00
Jungnerd
7381987f90
🌐 [i18n-KO] Translatedtasks/document_question_answering.md
to Korean ( #24588 )
...
* docs: ko: `document_question_answering.md`
* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-07-20 06:19:36 -04:00
Travis Cline
3a43794dd6
Fix minor llama2.md model doc typos ( #24909 )
...
Update llama2.md
Fix typos in the llama2 model doc
2023-07-19 08:13:14 -04:00
Eliah Kagan
c035970212
Update tested versions in READMEs ( #24895 )
...
* Update supported Python and PyTorch versions in readme
* Update Python, etc. versions in non-English readmes
These were more out of date than in the English readme. This
updates all the versions the readmes claim the repository is tested
with to the same versions stated in the English readme.
Those versions are current at least in the case of the Python and
PyTorch versions (and less out of date for the others).
* Propagate trailing whitespace fix to model list
This runs "make fix-copies". The only change is the removal of
whitespace. No actual information or wording is changed.
* Update tested TensorFlow to 2.6 in all readmes
Per pinning in setup.py
Unlike Python and PyTorch, the minimum supported TensorFlow version
has not very recently changed, but old versions were listed in all
READMEs.
2023-07-19 07:17:34 -04:00