transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-16 11:08:23 +06:00

Author	SHA1	Message	Date
Akhil	093848d3cc	Added Telugu [te] translations (#26828 ) * Create index.md * Create _toctree.yml * Updated index.md in telugu * Update _toctree.yml * Create quicktour.md * Update quicktour.md * Create index.md * Update quicktour.md * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Delete docs/source/hi/index.md * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update build_documentation.yml Added telugu [te] * Update build_pr_documentation.yml Added Telugu [te] * Update _toctree.yml --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-20 15:27:55 -07:00
Diego Machado	9b1976697d	fix set_transform link docs (#26856 ) * fix set_transform link * Update docs/source/en/preprocessing.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * use doc-builder sintax --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-20 11:16:37 +02:00
Joao Gante	ae4fb84629	Generate: update basic llm tutorial (#26937 )	2023-10-19 16:53:28 +01:00
Mohamed Aymane Farhi	73dc23f786	Fix license (#26931 )	2023-10-19 15:36:41 +02:00
Patrick von Platen	734dd96e02	[Docs] Make sure important decode and generate method are nicely displayed in Whisper docs (#26927 ) better docstrings whisper	2023-10-19 13:01:47 +02:00
Pablo Montalvo	caa0ff0bf1	Add fuyu model (#26911 ) * initial commit * add processor, add fuyu naming * add draft processor * fix processor * remove dropout to fix loading of weights * add image processing fixes from Pedro * fix * fix processor * add basic processing fuyu test * add documentation and TODO * address comments, add tests, add doc * replace assert with torch asserts * add Mixins and fix tests * clean imports * add model tester, clean imports * fix embedding test * add updated tests from pre-release model * Processor: return input_ids used for inference * separate processing and model tests * relax test tolerance for embeddings * add test for logit comparison * make sure fuyu image processor is imported in the init * fix formattingh * more formatting issues * and more * fixups * remove some stuff * nits * update init * remove the fuyu file * Update integration test with release model * Update conversion script. The projection is not used, as confirmed by the authors. * improve geenration * Remove duplicate function * Trickle down patches to model call * processing fuyu updates * remove things * fix prepare_inputs_for_generation to fix generate() * remove model_input * update * add generation tests * nits * draft leverage automodel and autoconfig * nits * fix dtype patch * address comments, update READMEs and doc, include tests * add working processing test, remove refs to subsequences * add tests, remove Sequence classification * processing * update * update the conversion script * more processing cleanup * safe import * take out ModelTesterMixin for early release * more cl;eanup * more cleanup * more cleanup * and more * register a buffer * nits * add postprocessing of generate output * nits * updates * add one working test * fix test * make fixup works * fixup * Arthur's updates * nits * update * update * fix processor * update tests * passe more fixups * fix * nits * don't import torch * skip fuyu config for now * fixup done * fixup * update * oups * nits * Use input embeddings * no buffer * update * styling processing fuyu * fix test * update licence * protect torch import * fixup and update not doctested * kwargs should be passed * udpates * update the impofixuprts in the test * protect import * protecting imports * protect imports in type checking * add testing decorators * protect top level import structure * fix typo * fix check init * move requires_backend to functions * Imports * Protect types --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: ArthurZucker <arthur.zucker@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Lysandre <lysandre@huggingface.co>	2023-10-18 15:24:11 -07:00
Yeyang	732d2a8aac	[i18n-ZH] Translated fast_tokenizers.md to Chinese (#26910 ) docs: translate fast_tokenizers into Chinese	2023-10-18 10:45:41 -07:00
Rockerz	eec5a3a8d8	Refactor code part in documentation translated to japanese (#26900 ) Refactor code in documentation	2023-10-18 10:35:58 -07:00
Merve Noyan	280c757f6c	Knowledge distillation for vision guide (#25619 ) * Knowledge distillation for vision guide * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Iterated on Rafael's comments * Added to toctree * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Addressed comments * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update knowledge_distillation_for_image_classification.md * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Address comments * Update knowledge_distillation_for_image_classification.md * Explain KL Div --------- Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Maria Khalusova <kafooster@gmail.com>	2023-10-18 04:42:32 -07:00
Rockerz	b002353dca	Translating `en/internal` folder docs to Japanese 🇯🇵 (#26747 ) * Add translation to fitst 3 file of internal folder * Update Toctree.md and add files * Update docs/source/ja/internal/generation_utils Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Rename generation_utils file * rename pipelines_utils.md * Change file names --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-17 15:01:21 -07:00
Bingchen Zhao	46092f763d	Fixed a typo in mistral.md (#26879 ) Fix a typo in mistral.md	2023-10-17 14:06:37 -07:00
Susheel Thapa	b3961f7291	Chore: Typo fixed in multiple files of docs/source/en/model_doc (#26833 ) * Chore: Typo fixed in multiple files of docs/source/en/model_doc * Update docs/source/en/model_doc/nllb-moe.md Co-authored-by: Aryan V S <avs050602@gmail.com> --------- Co-authored-by: Aryan V S <avs050602@gmail.com>	2023-10-17 07:10:08 +02:00
Patrick von Platen	805d5d2111	Add LLM doc (#26058 ) * [WIP] Add LLM doc * rename * latex * latex * Fix more latex * [LLMs] Getting most out of LLMS * improve * try again * Apply suggestions from code review Co-authored-by: Maria Khalusova <kafooster@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/en/llm_tutorial_optimization.md * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Apply suggestions from code review * move file --------- Co-authored-by: Maria Khalusova <kafooster@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2023-10-16 16:09:50 +02:00
NielsRogge	570b3f9cdd	[OWL-ViT, OWLv2] Add resources (#26822 ) Add resources	2023-10-16 15:47:44 +02:00
Merve Noyan	5d997f227c	Image-to-Image Task Guide (#26595 ) * img2img task guide * Update year * Add to toctree * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Addressed comments * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Addressed comments --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Maria Khalusova <kafooster@gmail.com>	2023-10-16 15:12:03 +02:00
Shreyas S	0dd58d96a0	Fixed typos (#26810 ) Update feature_extractor.md	2023-10-16 09:52:29 +02:00
Injin Paek	d6e5b02ef3	Add CLIP resources (#26534 ) * docs: feat: model resources for CLIP * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: resolve suggestion * fix: resolve suggestion * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: resolve suggestion * fix: resolve suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-13 11:12:59 -07:00
NielsRogge	762af3e3c7	Add OWLv2, bis (#26668 ) * First draft * Update conversion script * Update copied from statements * Fix style * Add copied from to config * Add copied from to processor * Run make fixup * Add docstring * Update docstrings * Add method * Improve docstrings * Fix docstrings * Improve docstrings * Remove onnx * Add flag * Address comments * Add copied from to model tests * Add flag to conversion script * Add code snippet * Address more comments * Address comment * Improve conversion script * More improvements * Add expected objectness logits * Skip test * Improve conversion script * Extend conversion script * Convert large checkpoint * Fix doc tests * Convert all checkpoints, update integration tests * Add checkpoint_path arg * Fix repo_id	2023-10-13 16:41:24 +02:00
Wonhyeong Seo	7790943c91	🌐 [i18n-KO] Translated `big_models.md` to Korean (#26245 ) * docs: ko: big_models.md * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-Authored-By: bolizabeth <68984363+bolizabeth@users.noreply.github.com> --------- Co-authored-by: bolizabeth <68984363+bolizabeth@users.noreply.github.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-12 15:00:12 -07:00
Heinz-Alexander Fuetterer	883ed4b344	chore: fix typos (#26756 )	2023-10-12 18:00:27 +02:00
Maria Khalusova	0ebee8b933	[docs] LLM prompting guide (#26274 ) * llm prompting guide * updated code examples * an attempt to fix the code example tests * set seed in examples * added a doctest comment * added einops to the doc_test_job * string formatting * string formatting, again * added the toc to slow_documentation_tests.txt * minor list fix * string formatting + pipe renamed * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * replaced max_length with max_new_tokens and updated the outputs to match * minor formatting fix * removed einops from circleci config * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * removed einops and trust_remote_code parameter --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-10-12 08:48:01 -04:00
Lysandre Debut	ab0ddc99e8	Warnings controlled by logger level (#26527 ) * Logger level Co-authored-by: Sahil Bhosale <sahilbhosale63@live.com> Co-authored-by: Adithya4720 <hegdeadithyak@gmail.com> Co-authored-by: Sachin Singh <sachinishu02@gmail.com> Co-authored-by: Riya Dhanduke <113622644+riiyaa24@users.noreply.github.com> * More comprehensive documentation --------- Co-authored-by: Sahil Bhosale <sahilbhosale63@live.com> Co-authored-by: Adithya4720 <hegdeadithyak@gmail.com> Co-authored-by: Sachin Singh <sachinishu02@gmail.com> Co-authored-by: Riya Dhanduke <113622644+riiyaa24@users.noreply.github.com>	2023-10-12 10:48:38 +02:00
TERRY LEE	e1cec43415	Translated the accelerate.md file of the documentation to Chinese (#26161 ) * translate accelerate page * Update docs/source/zh/accelerate.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-11 10:54:22 -07:00
Rockerz	9b7668c03a	add japanese documentation (#26138 ) * udpaet * update * Update docs/source/ja/autoclass_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * add codes workflows/build_pr_documentation.yml * Create preprocessing.md * added traning.md * Create Model_sharing.md * add quicktour.md * new * ll * Create benchmark.md * Create Tensorflow_model * add * add community.md * add create_a_model * create custom_model.md * create_custom_tools.md * create fast_tokenizers.md * create * add * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * md * add * commit * add * h * Update docs/source/ja/peft.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/ja/_toctree.yml Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/ja/_toctree.yml Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Suggested Update * add perf_train_gpu_one.md * added perf based MD files * Modify toctree.yml and Add transmartion to md codes * Add `serialization.md` and edit `_toctree.yml` * add task summary and tasks explained * Add and Modify files starting from T * Add testing.md * Create main_classes files * delete main_classes folder * Add toctree.yml * Update llm_tutorail.md * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update misspelled filenames * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/_toctree.yml * Update docs/source/ja/_toctree.yml * missplled file names inmrpovements * Update _toctree.yml * close tip block * close another tip block * Update docs/source/ja/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/pipeline_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/pipeline_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/preprocessing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/add_new_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/testing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/task_summary.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/tasks_explained.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update glossary.md * Update docs/source/ja/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/llm_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/create_a_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/torchscript.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/benchmarks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/add_new_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update perf_torch_compile.md * Update Year to default in en documentation * Final Update --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2023-10-11 10:26:37 -07:00
Ben Gubler	9f40639292	Update docs to explain disabling callbacks using report_to (#26155 ) * feat: update callback doc to explain disabling callbacks using report_to * docs: update report_to docstring	2023-10-11 07:50:23 -04:00
Tuowei Wang	a9862a0f49	Fix Typo: table in deepspeed.md (#26705 )	2023-10-10 11:50:10 +02:00
tom white	c7f01beece	fix typos in idefics.md (#26648 ) * fix typos in idefics.md Two typos found in reviewing this documentation. 1) max_new_tokens=4, is not sufficient to generate "Vegetables" as indicated - you will get only "Veget". (incidentally - some mention of how to select this value might be useful as it seems to change in each example) 2) inputs = processor(prompts, return_tensors="pt").to(device) as inputs need to be on the same device (as they are in all other examples on the page) * Update idefics.md Change device to cuda explicitly to match other examples	2023-10-09 12:18:02 +02:00
NielsRogge	2629c8f36a	[DINOv2] Convert more checkpoints (#26177 ) * Convert checkpoints * Update doc test * Address comment	2023-10-09 09:58:04 +02:00
Jabasukuriputo Wang	897a826d83	docs(zh): review and punctuation & space fix (#26627 )	2023-10-06 09:24:28 -07:00
Matt	ea52ed9dc8	Update chat template docs with more tips on writing a template (#26625 )	2023-10-06 12:04:40 +01:00
Maria Khalusova	18fbeec824	[docs] Update to scripts building index.md (#26546 ) * build the table in index.md with links to the model_doc * removed list generation on index.md * fixed missing models * make style	2023-10-05 10:20:41 -04:00
Yeyang	43bfd093e1	add zh translation for installation (#26084 ) * translate installation to zh * fix translation typo	2023-10-04 09:39:02 -07:00
Galland	f9ab07f920	Update mistral.md to update 404 link (#26590 )	2023-10-04 17:48:11 +02:00
Matt	8b03615b7b	Fix embarrassing typo in the doc chat template! (#26596 )	2023-10-04 16:28:53 +01:00
Matt	8b46c5bcfc	Add add_generation_prompt argument to apply_chat_template (#26573 ) * Add add_generation_prompt argument to apply_chat_template * Add add_generation_prompt argument to apply_chat_template and update default templates * Fix typo * Add generation prompts section to chat templating guide * Add generation prompts section to chat templating guide * Minor style fix	2023-10-04 15:15:29 +01:00
Sylvain Gugger	03af4c42a6	Docstring check (#26052 ) * Fix number of minimal calls to the Hub with peft integration * Alternate design * And this way? * Revert * Nits to fix * Add util * Print when changes are made * Add list to ignore * Add more rules * Manual fixes * deal with kwargs * deal with enum defaults * avoid many digits for floats * Manual fixes * Fix regex * Fix regex * Auto fix * Style * Apply script * Add ignored list * Add check that templates are filled * Adding to CI checks * Add back semi-fix * Ignore more objects * More auto-fixes * Ignore missing objects * Remove temp semi-fix * Fixes * Update src/transformers/models/pvt/configuration_pvt.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update utils/check_docstrings.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Deal with float defaults * Fix small defaults * Address review comment * Treat * Post-rebase cleanup * Address review comment * Update src/transformers/models/deprecated/mctct/configuration_mctct.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comment --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2023-10-04 15:13:37 +02:00
김준재_T3056	2f3ea08a07	docs: feat: add clip notebook resources from OSSCA community (#26505 )	2023-10-03 11:20:22 -07:00
Jungnerd	2c7b26f508	🌐 [i18n-KO] Translated `semantic_segmentation.md` to Korean (#26515 ) * docs: ko: sementic_segmentation.md * feat: manual draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * fix: resolve suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: edit the title --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-03 10:25:50 -07:00
Younes Belkada	ae9a344cce	[`Mistral`] Add Flash Attention-2 support for `mistral` (#26464 ) * add FA-2 support for mistral * fixup * add sliding windows * fixing few nits * v1 slicing cache - logits do not match * add comment * fix bugs * more mem efficient * add warning once * add warning once * oops * fixup * more comments * copy * add safety checker * fixup * Update src/transformers/models/mistral/modeling_mistral.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * copied from * up * raise when padding side is right * fixup * add doc + few minor changes * fixup --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-03 13:44:46 +02:00
Florian Zimmermeister	9ed538f2e6	[i18n-DE] contribute chapter (#26481 ) * start working on next chapter * finish testing * Update docs/source/de/testing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/testing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/testing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-02 09:56:40 -07:00
Wonhyeong Seo	1470f731b6	🌐 [i18n-KO] Translated `tokenizer_summary.md` to Korean (#26243 ) * docs: ko: toknenizer_summary.md Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Juntae <79131091+sronger@users.noreply.github.com> Co-Authored-By: Injin Paek <71638597+eenzeenee@users.noreply.github.com> * update review * fix: resolve suggestions Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: HanNayeoniee <nayeon2.han@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com> Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-10-02 09:55:33 -07:00
HelgeS	7d6627d0d9	Fix broken link to video classification task (#26487 )	2023-10-02 11:19:11 +02:00
Sanchit Gandhi	0b192de1f3	[ASR Pipe] Improve docs and error messages (#26476 ) * improve docs/errors * why whisper * Update docs/source/en/pipeline_tutorial.md Co-authored-by: Lysandre Debut <hi@lysand.re> * specify pt only --------- Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-09-29 18:32:37 +01:00
Maria Khalusova	14170b784b	[docs] navigation improvement between text gen pipelines and text gen params (#26477 ) * navigation improvement between text generation pipelines and text generation docs * make style	2023-09-29 09:43:39 +02:00
Steven Liu	7bb1c0c147	[docs] Update offline mode docs (#26478 ) update	2023-09-29 09:42:21 +02:00
Wonhyeong Seo	ab37b801b1	🌐 [i18n-KO] Translated `perf_train_gpu_many.md` to Korean (#26244 ) * dos: ko: perf_train_gpu_many.mdx * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions Change description Follow the glossary Fix discrepancies Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: 이서정 <97655267+sjlee-wise@users.noreply.github.com> Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Hyunho <105839613+hyunhp@users.noreply.github.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-27 13:51:15 -07:00
Wonhyeong Seo	a0922a538b	🌐 [i18n-KO] Translated `debugging.md` to Korean (#26246 ) * docs:ko:Debugging.md * feat: chatgpt draft * fix: resolve suggestions Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Jang KyuJin <106062329+kj021@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-27 13:47:44 -07:00
Florian Zimmermeister	ef81759e31	[i18n-DE] Complete first toc chapter (#26311 ) * initial * toctree * add tf model * run scripts * peft * llm and agents * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/run_scripts.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/run_scripts.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-27 11:33:05 -07:00
Chris Bamford	72958fcd3c	[Mistral] Mistral-7B-v0.1 support (#26447 ) * [Mistral] Mistral-7B-v0.1 support * fixing names * slightly longer test * fixups * not_doctested * wrongly formatted references * make fixuped --------- Co-authored-by: Timothee Lacroix <t@eugen.ai> Co-authored-by: timlacroix <t@mistral.ai>	2023-09-27 18:30:46 +02:00
Nour Eddine ZEKAOUI	777f2243f5	Update semantic_segmentation.md (#26419 )	2023-09-27 11:51:44 +02:00
titi	a8531f3bfd	Deleted duplicate sentence (#26394 )	2023-09-26 10:11:28 +02:00
NielsRogge	a09130feee	[ViTMatte] Add resources (#26317 ) Add resource	2023-09-26 07:06:38 +02:00
NielsRogge	ace74d16bd	Add Nougat (#25942 ) * Add conversion script * Add NougatImageProcessor * Add crop margin * More improvements * Add docs, READMEs * Remove print statements * Include model_max_length * Add NougatTokenizerFast * Fix imports * Improve postprocessing * Improve image processor * Fix image processor * Improve normalize method * More improvements * More improvements * Add processor, improve docs * Simplify fast tokenizer * Remove test file * Fix docstrings * Use NougatProcessor in conversion script * Add is_levensthein_available * Add tokenizer tests * More improvements * Use numpy instead of opencv * Add is_cv2_available * Fix cv2_available * Add is_nltk_available * Add image processor tests, improve crop_margin * Add integration tests * Improve integration test * Use do_rescale instead of hacks, thanks Amy * Remove random_padding * Address comments * Address more comments * Add import * Address more comments * Address more comments * Address comment * Address comment * Set max_model_input_sizes * Add tests * Add requires_backends * Add Nougat to exotic tests * Use to_pil_image * Address comment regarding nltk * Add NLTK * Improve variable names, integration test * Add test * refactor, document, and test regexes * remove named capture groups, add comments * format * add non-markdown fixed tokenization * format * correct flakyness of args parse * add regex comments * test functionalities for crop_image, align long axis and expected output * add regex tests * remove cv2 dependency * test crop_margin equality between cv2 and python * refactor table regexes to markdown add newline * change print to log, improve doc * fix high count tables correction * address PR comments: naming, linting, asserts * Address comments * Add copied from * Update conversion script * Update conversion script to convert both small and base versions * Add inference example * Add more info * Fix style * Add require annotators to test * Define all keyword arguments explicitly * Move cv2 annotator * Add tokenizer init method * Transfer checkpoints * Add reference to Donut * Address comments * Skip test * Remove cv2 method * Add copied from statements * Use cached_property * Fix docstring * Add file to not doctested --------- Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>	2023-09-26 07:06:04 +02:00
Gabriel Yang	5e09af2acd	🌐 [i18n-KO] Translated `audio_classification.mdx` to Korean (#26200 ) * 🌐 [i18n-KO] Translated to Korean * update translation * fix some sentence editing and fixing punctuation * Update docs/source/ko/_toctree.yml Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> * Apply suggestions from code review Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-09-25 10:24:45 -07:00
Maria Khalusova	546e7679e7	[docs] removed MaskFormerSwin and TimmBackbone from the table on index.md (#26347 ) removed MaskFormerSwin and TimmBackbone from the table	2023-09-25 09:41:59 -04:00
Nino Risteski	6accd5effb	Update add_new_model.md (#26365 ) fixed typos	2023-09-25 12:58:11 +02:00
LeviVasconcelos	576cd45a57	Add image to image pipeline (#25393 ) * Add image to image pipeline Add image to image pipeline * remove swin2sr from tf auto * make ImageToImage importable * make style make style make style make style * remove tf support * remove nonused imports * fix postprocessing * add important comments; add unit tests * add documentation * remove support for TF * make fixup * fix typehint Image.Image * fix documentation code * address review request; fix unittest type checking * address review request; fix unittest type checking * make fixup * address reviews * Update src/transformers/pipelines/image_to_image.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * enhance docs * make style * make style * improve docetest time * improve docetest time * Update tests/pipelines/test_pipelines_image_to_image.py Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * Update tests/pipelines/test_pipelines_image_to_image.py Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> * make fixup * undo faulty merge * undo faulty merge * add image-to-image to test pipeline mixin * Update src/transformers/pipelines/image_to_image.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update tests/pipelines/test_pipelines_image_to_image.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * improve docs --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-22 19:53:55 +03:00
Younes Belkada	368a58e61c	[`core` ] Integrate Flash attention 2 in most used models (#25598 ) * v1 * oops * working v1 * fixup * add some TODOs * fixup * padding support + try with module replacement * nit * alternative design * oops * add `use_cache` support for llama * v1 falcon * nit * a bit of refactor * nit * nits nits * add v1 padding support falcon (even though it seemed to work before) * nit * falcon works * fixup * v1 tests * nit * fix generation llama flash * update tests * fix tests + nits * fix copies * fix nit * test- padding mask * stype * add more mem efficient support * Update src/transformers/modeling_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fixup * nit * fixup * remove it from config when saving * fixup * revert docstring * add more checks * use values * oops * new version * fixup * add same trick for falcon * nit * add another test * change tests * fix issues with GC and also falcon * fixup * oops * Update src/transformers/models/falcon/modeling_falcon.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add init_rope * updates * fix copies * fixup * fixup * more clarification * fixup * right padding tests * add docs * add FA in docker image * more clarifications * add some figures * add todo * rectify comment * Change to FA2 * Update docs/source/en/perf_infer_gpu_one.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * split in two lines * change test name * add more tests * some clean up * remove `rearrange` deps * add more docs * revert changes on dockerfile * Revert "revert changes on dockerfile" This reverts commit `8d72a66b4b`. * revert changes on dockerfile * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * address some comments * docs * use inheritance * Update src/transformers/testing_utils.py Co-authored-by: Lysandre Debut <hi@lysand.re> * fixup * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/modeling_utils.py * final comments * clean up * style * add cast + warning for PEFT models * fixup --------- Co-authored-by: Felix Marty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-09-22 17:42:10 +02:00
Maria Khalusova	dcbfd93d7a	[doc] fixed indices in obj detection example (#26343 ) fixed indexes in obj detection example	2023-09-22 10:29:27 -04:00
NielsRogge	7d6354e047	Add ViTMatte (#25843 ) * First draft * Simplify image processor * Fix rebase * Address comments * Address more comments * Address more comments * Address more comments * Address more comments * Improve pad_image * Add tests * Update integration test * Fix image processor tests * Fix model tests * Convert checkpoints * Fix doc tests * Remove file * Apply suggestions * Address comments * Fix typing hint * Add batch_norm_eps * Address comments * Fix style	2023-09-19 10:56:10 -03:00
Aleksandar Ivanovski	373d0d9985	[docs] Fix model reference in zero shot image classification example (#26206 )	2023-09-19 00:45:12 +02:00
Nino Risteski	500dfb5b03	Update add_new_pipeline.md (#26197 ) fixed a few typos	2023-09-19 00:41:16 +02:00
SeongWooChoi	42791a5753	🌐 [i18n-KO] Translated `whisper.md` to Korean (#26002 ) * docs: ko-whisper.md * fix: chatgpt draft * feat: manual edits * Feat: manual edits * fix: resolve suggestions Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> --------- Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-09-18 22:12:41 +02:00
Yih-Dar	f02b915ba2	Remove `utils/documentation_tests.txt` (#26213 ) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-18 13:33:01 +02:00
Maria Khalusova	8b13471494	[docs] IDEFICS guide and task guides restructure (#26035 ) * initial commit for the IDEFICS task guide * conversational example * updated TOC * fixed typos * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * addressed feedback * bad_words_ids * Apply suggestions from code review Co-authored-by: Victor SANH <victorsanh@gmail.com> * rank classification note * feedback addressed --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Victor SANH <victorsanh@gmail.com>	2023-09-15 12:15:07 -04:00
Matt	2518e36810	Tweaks to Chat Templates docs (#26168 ) * Put tokenizer methods in the right alphabetical order in the docs * Quick tweak to ConversationalPipeline * Typo fixes in the developer doc * make fixup	2023-09-15 12:50:57 +01:00
Jinho Park	17fdd35481	Add BROS (#23190 ) * add Bros boilerplate * copy and pasted modeling_bros.py from official Bros repo * update copyright of bros files * copy tokenization_bros.py from official repo and update import path * copy tokenization_bros_fast.py from official repo and update import path * copy configuration_bros.py from official repo and update import path * remove trailing period in copyright line * copy and paste bros/__init__.py from official repo * save formatting * remove unused unnecessary pe_type argument - using only crel type * resolve import issue * remove unused model classes * remove unnecessary tests * remove unused classes * fix original code's bug - layer_module's argument order * clean up modeling auto * add bbox to prepare_config_and_inputs * set temporary value to hidden_size (32 is too low because of the of the Bros' positional embedding) * remove decoder test, update create_and_check* input arguemnts * add missing variable to model tests * do make fixup * update bros.mdx * add boilerate plate for no_head inference test * update BROS_PRETRAINED_MODEL_ARCHIVE_LIST (add naver-clova-ocr prefix) * add prepare_bros_batch_inputs function * update modeling_common to add bbox inputs in Bros Model Test * remove unnecessary model inference * add test case * add model_doc * add test case for token_classification * apply fixup * update modeling code * update BrosForTokenClassification loss calculation logic * revert logits preprocessing logic to make sure logits have original shape * - update class name * - add BrosSpadeOutput - update BrosConfig arguments * add boilerate plate for no_head inference test * add prepare_bros_batch_inputs function * add test case * add test case for token_classification * update modeling code * update BrosForTokenClassification loss calculation logic * revert logits preprocessing logic to make sure logits have original shape * apply masking on the fly * add BrosSpadeForTokenLinking * update class name put docstring to the beginning of the file * separate the logits calculation logic and loss calculation logic * update logic for loss calculation so that logits shape doesn't change when return * update typo * update prepare_config_and_inputs * update dummy node initialization * update last_hidden_states getting logic to consider when return_dict is False * update box first token mask param * bugfix: remove random attention mask generation * update keys to ignore on load missing * run make style and quality * apply make style and quality of other codes * update box_first_token_mask to bool type * update index.md * apply make style and quality * apply make fix-copies * pass check_repo * update bros model doc * docstring bugfix fix * add checkpoint for doc, tokenizer for doc * Update README.md * Update docs/source/en/model_doc/bros.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update bros.md * Update src/transformers/__init__.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/bros.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * apply suggestions from code review * apply suggestions from code review * revert test_processor_markuplm.py * Update test_processor_markuplm.py * apply suggestions from code review * apply suggestions from code review * apply suggestions from code review * update BrosSpadeELForTokenClassification head name to entity linker * add doc string for config params * update class, var names to more explicit and apply suggestions from code review * remove unnecessary keys to ignore * update relation extractor to be initialized with config * add bros processor * apply make style and quality * update bros.md * remove bros tokenizer, add bros processor that wraps bert tokenizer * revert change * apply make fix-copies * update processor code, update itc -> initial token, stc -> subsequent token * add type hint * remove unnecessary condition branches in embedding forward * fix auto tokenizer fail * update docstring for each classes * update bbox input dimension as standard 2 points and convert them to 4 points in forward pass * update bros docs * apply suggestions from code review : update Bros -> BROS in bros.md * 1. box prefix var -> bbox 2. update variable names to be more explicit * replace einsum with torch matmul * apply style and quality * remove unused argument * remove unused arguments * update docstrings * apply suggestions from code review: add BrosBboxEmbeddings, replace einsum with classical matrix operations * revert einsum update * update bros processor * apply suggestions from code review * add conversion script for bros * Apply suggestions from code review * fix readme * apply fix-copies --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-14 18:02:37 +01:00
Matt	866df66fe4	Overhaul Conversation class and prompt templating (#25323 ) * First commit while I figure this out * make fixup * Remove unused method * Store prompt attrib * Fix prompt argument for tests * Make same changes in fast tokenizer * Remove global prompts from fast tokenizer too * stash commit * stash commit * Migrate PromptConfig to its True Final Location * Replace Conversation entirely with the new class * Import/dependency fixes * Import/dependency fixes * Change format for lots of default prompts * More default prompt fixups * Revert llama old methods so we can compare * Fix some default configs * Fix some default configs * Fix misspelled kwarg * Fixes for Blenderbot * make fixup * little rebase cleanup * Add basic documentation * Quick doc fix * Truncate docstring for now * Add handling for the case when messages is a single string * Quick llama merges * Update conversational pipeline and tests * Add a couple of legacy properties for backward compatibility * More legacy handling * Add docstring for build_conversation_input_ids * Restructure PromptConfig * Let's start T E M P L A T I N G * Refactor all default configs to use templates instead * Revert changes to the special token properties since we don't need them anymore * More class templates * Make the sandbox even sandier * Everything replaced with pure templating * Remove docs for PromptConfig * Add testing and optional requirement boilerplate * Fix imports and make fixup * Fix LLaMA tests and add Conversation docstring * Finally get LLaMA working with the template system * Finally get LLaMA working with the template system * make fixup * make fixup * fmt-off for the long lists of test tokens * Rename method to apply_chat_template for now * Start on documentation * Make chat_template a property that reads through to the default if it's not set * Expand docs * Expand chat templating doc some more * trim/lstrip blocks by default and update doc * Few doc tweaks * rebase cleanup * Clarify docstring * rebase cleanup * rebase cleanup * make fixup * Quick doc edit * Reformat the standard template to match ChatML * Re-add PEFT check * Update docs/source/en/chat_templating.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add apply_chat_template to the tokenizer doc * make fixup * Add doc links * Fix chat links * Fix chat links * Explain system messages in the doc * Add chat template test * Proper save-loading for chat template attribute * Add test skips for layout models * Remove _build_conversation_input_ids, add default_chat_template to code_llama * Make sure all LLaMA models are using the latest template * Remove default_system_prompt block in code_llama because it has no default prompt * Update ConversationPipeline preprocess * Add correct #Copied from links to the default_chat_templates * Remove unneeded type checking line * Add a dummy mark_processsed method * Reorganize Conversation to have *deprecated_kwargs Update chat_templating.md * Quick fix to LLAMA tests * Small doc tweaks * Add proper docstrings and "copied from" statements to all default chat templates * Merge use_default_system_prompt support for code_llama too * Improve clarity around self.chat_template * Docstring fix * Fix blenderbot default template * More doctest fix * Break out some tokenizer kwargs * Update doc to explain default templates * Quick tweaks to tokenizer args * Cleanups for tokenizer args * Add note about cacheing * Quick tweak to the chat-templating doc * Update the LLaMA template with error checking and correct system message embedding * make fixup * make fixup * add requires_jinja * Cleanup to expected output formatting * Add cacheing * Fix typo in llama default template * Update LLaMA tests * Update documentation * Improved legacy handling in the Conversation class * Update Jinja template with proper error handling * Quick bugfix * Proper exception raising * Change cacheing behaviour so it doesn't try to pickle an entire Jinja env * make fixup * rebase cleanup --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-14 15:10:34 +01:00
Maria Khalusova	9709ab116c	[docs] last hidden state vs hidden_states[-1] (#26142 ) * last hidden state clarification * feedback addressed	2023-09-13 14:35:42 -04:00
김준재_T3056	a6ae2bd059	docs: feat: add llama2 notebook resources from OSSCA community (#26076 )	2023-09-13 08:27:41 -07:00
Wang, Yi	8f609ab9e0	enable optuna multi-objectives feature (#25969 ) * enable optuna multi-objectives feature Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * update hpo doc * update docstring Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * extend direction to List[str] type Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * Update src/transformers/integrations/integration_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-12 18:01:22 +01:00
MinJae Kang	92f2fbad50	🌐 [i18n-KO] Translated `contributing.md` to Korean (#25877 ) * docs: ko-contributing.md * feat: chatGPT draft * feat: manual edits * feat: change linked document * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: Haewon Kim <ehdvkf02@naver.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> * fix: resolve suggestion * fix: resolve suggestion * feat: delete file to resolve error --------- Co-authored-by: Haewon Kim <ehdvkf02@naver.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>	2023-09-12 08:35:29 -07:00
Maria Khalusova	1fe7ce48f1	[docs] Updates to TTS task guide with regards to the new TTS pipeline (#26095 ) * tts guide updates with a pipeline * Apply suggestions from code review Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com> * Update docs/source/en/tasks/text-to-speech.md Co-authored-by: Vaibhav Srivastav <vaibhavs10@gmail.com> --------- Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com> Co-authored-by: Vaibhav Srivastav <vaibhavs10@gmail.com>	2023-09-12 11:29:06 -04:00
MinJae Kang	be9438ed43	🌐 [i18n-KO] Translated `llama2.md` to Korean (#26047 ) * docs: ko-llama2.md * feat: chatGPT draft and manul edits * feat: added inline TOC * fix: inline TOC * fix: resolve suggestions Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> * fix: resolve suggestion Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> --------- Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-09-12 08:04:26 -07:00
Joao Gante	3319eb5490	Generate: legacy mode is only triggered when `generation_config` is untouched (#25962 )	2023-09-12 12:08:17 +01:00
Arthur	9cccb3a838	[`Persimmon`] Add support for persimmon (#26042 ) * intiial commit * updates * nits * update conversion script * update conversion script * use path to load * add tips etc * some modeling logic * modeling update * more nits * nits * normal layer norm * update config and doc * nits * update doc remove unused * update * fix inits and stuff * fixup * revert wrong changes * updates * more nits * add default config values to the configuration file * fixup happy * update * 2 tests left * update readmes * more nits * slow test and more documentation * update readme * fix licences * styling * use fast if possible when saving tokenizer * remove todo * remove tokenization tests * small last nits * Apply suggestions from code review Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * nits to skip the timout doctest * fix integration test * fix test * update eos token * update to allow fast tokenization * styling * fix codeLlama as well for the update post processor * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add more copied from statements * update * doc passes doctest * remove `# final layer norm?` * change docstring prompot * update * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * don't doctest the conversion script as it requires more packages * don't init a model in the config * oups * fix doctest --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-09-12 11:33:27 +02:00
Phuc Van Phan	9cebae64ad	docs: update link huggingface map (#26077 )	2023-09-11 12:57:04 +01:00
Harheem Kim	d53606031f	🌐 [i18n-KO] Translated `llama.md` to Korean (#26044 ) * docs: ko-llama.md * fix: chatgpt draft * feat: manual edits * fix: resolve suggestions	2023-09-08 12:38:41 -07:00
Muskan Kumar	02c4a77f57	Added HerBERT to README.md (#26020 ) * Added HerBERT to README.md * Update README.md to contain HerBERT (#26016) * Resolved #26016: Updated READMEs and index.md to contain Herbert Updated READMEs and ran make fix-copies	2023-09-07 19:51:45 +01:00
Harheem Kim	fa522d8d7b	🌐[i18n-KO] Translated `llm_tutorial.md` to Korean (#25791 ) * docs: ko: llm_tutoroal.md * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions * fix: resolve suggestions	2023-09-06 07:40:03 -07:00
zspo	3e203f92be	Fix small typo README.md (#25934 ) * fix some samll bugs in readme * Update docs/README.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-06 14:07:29 +01:00
Injin Paek	6206f599e1	Add LLaMA resources (#25859 ) * docs: feat: model resources for llama * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-09-05 10:50:08 -07:00
raghavanone	1110b565d6	Add TFDebertaV2ForMultipleChoice (#25932 ) * Add TFDebertaV2ForMultipleChoice * Import newer model in main init * Fix import issues * Fix copies * Add doc * Fix tests * Fix copies * Fix docstring	2023-09-05 17:13:06 +01:00
Julien Chaumond	6316ce8d27	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
Susnato Dhar	52a46dc57b	Add `Pop2Piano` space demo. (#25975 ) Update pop2piano.md	2023-09-05 11:07:02 +01:00
Matt	034bc5d26a	Add proper Falcon docs and conversion script (#25954 ) * Add proper Falcon docs and conversion script * Autodetect the decoder architecture instead of using an arg * Update docs now that we can autodetect * Fix doc error * Add doc to toctree * Quick doc update	2023-09-04 17:18:34 +01:00
Sanchit Gandhi	f435003e0c	[MMS] Fix pip install in docs (#25949 )	2023-09-04 11:53:41 +01:00
Nino Risteski	d4407a3bd1	Update autoclass_tutorial.md (#25929 ) fixed typos	2023-09-04 11:16:49 +01:00
Nino Risteski	51e1e8120b	Update community.md (#25928 ) fixed a few typos	2023-09-04 11:16:34 +01:00
omahs	0f0e1a2c2b	Fix typos (#25936 ) * fix typo * fix typo * fix typo * fix typos * fix typos * fix typo * fix typo * fix typo * fix typos * fix typo * fix typo * fix typo * fix typos * fix typos	2023-09-04 11:15:12 +01:00
Nino Risteski	0afa5071bd	Update model_memory_anatomy.md (#25896 ) typo fixes	2023-09-01 12:27:01 -07:00
Arthur	a4dd53d88e	Update-llama-code (#25826 ) * some bug fixes * updates * Update code_llama.md Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> * Add co author Co-authored-by: pcuenca <pedro@latenitesoft.com> * add a test * fixup * nits * some updates * fix-coies * adress comments * nits * nits * fix docsting * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update * add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage --------- Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: pcuenca <pedro@latenitesoft.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-01 20:40:40 +02:00
Sanchit Gandhi	1fa2d89a9b	[MMS] Update docs with HF TTS implementation (#25907 ) * [MMS] Update docs with HF TTS implementation * Update docs/source/en/model_doc/mms.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add uromanise to docs --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-09-01 16:50:59 +01:00
Omar Sanseviero	69c5b8f186	Remove broken docs for MusicGen (#25905 ) Update musicgen.md	2023-09-01 15:26:42 +01:00
Matthijs Hollemans	4ece3b9433	add VITS model (#24085 ) * add VITS model * let's vits * finish TextEncoder (mostly) * rename VITS to Vits * add StochasticDurationPredictor * ads flow model * add generator * correctly set vocab size * add tokenizer * remove processor & feature extractor * add PosteriorEncoder * add missing weights to SDP * also convert LJSpeech and VCTK checkpoints * add training stuff in forward * add placeholder tests for tokenizer * add placeholder tests for model * starting cleanup * let the great renaming begin! * use config * global_conditioning * more cleaning * renaming variables * more renaming * more renaming * it never ends * reticulating the splines * more renaming * HiFi-GAN * doc strings for main model * fixup * fix-copies * don't make it a PreTrainedModel * fixup * rename config options * remove training logic from forward pass * simplify relative position * use actual checkpoint * style * PR review fixes * more review changes * fixup * more unit tests * fixup * fix doc test * add integration test * improve tokenizer tests * add tokenizer integration test * fix tests on GPU (gave OOM) * conversion script can handle repos from hub * add conversion script for all MMS-TTS checkpoints * automatically create a README for the converted checkpoint * small changes to config * push README to hub * only show uroman note for checkpoints that need it * remove conversion script because code formatting breaks the readme * make WaveNet layers configurable * rename variables * simplifying the math * output attentions and hidden states * remove VitsFlip in flow model * also got rid of the other flip * fix tests * rename more variables * rename tokenizer, add phonemization * raise error when phonemizer missing * re-order config docstrings to match method * change config naming * remove redundant str -> list * fix copyright: vits authors -> kakao enterprise * (mean, log_variances) -> (prior_mean, prior_log_variances) * if return dict -> if not return dict * speed -> speaking rate * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update fused tanh sigmoid * reduce dims in tester * audio -> output_values * audio -> output_values in tuple out * fix return type * fix return type * make _unconstrained_rational_quadratic_spline a function * all nn's to accept a config * add spectro to output * move {speaking rate, noise scale, noise scale duration} to config * path -> attn_path * idxs -> valid idxs -> padded idxs * output values -> waveform * use config for attention * make generation work * harden integration test * add spectrogram to dict output * tokenizer refactor * make style * remove 'fake' padding token * harden tokenizer tests * ron norm test * fprop / save tests deterministic * move uroman to tokenizer as much as possible * better logger message * fix vivit imports * add uroman integration test * make style * up * matthijs -> sanchit-gandhi * fix tokenizer test * make fix-copies * fix dict comprehension * fix config tests * fix model tests * make outputs consistent with reverse/not reverse * fix key concat * more model details * add author * return dict * speaker error * labels error * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/vits/convert_original_checkpoint.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove uromanize * add docstrings * add docstrings for tokenizer * upper-case skip messages * fix return dict * style * finish tests * update checkpoints * make style * remove doctest file * revert * fix docstring * fix tokenizer * remove uroman integration test * add sampling rate * fix docs / docstrings * style * add sr to model output * fix outputs * style / copies * fix docstring * fix copies * remove sr from model outputs * Update utils/documentation_tests.txt Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add sr as allowed attr --------- Co-authored-by: sanchit-gandhi <sanchit@huggingface.co> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-09-01 10:50:06 +01:00
Vibhor Kumar	99fc3ac8ac	Modify efficient GPU training doc with now-available adamw_bnb_8bit optimizer (#25807 ) * Modify single-GPU efficient training doc with now-available adamw_bnb_8bit optimizer * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-31 10:55:10 +01:00
Joao Gante	459bc6738c	Docs: fix example failing doctest in `generation_strategies.md` (#25874 )	2023-08-30 16:23:44 +01:00
Lysandre Debut	ed290b0837	Remote tools are turned off (#25867 )	2023-08-30 09:40:39 -04:00
Haylee Schäfer	dbc16f4404	Support loading base64 images in pipelines (#25633 ) * support loading base64 images * add test * mention in docs * remove the logging * sort imports * update error message * Update tests/utils/test_image_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * restructure to catch base64 exception * doesn't like the newline * download files * format * optimize imports * guess it needs a space? * support loading base64 images * add test * remove the logging * sort imports * restructure to catch base64 exception * doesn't like the newline * download files * optimize imports * guess it needs a space? --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-08-29 19:24:24 +01:00
Sohyun Sim	aade754b27	🌐 [i18n-KO] Translated `community.md` to Korean (#25674 ) * docs: ko: community.md * feat: deepl draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>	2023-08-29 11:47:24 -04:00

1 2 3 4 5 ...

2212 Commits