transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

Author	SHA1	Message	Date
NielsRogge	8211c59b9a	[KOSMOS-2] Update docs (#27157 ) Update docs	2023-10-30 21:42:19 +01:00
Rockerz	84724efd10	Translating `en/main_classes` folder docs to Japanese 🇯🇵 (#26894 ) * add * add * add * Add deepspeed.md * Add * add * Update docs/source/ja/main_classes/callback.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/output.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/pipelines.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/processors.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/processors.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/text_generation.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/main_classes/processors.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update logging.md * Update toctree.yml * Update docs/source/ja/main_classes/deepspeed.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Add suggesitons * m * Update docs/source/ja/main_classes/trainer.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update toctree.yml * Update Quantization.md * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update toctree.yml * Update docs/source/en/main_classes/deepspeed.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/main_classes/deepspeed.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-30 09:39:14 -07:00
Yeyang	9093b19b13	🌐 [i18n-ZH] Translate serialization.md into Chinese (#27076 ) * docs(zh): translate serialization.md * docs(zh): add space around links	2023-10-30 08:50:29 -07:00
Yih-Dar	691fd8fdde	Add `Kosmos-2` model (#24709 ) * Add KOSMOS-2 model * update * update * update * address review comment - 001 * address review comment - 002 * address review comment - 003 * style * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix * address review comment - 004 * address review comment - 005 * address review comment - 006 * address review comment - 007 * address review comment - 008 * address review comment - 009 * address review comment - 010 * address review comment - 011 * update readme * fix * fix * fix * [skip ci] fix * revert the change in _decode * fix docstring * fix docstring * Update docs/source/en/model_doc/kosmos-2.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * no more Kosmos2Tokenizer * style * remove "returned when being computed by the model" * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * UTM5 Atten * fix attn mask * use present_key_value_states instead of next_decoder_cache * style * conversion scripts * conversion scripts * conversion scripts * Add _reorder_cache * fix doctest and copies * rename 1 * rename 2 * rename 3 * make fixup * fix table * fix docstring * rename 4 * change repo_id * remove tip * update md file * make style * update md file * put docs/source/en/model_doc/kosmos-2.md to slow * update conversion script * Use CLIPImageProcessor in Kosmos2Processor * Remove Kosmos2ImageProcessor * Remove to_dict in Kosmos2Config * Remove files * fix import * Update conversion * normalized=False * Not using hardcoded values like <image> * elt --> element * Apply suggestion * Not using hardcoded values like </image> * No assert * No nested functions * Fix md file * copy * update doc * fix docstring * fix name * Remove _add_remove_spaces_around_tag_tokens * Remove dummy docstring of _preprocess_single_example * Use `BatchEncoding` * temp * temp * temp * Update * Update * Make Kosmos2ProcessorTest a bit pretty * Update gradient checkpointing * Fix gradient checkpointing test * Remove one liner remove_special_fields * Simplify conversion script * fix add_eos_token * update readme * update tests * Change to microsoft/kosmos-2-patch14-224 * style * Fix doc --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-30 13:32:17 +01:00
jiaqiw09	ef23b68ebf	translate transformers_agents.md to Chinese (#27046 ) * update translation * fix problems mentioned in reviews	2023-10-27 12:45:43 -07:00
Arthur	90ee9cea19	Revert "add exllamav2 arg" (#27102 ) Revert "add exllamav2 arg (#26437)" This reverts commit `8214d6e7b1`.	2023-10-27 11:23:06 +02:00
Marc Sun	8214d6e7b1	add exllamav2 arg (#26437 ) * add_ xllamav2 arg * add test * style * add check * add doc * replace by use_exllama_v2 * fix tests * fix doc * style * better condition * fix logic * add deprecate msg	2023-10-26 10:15:05 -04:00
Aarya Balwadkar	a2f55a65cd	Hindi translation of pipeline_tutorial.md (#26837 ) * hindi translation of pipeline_tutorial.md * Update pipeline_tutorial.md * Update build_documentation.yml * Update build_pr_documentation.yml * Updated build_documentation.yml --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-25 11:21:49 -07:00
Yeyang	ba5144f7a9	🌐 [i18n-ZH] Translate custom_models.md into Chinese (#27065 ) * docs(zh): translate custom_models.md * minor fix in customer_models Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-25 11:20:32 -07:00
Younes Belkada	c34c50cdc0	[`docs`] Add `MaskGenerationPipeline` in docs (#27063 ) * add `MaskGenerationPipeline` in docs * Update __init__.py * fix repo consistency and clarify docstring * add on check docstirngs * actually we do have a tf sam * oops	2023-10-25 19:31:36 +02:00
Maria Khalusova	9333bf0769	[docs] Performance docs refactor p.2 (#26791 ) * initial edits * improvements for clarity and flow * improvements for clarity and flow, removed the repetead section * removed two docs that had no content * Revert "removed two docs that had no content" This reverts commit `e98fa2fa0d`. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * feedback addressed * more feedback addressed * feedback addressed --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-24 13:10:06 -04:00
Alex McKinney	9da451713d	Device agnostic testing (#25870 ) * adds agnostic decorators and availability fns * renaming decorators and fixing imports * updating some representative example tests bloom, opt, and reformer for now * wip device agnostic functions * lru cache to device checking functions * adds `TRANSFORMERS_TEST_DEVICE_SPEC` if present, imports the target file and updates device to function mappings * comments `TRANSFORMERS_TEST_DEVICE_SPEC` code * extra checks on device name * `make style; make quality` * updates default functions for agnostic calls * applies suggestions from review * adds `is_torch_available` guard * Add spec file to docs, rename function dispatch names to backend_* * add backend import to docs example for spec file * change instances of to * Move register backend to before device check as per @statelesshz changes * make style * make opt test require fp16 to run --------- Co-authored-by: arsalanu <arsalanu@graphcore.ai> Co-authored-by: arsalanu <hzji210@gmail.com>	2023-10-24 16:49:26 +02:00
Leandro von Werra	b18e31407c	add info on TRL docs (#27024 ) * add info on TRL docs * add TRL link * tweak text * tweak text	2023-10-24 14:56:00 +02:00
Yeyang	32f799db0d	🌐 [i18n-ZH] Translate create_a_model.md into Chinese (#27026 ) docs(zh): translate create_a_model.md	2023-10-23 15:44:42 -07:00
jiaqiw09	b0d1d7f71a	translate `preprocessing.md` to Chinese (#26955 ) * translate preprocessing.md to Chinese * update files fixing problems mentioned in review * update files fixing problems mentioned in review --------- Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>	2023-10-23 10:36:24 -07:00
Yeyang	19ae0505ae	🌐 [i18n-ZH] Translate multilingual into Chinese (#26935 ) translate multilingual into Chinese Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-23 10:35:17 -07:00
jiaqiw09	f09a081d27	Translate `pipeline_tutorial.md` to chinese (#26954 ) * update translation of pipeline_tutorial and preprocessing(Version1.0) * update translation of pipeline_tutorial and preprocessing(Version2.0) * update translation docs * update to fix problems mentioned in review --------- Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>	2023-10-23 08:58:00 -07:00
Yoach Lacombe	cb45f71c4d	Add Seamless M4T model (#25693 ) * first raw commit * still POC * tentative convert script * almost working speech encoder conversion scripts * intermediate code for encoder/decoders * add modeling code * first version of speech encoder * make style * add new adapter layer architecture * add adapter block * add first tentative config * add working speech encoder conversion * base model convert works now * make style * remove unnecessary classes * remove unecessary functions * add modeling code speech encoder * rework logics * forward pass of sub components work * add modeling codes * some config modifs and modeling code modifs * save WIP * new edits * same output speech encoder * correct attention mask * correct attention mask * fix generation * new generation logics * erase comments * make style * fix typo * add some descriptions * new state * clean imports * add tests * make style * make beam search and num_return_sequences>1 works * correct edge case issue * correct SeamlessM4TConformerSamePadLayer copied from * replace ACT2FN relu by nn.relu * remove unecessary return variable * move back a class * change name conformer_attention_mask ->conv_attention_mask * better nit code * add some Copied from statements * small nits * small nit in dict.get * rename t2u model -> conditionalgeneration * ongoing refactoring of structure * update models architecture * remove SeamlessM4TMultiModal classes * add tests * adapt tests * some non-working code for vocoder * add seamlessM4T vocoder * remove buggy line * fix some hifigan related bugs * remove hifigan specifc config * change * add WIP tokenization * add seamlessM4T working tokenzier * update tokenization * add tentative feature extractor * Update converting script * update working FE * refactor input_values -> input_features * update FE * changes in generation, tokenizer and modeling * make style and add t2u_decoder_input_ids * add intermediate outputs for ToSpeech models * add vocoder to speech models * update valueerror * update FE with languages * add vocoder convert * update config docstrings and names * update generation code and configuration * remove todos and update config.pad_token_id to generation_config.pad_token_id * move block vocoder * remove unecessary code and uniformize tospeech code * add feature extractor import * make style and fix some copies from * correct consistency + make fix-copies * add processor code * remove comments * add fast tokenizer support * correct pad_token_id in M4TModel * correct config * update tests and codes + make style * make some suggested correstion - correct comments and change naming * rename some attributes * rename some attributes * remove unecessary sequential * remove option to use dur predictor * nit * refactor hifigan * replace normalize_mean and normalize_var with do_normalize + save lang ids to generation config * add tests * change tgt_lang logic * update generation ToSpeech * add support import SeamlessM4TProcessor * fix generate * make tests * update integration tests, add option to only return text and update tokenizer fast * fix wrong function call * update import and convert script * update integration tests + update repo id * correct paths and add first test * update how new attention masks are computed * update tests * take first care of batching in vocoder code * add batching with the vocoder * add waveform lengths to model outputs * make style * add generate kwargs + forward kwargs of M4TModel * add docstrings forward methods * reformate docstrings * add docstrings t2u model * add another round of modeling docstrings + reformate speaker_id -> spkr_id * make style * fix check_repo * make style * add seamlessm4t to toctree * correct check_config_attributes * write config docstrings + some modifs * make style * add docstrings tokenizer * add docstrings to processor, fe and tokenizers * make style * write first version of model docs * fix FE + correct FE test * fix tokenizer + add correct integration tests * fix most tokenization tests * make style * correct most processor test * add generation tests and fix num_return_sequences > 1 * correct integration tests -still one left * make style * correct position embedding * change numbeams to 1 * refactor some modeling code and correct one test * make style * correct typo * refactor intermediate fnn * refactor feedforward conformer * make style * remove comments * make style * fix tokenizer tests * make style * correct processor tests * make style * correct S2TT integration * Apply suggestions from Sanchit code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * correct typo * replace torch.nn->nn + make style * change Output naming (waveforms -> waveform) and ordering * nit renaming and formating * remove return None when not necessary * refactor SeamlessM4TConformerFeedForward * nit typo * remove almost copied from comments * add a copied from comment and remove an unecessary dropout * remove inputs_embeds from speechencoder * remove backward compatibiliy function * reformate class docstrings for a few components * remove unecessary methods * split over 2 lines smthg hard to read * make style * replace two steps offset by one step as suggested * nice typo * move warnings * remove useless lines from processor * make generation non-standard test more robusts * remove torch.inference_mode from tests * split integration tests * enrich md * rename control_symbol_vocoder_offset->vocoder_offset * clean convert file * remove tgt_lang and src_lang from FE * change generate docstring of ToText models * update generate docstring of tospeech models * unify how to deal withtext_decoder_input_ids * add default spkr_id * unify tgt_lang for t2u_model * simplify tgt_lang verification * remove a todo * change config docstring * make style * simplify t2u_tgt_lang_id * make style * enrich/correct comments * enrich .md * correct typo in docstrings * add torchaudio dependency * update tokenizer * make style and fix copies * modify SeamlessM4TConverter with new tokenizer behaviour * make style * correct small typo docs * fix import * update docs and add requirement to tests * add convert_fairseq2_to_hf in utils/not_doctested.txt * update FE * fix imports and make style * remove torchaudio in FE test * add seamless_m4t.md to utils/not_doctested.txt * nits and change the way docstring dataset is loaded * move checkpoints from ylacombe/ to facebook/ orga * refactor warning/error to be in the 119 line width limit * round overly precised floats * add stereo audio behaviour * refactor .md and make style * enrich docs with more precised architecture description * readd undocumented models * make fix-copies * apply some suggestions * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * correct bug from previous commit * refactor a parameter allowing to clean the code + some small nits * clean tokenizer * make style and fix * make style * clean tokenizers arguments * add precisions for some tests * move docs from not_tested to slow * modify tokenizer according to last comments * add copied from statements in tests * correct convert script * correct parameter docstring style * correct tokenization * correct multi gpus * make style * clean modeling code * make style * add copied from statements * add copied statements * add support with ASR pipeline * remove file added inadvertently * fix docstrings seamlessM4TModel * add seamlessM4TConfig to OBJECTS_TO_IGNORE due of unconventional markdown * add seamlessm4t to assisted generation ignored models --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-23 14:49:48 +02:00
Omar Sanseviero	d33d313192	Nits in Llama2 docstring (#26996 ) Update llama2.md	2023-10-23 14:19:59 +02:00
Akhil	093848d3cc	Added Telugu [te] translations (#26828 ) * Create index.md * Create _toctree.yml * Updated index.md in telugu * Update _toctree.yml * Create quicktour.md * Update quicktour.md * Create index.md * Update quicktour.md * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Delete docs/source/hi/index.md * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/te/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update build_documentation.yml Added telugu [te] * Update build_pr_documentation.yml Added Telugu [te] * Update _toctree.yml --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-20 15:27:55 -07:00
Diego Machado	9b1976697d	fix set_transform link docs (#26856 ) * fix set_transform link * Update docs/source/en/preprocessing.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * use doc-builder sintax --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-20 11:16:37 +02:00
Joao Gante	ae4fb84629	Generate: update basic llm tutorial (#26937 )	2023-10-19 16:53:28 +01:00
Mohamed Aymane Farhi	73dc23f786	Fix license (#26931 )	2023-10-19 15:36:41 +02:00
Patrick von Platen	734dd96e02	[Docs] Make sure important decode and generate method are nicely displayed in Whisper docs (#26927 ) better docstrings whisper	2023-10-19 13:01:47 +02:00
Pablo Montalvo	caa0ff0bf1	Add fuyu model (#26911 ) * initial commit * add processor, add fuyu naming * add draft processor * fix processor * remove dropout to fix loading of weights * add image processing fixes from Pedro * fix * fix processor * add basic processing fuyu test * add documentation and TODO * address comments, add tests, add doc * replace assert with torch asserts * add Mixins and fix tests * clean imports * add model tester, clean imports * fix embedding test * add updated tests from pre-release model * Processor: return input_ids used for inference * separate processing and model tests * relax test tolerance for embeddings * add test for logit comparison * make sure fuyu image processor is imported in the init * fix formattingh * more formatting issues * and more * fixups * remove some stuff * nits * update init * remove the fuyu file * Update integration test with release model * Update conversion script. The projection is not used, as confirmed by the authors. * improve geenration * Remove duplicate function * Trickle down patches to model call * processing fuyu updates * remove things * fix prepare_inputs_for_generation to fix generate() * remove model_input * update * add generation tests * nits * draft leverage automodel and autoconfig * nits * fix dtype patch * address comments, update READMEs and doc, include tests * add working processing test, remove refs to subsequences * add tests, remove Sequence classification * processing * update * update the conversion script * more processing cleanup * safe import * take out ModelTesterMixin for early release * more cl;eanup * more cleanup * more cleanup * and more * register a buffer * nits * add postprocessing of generate output * nits * updates * add one working test * fix test * make fixup works * fixup * Arthur's updates * nits * update * update * fix processor * update tests * passe more fixups * fix * nits * don't import torch * skip fuyu config for now * fixup done * fixup * update * oups * nits * Use input embeddings * no buffer * update * styling processing fuyu * fix test * update licence * protect torch import * fixup and update not doctested * kwargs should be passed * udpates * update the impofixuprts in the test * protect import * protecting imports * protect imports in type checking * add testing decorators * protect top level import structure * fix typo * fix check init * move requires_backend to functions * Imports * Protect types --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: ArthurZucker <arthur.zucker@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Lysandre <lysandre@huggingface.co>	2023-10-18 15:24:11 -07:00
Yeyang	732d2a8aac	[i18n-ZH] Translated fast_tokenizers.md to Chinese (#26910 ) docs: translate fast_tokenizers into Chinese	2023-10-18 10:45:41 -07:00
Rockerz	eec5a3a8d8	Refactor code part in documentation translated to japanese (#26900 ) Refactor code in documentation	2023-10-18 10:35:58 -07:00
Merve Noyan	280c757f6c	Knowledge distillation for vision guide (#25619 ) * Knowledge distillation for vision guide * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Iterated on Rafael's comments * Added to toctree * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Addressed comments * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update knowledge_distillation_for_image_classification.md * Update knowledge_distillation_for_image_classification.md * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/knowledge_distillation_for_image_classification.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Address comments * Update knowledge_distillation_for_image_classification.md * Explain KL Div --------- Co-authored-by: Rafael Padilla <31217453+rafaelpadilla@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Maria Khalusova <kafooster@gmail.com>	2023-10-18 04:42:32 -07:00
Rockerz	b002353dca	Translating `en/internal` folder docs to Japanese 🇯🇵 (#26747 ) * Add translation to fitst 3 file of internal folder * Update Toctree.md and add files * Update docs/source/ja/internal/generation_utils Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Rename generation_utils file * rename pipelines_utils.md * Change file names --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-17 15:01:21 -07:00
Bingchen Zhao	46092f763d	Fixed a typo in mistral.md (#26879 ) Fix a typo in mistral.md	2023-10-17 14:06:37 -07:00
Susheel Thapa	b3961f7291	Chore: Typo fixed in multiple files of docs/source/en/model_doc (#26833 ) * Chore: Typo fixed in multiple files of docs/source/en/model_doc * Update docs/source/en/model_doc/nllb-moe.md Co-authored-by: Aryan V S <avs050602@gmail.com> --------- Co-authored-by: Aryan V S <avs050602@gmail.com>	2023-10-17 07:10:08 +02:00
Patrick von Platen	805d5d2111	Add LLM doc (#26058 ) * [WIP] Add LLM doc * rename * latex * latex * Fix more latex * [LLMs] Getting most out of LLMS * improve * try again * Apply suggestions from code review Co-authored-by: Maria Khalusova <kafooster@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/en/llm_tutorial_optimization.md * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Apply suggestions from code review * move file --------- Co-authored-by: Maria Khalusova <kafooster@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2023-10-16 16:09:50 +02:00
NielsRogge	570b3f9cdd	[OWL-ViT, OWLv2] Add resources (#26822 ) Add resources	2023-10-16 15:47:44 +02:00
Merve Noyan	5d997f227c	Image-to-Image Task Guide (#26595 ) * img2img task guide * Update year * Add to toctree * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Addressed comments * Update docs/source/en/tasks/image_to_image.md Co-authored-by: Maria Khalusova <kafooster@gmail.com> * Addressed comments --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Maria Khalusova <kafooster@gmail.com>	2023-10-16 15:12:03 +02:00
Shreyas S	0dd58d96a0	Fixed typos (#26810 ) Update feature_extractor.md	2023-10-16 09:52:29 +02:00
Injin Paek	d6e5b02ef3	Add CLIP resources (#26534 ) * docs: feat: model resources for CLIP * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: resolve suggestion * fix: resolve suggestion * fix: resolve suggestion Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix: resolve suggestion * fix: resolve suggestions Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-13 11:12:59 -07:00
NielsRogge	762af3e3c7	Add OWLv2, bis (#26668 ) * First draft * Update conversion script * Update copied from statements * Fix style * Add copied from to config * Add copied from to processor * Run make fixup * Add docstring * Update docstrings * Add method * Improve docstrings * Fix docstrings * Improve docstrings * Remove onnx * Add flag * Address comments * Add copied from to model tests * Add flag to conversion script * Add code snippet * Address more comments * Address comment * Improve conversion script * More improvements * Add expected objectness logits * Skip test * Improve conversion script * Extend conversion script * Convert large checkpoint * Fix doc tests * Convert all checkpoints, update integration tests * Add checkpoint_path arg * Fix repo_id	2023-10-13 16:41:24 +02:00
Wonhyeong Seo	7790943c91	🌐 [i18n-KO] Translated `big_models.md` to Korean (#26245 ) * docs: ko: big_models.md * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-Authored-By: bolizabeth <68984363+bolizabeth@users.noreply.github.com> --------- Co-authored-by: bolizabeth <68984363+bolizabeth@users.noreply.github.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-authored-by: heuristicwave <31366038+heuristicwave@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-12 15:00:12 -07:00
Heinz-Alexander Fuetterer	883ed4b344	chore: fix typos (#26756 )	2023-10-12 18:00:27 +02:00
Maria Khalusova	0ebee8b933	[docs] LLM prompting guide (#26274 ) * llm prompting guide * updated code examples * an attempt to fix the code example tests * set seed in examples * added a doctest comment * added einops to the doc_test_job * string formatting * string formatting, again * added the toc to slow_documentation_tests.txt * minor list fix * string formatting + pipe renamed * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * replaced max_length with max_new_tokens and updated the outputs to match * minor formatting fix * removed einops from circleci config * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * removed einops and trust_remote_code parameter --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-10-12 08:48:01 -04:00
Lysandre Debut	ab0ddc99e8	Warnings controlled by logger level (#26527 ) * Logger level Co-authored-by: Sahil Bhosale <sahilbhosale63@live.com> Co-authored-by: Adithya4720 <hegdeadithyak@gmail.com> Co-authored-by: Sachin Singh <sachinishu02@gmail.com> Co-authored-by: Riya Dhanduke <113622644+riiyaa24@users.noreply.github.com> * More comprehensive documentation --------- Co-authored-by: Sahil Bhosale <sahilbhosale63@live.com> Co-authored-by: Adithya4720 <hegdeadithyak@gmail.com> Co-authored-by: Sachin Singh <sachinishu02@gmail.com> Co-authored-by: Riya Dhanduke <113622644+riiyaa24@users.noreply.github.com>	2023-10-12 10:48:38 +02:00
TERRY LEE	e1cec43415	Translated the accelerate.md file of the documentation to Chinese (#26161 ) * translate accelerate page * Update docs/source/zh/accelerate.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-10-11 10:54:22 -07:00
Rockerz	9b7668c03a	add japanese documentation (#26138 ) * udpaet * update * Update docs/source/ja/autoclass_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * add codes workflows/build_pr_documentation.yml * Create preprocessing.md * added traning.md * Create Model_sharing.md * add quicktour.md * new * ll * Create benchmark.md * Create Tensorflow_model * add * add community.md * add create_a_model * create custom_model.md * create_custom_tools.md * create fast_tokenizers.md * create * add * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * md * add * commit * add * h * Update docs/source/ja/peft.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/ja/_toctree.yml Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/ja/_toctree.yml Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Suggested Update * add perf_train_gpu_one.md * added perf based MD files * Modify toctree.yml and Add transmartion to md codes * Add `serialization.md` and edit `_toctree.yml` * add task summary and tasks explained * Add and Modify files starting from T * Add testing.md * Create main_classes files * delete main_classes folder * Add toctree.yml * Update llm_tutorail.md * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update misspelled filenames * Update docs/source/ja/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/_toctree.yml * Update docs/source/ja/_toctree.yml * missplled file names inmrpovements * Update _toctree.yml * close tip block * close another tip block * Update docs/source/ja/quicktour.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/pipeline_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/pipeline_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/preprocessing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/add_new_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/testing.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/task_summary.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/tasks_explained.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update glossary.md * Update docs/source/ja/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/llm_tutorial.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/create_a_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/torchscript.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/benchmarks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/troubleshooting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ja/add_new_model.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update perf_torch_compile.md * Update Year to default in en documentation * Final Update --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2023-10-11 10:26:37 -07:00
Ben Gubler	9f40639292	Update docs to explain disabling callbacks using report_to (#26155 ) * feat: update callback doc to explain disabling callbacks using report_to * docs: update report_to docstring	2023-10-11 07:50:23 -04:00
Tuowei Wang	a9862a0f49	Fix Typo: table in deepspeed.md (#26705 )	2023-10-10 11:50:10 +02:00
tom white	c7f01beece	fix typos in idefics.md (#26648 ) * fix typos in idefics.md Two typos found in reviewing this documentation. 1) max_new_tokens=4, is not sufficient to generate "Vegetables" as indicated - you will get only "Veget". (incidentally - some mention of how to select this value might be useful as it seems to change in each example) 2) inputs = processor(prompts, return_tensors="pt").to(device) as inputs need to be on the same device (as they are in all other examples on the page) * Update idefics.md Change device to cuda explicitly to match other examples	2023-10-09 12:18:02 +02:00
NielsRogge	2629c8f36a	[DINOv2] Convert more checkpoints (#26177 ) * Convert checkpoints * Update doc test * Address comment	2023-10-09 09:58:04 +02:00
Jabasukuriputo Wang	897a826d83	docs(zh): review and punctuation & space fix (#26627 )	2023-10-06 09:24:28 -07:00
Matt	ea52ed9dc8	Update chat template docs with more tips on writing a template (#26625 )	2023-10-06 12:04:40 +01:00
Maria Khalusova	18fbeec824	[docs] Update to scripts building index.md (#26546 ) * build the table in index.md with links to the model_doc * removed list generation on index.md * fixed missing models * make style	2023-10-05 10:20:41 -04:00

1 2 3 4 5 ...

2181 Commits