transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 22:00:09 +06:00

Author	SHA1	Message	Date
Matt	508a704055	No more Tuple, List, Dict (#38797 ) * No more Tuple, List, Dict * make fixup * More style fixes * Docstring fixes with regex replacement * Trigger tests * Redo fixes after rebase * Fix copies * [test all] * update * [test all] * update * [test all] * make style after rebase * Patch the hf_argparser test * Patch the hf_argparser test * style fixes * style fixes * style fixes * Fix docstrings in Cohere test * [test all] --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-17 19:37:18 +01:00
Quentin Gallouédec	de24fb63ed	Use HF papers (#38184 ) * Use hf papers * Hugging Face papers * doi to hf papers * style	2025-06-13 11:07:09 +00:00
Lysandre Debut	d538293f62	Transformers cli clean command (#37657 ) * transformers-cli -> transformers * Chat command works with positional argument * update doc references to transformers-cli * doc headers * deepspeed --------- Co-authored-by: Joao Gante <joao@huggingface.co>	2025-04-30 12:15:43 +01:00
Mehant Kammakomati	7d76876498	(Part 2) feat: allow for tp_size attr for tplizing the model (#37054 ) * feat: custom tp_size, new transformers tp interface Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: review cmt - error when tp_plan not set for tp_size Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: nit in docs Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> --------- Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Matej Sirovatka <54212263+S1ro1@users.noreply.github.com>	2025-04-10 17:44:09 +02:00
omahs	cbf924b76c	Fix typos (#36910 ) * fix typos * fix typos * fix typos * fix typos	2025-03-24 14:08:29 +00:00
Matt	1e4286fd59	Remove research projects (#36645 ) * Remove research projects * Add new README to explain where the projects went * Trigger tests * Cleanup all references to research_projects	2025-03-11 13:47:38 +00:00
Mehant Kammakomati	c3ba53303b	feat: add support for tensor parallel training workflow with accelerate (#34194 ) * feat: add support for tensor parallel flow using accelerate Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: add tp degree to env variable Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: add version check for accelerate to allow TP Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * docs: tensor parallelism Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * nit: rename plugin name Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: guard accelerate version before allow tp Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * docs: add more docs and updates related to TP Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> --------- Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2025-02-18 14:05:46 +01:00
Thomas Bauwens	8f137b2427	Move `DataCollatorForMultipleChoice` from the docs to the package (#34763 ) * Add implementation for DataCollatorForMultipleChoice based on docs. * Add DataCollatorForMultipleChoice to import structure. * Remove custom DataCollatorForMultipleChoice implementations from example scripts. * Remove custom implementations of DataCollatorForMultipleChoice from docs in English, Spanish, Japanese and Korean. * Refactor torch version of DataCollatorForMultipleChoice to be more easily understandable. * Apply suggested changes and run make fixup. * fix copies, style and fixup * add missing documentation * nits * fix docstring * style * nits * isort --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>	2025-02-13 12:01:28 +01:00
Jacky Lee	4302b27719	Fix typos in translated quicktour docs (#35302 ) * fix: quicktour typos * fix: one more	2024-12-17 09:32:00 -08:00
Fanli Lin	25f510a9c6	[docs] update not-working model revision (#34682 ) update revision	2024-11-11 07:09:31 -08:00
amyeroberts	b7474f211d	Trainer - deprecate tokenizer for processing_class (#32385 ) * Trainer - deprecate tokenizer for processing_class * Extend chage across Seq2Seq trainer and docs * Add tests * Update to FutureWarning and add deprecation version	2024-10-02 14:08:46 +01:00
S M Jishanul Islam	8defc95df3	Updated the custom_models.md changed cross_entropy code (#33118 )	2024-08-26 13:15:43 +02:00
Matt	edd68f4ed8	🚨 No more default chat templates (#31733 ) * No more default chat templates * Add the template to the GPT-SW3 tests since it's not available by default now * Fix GPT2 test * Fix Bloom test * Fix Bloom test * Remove default templates again	2024-07-24 17:36:32 +01:00
Aaron Jimenez	c73ee1333d	[docs] Spanish translation of tokenizer_summary.md (#31154 ) * add tokenizer_summary to es/_toctree.yml * add tokenizer_summary to es/ * fix link to Transformes XL in en/ * translate until Subword tokenization section * fix GPT link in en/ * fix other GPT link in en/ * fix typo in en/ * translate the doc * run make fixup * Remove .md in Transformer XL link * fix some link issues in es/ * fix typo	2024-06-03 16:52:23 -07:00
Lucain	c3044ec2f3	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016 ) * Fix has_file in offline mode * harmonize env variable for offline mode * Switch to HF_HUB_OFFLINE * fix test * revert test_offline to test TRANSFORMERS_OFFLINE * Add new offline test * merge conflicts * docs	2024-05-29 11:55:43 +01:00
Aaron Jimenez	0df888ffb7	[docs] Spanish translation of model_memory_anatomy.md (#30885 ) * add model_memory_anatomy to es/_toctree.yml * copy model_memory_anatomy.md to es/ * translate first section * translate doc * chage forward activations * fix sentence and and link to Trainer * fix Trainer link	2024-05-20 16:48:52 -07:00
Aaron Jimenez	8ce4fefc52	[docs] Update link in es/pipeline_webserver.md (#30745 ) * update link * run make style	2024-05-10 09:29:26 -07:00
Aaron Jimenez	47735f5f0f	[docs] Update es/pipeline_tutorial.md (#30684 ) * copy en/ contect to es/ * translate first section * translate the doc * fix typos * run make style	2024-05-09 16:42:01 -07:00
amyeroberts	bbaa8ceff6	Fix canonical model --model_type in examples (#30480 ) Fix --model_type in examples	2024-05-01 15:47:05 +01:00
clinty	bdbe166211	Fix broken link to Transformers notebooks (#30512 ) Co-authored-by: Clint Adams <clint@debian.org>	2024-04-29 10:57:51 +01:00
Aaron Jimenez	a98c41798c	[docs] Spanish translation of pipeline_tutorial.md (#30252 ) * add pipeline_webserver to es/ * add pipeline_webserver to es/, translate first section * add comment for checking link * translate pipeline_webserver * edit pipeline_webserver * fix typo	2024-04-25 12:18:06 -07:00
Lysandre Debut	0eb8fbcdac	Remove task guides auto-update in favor of links towards task pages (#30429 )	2024-04-24 09:38:10 +02:00
Zach Mueller	60d5f8f9f0	🚨🚨🚨Deprecate `evaluation_strategy` to `eval_strategy`🚨🚨🚨 (#30190 ) * Alias * Note alias * Tests and src * Rest * Clean * Change typing? * Fix tests * Deprecation versions	2024-04-18 12:49:43 -04:00
Hafedh	0eaef0c709	add `push_to_hub` to pipeline (#29172 ) * add `push_to_hub` to pipeline * fix docs * format with ruff * update save_pretrained * update save_pretrained * remove unnecessary comment * switch to push_to_hub method in DynamicPipelineTester * remove unused imports * update docs for add_new_pipeline * fix docs for add_new_pipeline * add comment * fix italien docs * changes to token retrieval for pipelines * Update src/transformers/pipelines/base.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-04-16 15:34:04 +01:00
Yih-Dar	cbc2cc187a	More fixes for doctest (#30265 ) * fix * update * update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-04-16 11:58:55 +02:00
NielsRogge	e9c23fa056	[Trainer] Undo #29896 (#30129 ) * Undo * Use tokenizer * Undo data collator	2024-04-09 12:55:42 +02:00
Utkarsha Gupte	0201f6420b	[#29174 ] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888 ) * ImportError: Trainer with PyTorch requires accelerate>=0.20.1 Fix Adding the evaluate and accelerate installs at the beginning of the cell to fix the issue * ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 * Import Error Fix * Update installation.md * Update quicktour.md * rollback other lang changes * Update _config.py * updates for other languages * fixing error * Tutorial Update * Update tokenization_utils_base.py * Just use an optimizer string to pass the doctest? --------- Co-authored-by: Matt <rocketknight1@gmail.com>	2024-04-08 14:21:16 +01:00
NielsRogge	1ab7136488	[Trainer] Allow passing image processor (#29896 ) * Add image processor to trainer * Replace tokenizer=image_processor everywhere	2024-04-05 10:10:44 +02:00
Aaron Jimenez	00c1d87a7d	[docs] Spanish translation of attention.md (#29681 ) * add attention to es/ and edit es/_toctree.yml * translate attention.md * fix transformers * fix transformers	2024-03-15 11:55:35 -07:00
njackman-2344	d3801aae2e	[docs] Spanish translate chat_templating.md & yml addition (#29559 ) * torchscript and trainer md es translation * corrected md es files and even corrected spelling in en md * made es corrections to trainer.md * deleted entrenamiento... title on yml * placed entrenamiento in right place * translated es chat_templating.md w/ yml addition * requested es changes to md and yml * last es changes to md	2024-03-13 09:28:11 -07:00
njackman-2344	e947683294	[Docs] Spanish Translation -Torchscript md & Trainer md (#29310 ) * torchscript and trainer md es translation * corrected md es files and even corrected spelling in en md * made es corrections to trainer.md * deleted entrenamiento... title on yml * placed entrenamiento in right place	2024-03-04 13:57:51 -08:00
Aaron Jimenez	9f7535bda8	[docs] Spanish translation of tasks_explained.md (#29224 ) * Add tasks_explained.md to es/ * Fix little typo in en/ version * translate speach/audio section * translate part of vision computer section \| fix little typo in en/ * Fix little typo in en/ * Translate vision computer section \| remove to * * in both files * Translate NLP section \| fix link to task/translation in en/ * Updete link in es/tasks_summary.md * Fix task_summary title link	2024-02-26 08:18:15 -08:00
Gustavo Isturiz	3c00b885b9	Added image_captioning version in es and included in toctree file (#29104 ) added image_captioning version in es and included in toctree file	2024-02-20 09:13:15 -08:00
Aaron Jimenez	ce4fff0be7	[Docs] Spanish translation of task_summary.md (#28844 ) * Add task_summary to es/_toctree.yml * Add task_summary.md to docs/es * Change title of task_summary.md * Translate firsts paragraphs * Translate middle paragraphs * Translte the rest of the doc * Edit firts paragraph	2024-02-16 15:50:06 -08:00
Lysandre Debut	f497f564bb	Update all references to canonical models (#29001 ) * Script & Manual edition * Update	2024-02-16 08:16:58 +01:00
Klaus Hipp	2749e479f3	[Docs] Fix broken links and syntax issues (#28918 ) * Fix model documentation links in attention.md * Fix external link syntax * Fix target anchor names of section links * Fix copyright statement comments * Fix documentation headings	2024-02-08 14:13:35 -08:00
Klaus Hipp	1c31b7aa3b	[Docs] Add missing language options and fix broken links (#28852 ) * Add missing entries to the language selector * Add links to the Colab and AWS Studio notebooks for ONNX * Use anchor links in CONTRIBUTING.md * Fix broken hyperlinks due to spaces * Fix links to OpenAI research articles * Remove confusing footnote symbols from author names, as they are also considered invalid markup	2024-02-06 12:01:01 -08:00
Klaus Hipp	4830f26965	[Docs] Fix backticks in inline code and documentation links (#28875 ) Fix backticks in code blocks and documentation links	2024-02-06 11:15:44 -08:00
Hankyeol Kyung	995a7ce9a8	Fix broken link on page (#28451 ) * [docs] Fix broken link Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com> * [docs] Use shorter domain Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com> --------- Signed-off-by: Hankyeol Kyung <kghnkl0103@gmail.com>	2024-01-11 09:26:13 -08:00
Kevin Herro	5d36025ca1	README: install transformers from conda-forge channel (#28313 ) Switch to the conda-forge channel for transformer installation, as the huggingface channel does not offer the latest version. Fixes #28248	2024-01-04 09:36:16 -08:00
Aaron Jimenez	6b8ec2588e	[docs] Sort es/toctree.yml \| Translate performance.md (#28262 ) * Sort es/_toctree.yml like en/_toctree.yml * Run make style * Add -Rendimiento y escalabilidad- section to es/_toctree.yml * Run make style * Add s to section * Add translate of performance.md * Add performance.md to es/_toctree.yml * Run make styele * Fix docs links * Run make style	2024-01-03 14:35:58 -08:00
Aaron Jimenez	815ea8e8a2	[Doc] Spanish translation of glossary.md (#27958 ) * Add glossary to es/_toctree.yml * Add glossary.md to es/ * A section translated * B and C section translated * Fix typo in en/glossary.md C section * D section translated \| Add a extra line in en/glossary.md * E and F section translated \| Fix typo in en/glossary.md * Fix words preentrenado * H and I section translated \| Fix typo in en/glossary.md * L section translated * M and N section translated * P section translated * R section translated * S section translated * T section translated * U and Z section translated \| Fix TensorParallel link in both files * Fix word	2023-12-13 09:21:59 -08:00
Aaron Jimenez	d6c3a3f137	[Doc] Spanish translation of pad_truncation.md (#27890 ) * Add pad_truncation to es/_toctree.yml * Add pad_truncation.md to es/ * Translated first two paragraph * Translated paddig argument section * Translated truncation argument section * Translated final paragraphs * Translated table * Fixed typo in the table of en/pad_truncation.md * Run make style \| Fix a word * Add Padding (relleno) y el Truncation (truncamiento) in the final paragraphs * Fix relleno and truncamiento words	2023-12-08 10:32:18 -08:00
Aaron Jimenez	da1d0d404f	Documentation: Spanish translation of perplexity.mdx (#27807 ) * Copy perplexity.md file to es/ folder * Adding perplexity to es/_toctree.yml * Translate first section * Calculating PPL section translate * Example section translate * fix translate of log-likehood * Fix title translate * Fix \ in second paragraph * Change verosimilitud for log-likelihood * Run 'make style'	2023-12-05 10:53:55 -08:00
Peter Pan	ce31508134	docs: replace torch.distributed.run by torchrun (#27528 ) * docs: replace torch.distributed.run by torchrun `transformers` now officially support pytorch >= 1.10. The entrypoint `torchrun`` is present from 1.10 onwards. Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> * Update src/transformers/trainer.py with @ArthurZucker's suggestion Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Signed-off-by: Peter Pan <Peter.Pan@daocloud.io> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-11-27 16:26:33 +00:00
Yih-Dar	7293fdc5b9	Deprecate `TransfoXL` (#27607 ) * fix * fix * trigger * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * tic * revert * revert --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re>	2023-11-24 11:48:02 +01:00
V.Prasanna kumar	ffbcfc0166	Broken links fixed related to datasets docs (#27569 ) fixed the broken links belogs to dataset library of transformers	2023-11-17 13:44:09 -08:00
V.Prasanna kumar	638d49983f	fixed broken link (#27560 )	2023-11-17 08:20:42 -08:00
Maria Khalusova	9beb2737d7	[docs] fixed links with 404 (#27327 ) * fixed links with 404 * make style	2023-11-06 19:45:03 +00:00
Phuc Van Phan	9cebae64ad	docs: update link huggingface map (#26077 )	2023-09-11 12:57:04 +01:00

1 2 3

114 Commits