transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Philipp Schmid	76800fb8e6	added new merged Trainer test (#11090 )	2021-04-06 15:12:21 +02:00
Philipp Schmid	b219d6b5a5	added social thumbnail for docs (#11083 )	2021-04-06 14:56:18 +02:00
Sylvain Gugger	6c1bee7d89	Link to new blog	2021-04-06 08:55:40 -04:00
Stas Bekman	f7328de46d	HF emoji unicode doesn't work in console (#11081 ) It doesn't look like using 🤗 is a great idea for printing to console. See attachment. This PR proposes to replace 🤗 with "HuggingFace" for an exception message. @LysandreJik	2021-04-06 08:03:00 -04:00
Hemil Desai	6ab7d1a429	Add Readme for language modeling scripts with accelerate (#11073 )	2021-04-05 20:56:12 -04:00
Sylvain Gugger	2199608ca6	Make a base init in FeatureExtractionMixin (#11074 )	2021-04-05 18:02:28 -04:00
Sylvain Gugger	04ceee7d24	Fix distributed gather for tuples of tensors of varying sizes (#11071 )	2021-04-05 16:21:49 -04:00
Sylvain Gugger	f05a8a0c5e	Document common config attributes (#11070 )	2021-04-05 15:29:01 -04:00
Sylvain Gugger	090e3e6896	Add center_crop to ImageFeatureExtractoMixin (#11066 )	2021-04-05 15:28:51 -04:00
konstin	abb7430003	Replace pkg_resources with importlib_metadata (#11061 ) * Replace pkg_resources with importlib_metadata Fixes #10964. The other reason for this change is that pkg_resources has been [deprecated](`8fe85c22ce`) in favor of importlib_metadata. * Reduce to a single importlib_metadata import switch * Trigger CI Co-authored-by: Stas Bekman <stas@stason.org>	2021-04-05 12:12:19 -07:00
Hemil Desai	b51b87c41d	Add `examples/language_modeling/run_clm_no_trainer.py` (#11026 ) * Initial draft for clm no trainer * Remove unwanted args * Fix bug * Update examples/language-modeling/run_clm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-05 12:27:52 -04:00
Amala Deshmukh	e1c02e018c	Add example for registering callbacks with trainers (#10928 ) * Add example for callback registry Resolves: #9036 * Update callback registry documentation * Added comments for other ways to register callback	2021-04-05 12:27:23 -04:00
Lysandre Debut	9f4e0c23d6	Documentation about loading a fast tokenizer within Transformers (#11029 ) * Documentation about loading a fast tokenizer within Transformers * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-05 10:51:16 -04:00
Sylvain Gugger	6c25f5228e	Refactor AutoModel classes and add Flax Auto classes (#11027 ) * Refactor AutoModel classes and add Flax Auto classes * Add new objects to the init * Fix hubconf and sort models * Fix TF tests * Missing coma * Update src/transformers/models/auto/auto_factory.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Fix init * Fix dummies * Other init to fix Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-05 10:11:28 -04:00
Lysandre Debut	eb3479e7cf	Some models have no tokenizers (#11064 )	2021-04-05 09:37:49 -04:00
Lysandre Debut	773e4c7263	Remove unnecessary space (#11060 )	2021-04-05 09:36:20 -04:00
Lysandre Debut	ef62f038fd	Pin docutils (#11062 ) * Pin docutils * Versions table	2021-04-05 09:35:21 -04:00
Eren Şahin	6e31014110	[doc] update code-block rendering (#11053 ) double : prevents code-block section to be rendered, so made it single :	2021-04-05 09:06:07 -04:00
Stas Bekman	3d39226a51	s\|Pretrained\|PreTrained\| (#11048 )	2021-04-04 18:08:42 -07:00
Sylvain Gugger	b0d49fd536	Add a script to check inits are consistent (#11024 )	2021-04-04 20:41:34 -04:00
versis	335c0ca35c	fixed typo: logging instead of logger (#11025 )	2021-04-02 09:22:22 -04:00
Philipp Schmid	34e1bec649	added new notebook and merge of trainer (#11015 ) * added new notebook and merge of trainer * Update docs/source/sagemaker.md Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-01 23:13:47 +02:00
Julien Chaumond	e8da77d181	[doc] no more bucket	2021-04-01 14:25:47 -04:00
Joe Davison	f4ad3d8cea	minor typo fix negative log-likelihood	2021-04-01 11:58:37 -06:00
cronoik	57c1749efa	DebertaTokenizer Rework closes #10258 (#10703 ) * closes #10258 * typo * reworked deberta test * implemented the comments from BigBird01 regarding sequence pair encoding of deberta * Update style * VOCAB_FILES_NAMES is now a oneliner as suggested by @sgugger Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * added #fmt: on as requested by @sgugger * Style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-04-01 13:53:53 -04:00
NielsRogge	30677dc743	Add Vision Transformer and ViTFeatureExtractor (#10950 ) * Squash all commits into one * Update ViTFeatureExtractor to use image_utils instead of torchvision * Remove torchvision and add Pillow * Small docs improvement * Address most comments by @sgugger * Fix tests * Clean up conversion script * Pooler first draft * Fix quality * Improve conversion script * Make style and quality * Make fix-copies * Minor docs improvements * Should use fix-copies instead of manual handling * Revert "Should use fix-copies instead of manual handling" This reverts commit `fd4e591bce`. * Place ViT in alphabetical order Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-01 11:16:05 -04:00
cchen-dialpad	af6732225c	Improve the speed of adding tokens from added_tokens.json (#10780 ) * use bisect to add one token to unique_no_split_tokens * fix style	2021-04-01 08:56:12 -04:00
Josh	c301c26370	Fix Adafactor documentation (recommend correct settings) (#10526 ) * Update optimization.py Fix documentation to reflect optimal settings for Adafactor * update and expand on the recommendations * style * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * flip scale_parameter to True for the 2nd recommendatoin Co-authored-by: Stas Bekman <stas@stason.org> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-31 21:03:38 -07:00
Hemil Desai	838f83d84c	Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001 ) * Add initial script for finetuning MLM models with accelerate * Add evaluation metric calculation * Fix bugs * Use no_grad on evaluation * update script docstring * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * PR feedback * Fix CI failure * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-31 18:49:45 -04:00
JohnnyC08	455f81711f	Update training_args.py (#11000 ) In the group by length documentation length is misspelled as legnth	2021-03-31 18:28:07 -04:00
Patrick von Platen	01068abdb9	add blog to docs (#10997 )	2021-03-31 18:36:00 +03:00
Sylvain Gugger	cd56f3fe7e	Merge trainers (#10975 ) * Replace is_sagemaker_distributed_available * Merge SageMakerTrainer into Trainer * Test with shorter condition * Put back deleted line * Deprecate SageMakerTrainer and SageMakerTrainingArguments * Apply suggestions from code review Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com> Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>	2021-03-31 10:01:30 -04:00
Patrick von Platen	b6dddda4d2	add notebook (#10995 )	2021-03-31 17:00:56 +03:00
Sylvain Gugger	acc3bd9d2a	Enforce string-formatting with f-strings (#10980 ) * First third * Styling and fix mistake * Quality * All the rest * Treat %s and %d * typo * Missing ) * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-31 10:00:27 -04:00
Sylvain Gugger	d0b3797a3b	Add more metadata to the user agent (#10972 ) * Add more metadata to the user agent * Fix typo * Use DISABLE_TELEMETRY * Address review comments * Use global env * Add clean envs on circle CI	2021-03-31 09:36:07 -04:00
Suraj Patil	a8549bdd82	fix example in config (#10993 )	2021-03-31 17:38:57 +05:30
Lysandre Debut	a96edb85c9	GPT Neo configuration needs to be set to use GPT2 tokenizer (#10992 )	2021-03-31 08:03:20 -04:00
Lysandre Debut	bf0840accc	Fix the checkpoint for I-BERT (#10994 )	2021-03-31 08:02:51 -04:00
Philipp Schmid	ced7284a60	Sagemaker test fix (#10987 ) * wrong makefile command * ddp test fix	2021-03-31 07:44:22 -04:00
WybeKoper	645f45c462	Fixed some typos and removed legacy url (#10989 ) * Fixed typos * Removed legacy colab notebook from readme Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>	2021-03-31 16:53:15 +05:30
Patrick von Platen	e87505f3a1	[Flax] Add other BERT classes (#10977 ) * add first code structures * add all bert models * add to init and docs * correct docs * make style	2021-03-31 09:45:58 +03:00
Yih-Dar	e031162a6b	fix md file to avoid evaluation crash (#10962 )	2021-03-30 21:26:22 +03:00
Philipp Schmid	3e09d813aa	[examples/s2s] added py7zr dep (#10971 ) * added py7zr * comment out check_min for sagemaker test * added min version again	2021-03-30 23:17:12 +05:30
Nicolas Patry	c32b432a67	Fixed a bug where the `pipeline.framework` would actually contain (#10970 ) a fully qualified model. We simply forgot to change the call for this one when this landed: https://github.com/huggingface/transformers/pull/10888 It's odd that tests didn't catch that. Should we add some ? (It's a pretty edgy test case, but it does run within the API).	2021-03-30 13:26:35 -04:00
Philipp Schmid	e3c8443f08	improved sagemaker documentation for git_config and examples (#10966 ) * improved branch usage * fixed grammar and comma	2021-03-30 18:00:52 +02:00
Suraj Patil	83d38c9ff3	GPT Neo few fixes (#10968 ) * fix checkpoint names * auto model * fix doc	2021-03-30 11:15:55 -04:00
Patrick von Platen	7772ddb473	fix big bird gpu test (#10967 )	2021-03-30 17:03:48 +03:00
Suraj Patil	860264379f	GPT Neo (#10848 ) * lets begin * boom boom * fix out proj in attn * fix attention * fix local attention * add tokenizer * fix imports * autotokenizer * fix checkpoint name * cleanup * more clean-up * more cleanup * output attentions * fix attn mask creation * fix imports * config doc * add tests * add slow tests * quality * add conversion script * copyright * typo * another bites the dust * fix attention tests * doc * add embed init in convert function * fix copies * remove tokenizer * enable caching * address review comments * improve config and create attn layer list internally * more consistent naming * init hf config from mesh-tf config json file * remove neo tokenizer from doc * handle attention_mask in local attn layer * attn_layers => attention_layers * add tokenizer_class in config * fix docstring * raise if len of attention_layers is not same as num_layers * remove tokenizer_class from config * more consistent naming * fix doc * fix checkpoint names * fp16 compat * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-30 09:42:30 -04:00
Philipp Schmid	a04eb8d369	Fix summarization notebook link (#10959 )	2021-03-30 08:28:58 -04:00
Patrick von Platen	8780caa388	[WIP][Flax] Add general conversion script (#10809 ) * save intermediate * finish first version * delete some more * improve import * fix roberta * Update src/transformers/modeling_flax_pytorch_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_flax_pytorch_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * small corrections * apply all comments * fix deterministic * make fix-copies Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-30 12:13:59 +03:00

1 2 3 4 5 ...

6917 Commits