transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	6241c873cd	Document the various LM Auto models (#8118)	2020-10-28 13:41:56 -04:00
Bram Vanroy	5193172f12	[DOC] Improve pipeline() docstrings for config and tokenizer (#8123) * Improve pipeline() docstrings * make style * Update wording for config	2020-10-28 13:26:12 -04:00
Boris Dayma	b4cacb7a63	fix(trainer_callback]: typo (#8121)	2020-10-28 12:15:30 -04:00
Stas Bekman	5423f2a9d4	[testing] port test_trainer_distributed to distributed pytest + TestCasePlus enhancements (#8107) * move the helper code into testing_utils * port test_trainer_distributed to work with pytest * improve docs * simplify notes * doc * doc * style * doc * further improvements * torch might not be available * real fix * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-28 11:51:32 -04:00
Sylvain Gugger	47dfa65b0c	New run_clm script (#8105) * New run_clm script * Formatting * More comments * Remove unused imports * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Address review comments * Change link to the hub Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-10-28 10:38:58 -04:00
Stas Bekman	8065fea870	[gh actions] run artifacts job always (#8110)	2020-10-28 01:45:19 -04:00
Sylvain Gugger	1e01db3579	Remove header	2020-10-27 17:36:13 -04:00
Sylvain Gugger	b715e40ced	Fix typo	2020-10-27 17:34:05 -04:00
Sylvain Gugger	41cc5f3f59	Move installation instructions to the top (#8106)	2020-10-27 17:32:20 -04:00
Joe Davison	556709ad92	rm multiclass option from model card	2020-10-27 17:11:43 -04:00
Sylvain Gugger	c5f3149f95	Adjust setup so that all extras run on Windows (#8102)	2020-10-27 14:39:49 -04:00
Davide Fiocco	995006eabb	Add AzureML in integrations via dedicated callback (#8062) * first attempt to add AzureML callbacks * func arg fix * var name fix, but still won't fix error... * fixing as in https://discuss.huggingface.co/t/how-to-integrate-an-azuremlcallback-for-logging-in-azure/1713/2 * Avoid lint check of azureml import * black compliance * Make isort happy * Fix point typo in docs * Add AzureML to Callbacks docs * Attempt to make sphinx happy * Format callback docs * Make documentation style happy * Make docs compliant to style Co-authored-by: Davide Fiocco <davide.fiocco@frontiersin.net>	2020-10-27 14:21:54 -04:00
Lysandre Debut	a0906068cf	Fully remove codecov (#8093)	2020-10-27 14:14:13 -04:00
Joe Davison	3e58b6b7b8	infer entailment label id on zero shot pipeline (#8059) * add entailment dim argument * rename dim -> id * fix last name change, style * rm arg, auto-infer only * typo * rm superfluous import	2020-10-27 14:09:55 -04:00
Jason Wolosonovich	9fefdb0751	DEP: pinned sentencepiece to 0.1.91 in setup.py (#8069) Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-27 14:09:31 -04:00
Stas Bekman	edd3721cd4	update/add setup targets (#8076)	2020-10-27 13:54:57 -04:00
Julien Chaumond	55bc0c599a	[model_cards] Switch to a more explicit domain for the media bucket	2020-10-27 18:08:05 +01:00
Harutaka Kawamura	7bff0af0a4	Fix a bug for `CallbackHandler.callback_list` (#8052) * Fix callback_list * Add test Signed-off-by: harupy <17039389+harupy@users.noreply.github.com> * Fix test Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>	2020-10-27 10:37:04 -04:00
Harutaka Kawamura	8e28c327fc	Fix assertion error message for MLflowCallback (#8091)	2020-10-27 10:34:51 -04:00
Sylvain Gugger	3220f21f14	Styling fix	2020-10-27 10:09:51 -04:00
Jonathan Chang	286dc19a4f	Fix IterableDataset with __len__ in Trainer (#8095)	2020-10-27 09:52:35 -04:00
Sam Shleifer	d93acd6f13	Move style_doc to extra_quality_checks (#8081)	2020-10-27 09:42:07 -04:00
Stas Bekman	bfd5e370a7	[CI] generate separate report files as artifacts (#7995) * better reports * a whole bunch of reports in their own files * clean up * improvements * github artifacts experiment * style * complete the report generator with multiple improvements/fixes * fix * save all reports under one dir to easy upload * can remove temp failing tests * doc fix * some cleanup	2020-10-27 09:25:07 -04:00
Lysandre Debut	33f6ef733a	Fix DeBERTa docs (#8092) * Fix DeBERTa docs * Tokenizer and config	2020-10-27 09:07:41 -04:00
Sylvain Gugger	c42596bc07	Doc styling fixes (#8074) * Fix a few docstrings * More fixes * Styling	2020-10-27 07:54:50 -04:00
Doug Blank	1496931b49	Fix comet_ml import and add ensure availability (#7933) * Fix comet_ml import and add ensure availability * Make isort happy * Make flake8 happy * Don't show comet_ml warn if COMET_MODE=DISABLED * Make isort happy	2020-10-27 07:31:07 -04:00
Chengxi Guo	985bba9096	fix doc bug (#8082) Signed-off-by: mymusise <mymusise1@gmail.com>	2020-10-27 07:29:25 -04:00
Sylvain Gugger	08f534d2da	Doc styling (#8067) * Important files * Styling them all * Revert "Styling them all" This reverts commit `7d029395fd`. * Syling them for realsies * Fix syntax error * Fix benchmark_utils * More fixes * Fix modeling auto and script * Remove new line * Fixes * More fixes * Fix more files * Style * Add FSMT * More fixes * More fixes * More fixes * More fixes * Fixes * More fixes * More fixes * Last fixes * Make sphinx happy	2020-10-26 18:26:02 -04:00
Sylvain Gugger	04a17f8550	Doc fixes in preparation for the docstyle PR (#8061) * Fixes in preparation for doc styling * More fixes * Better syntax * Fixes * Style * More fixes * More fixes	2020-10-26 15:01:09 -04:00
Philip May	8bbb74f211	[Model Card] new cross lingual sentence model for German and English (#8026) * mc for new cross lingual sentence model * fat text * url spelling fix * more url spelling fixes * slight thanks change * small improvements in text * multilingual word xchange * change colab link * xval fold number * add model links * line break in model names * Update README.md * Update README.md * new examples link * new examples link * add evaluation dataset name * add more about multi lingual * typo fix * typo * typos * hyperparameter typos * hyperparameter typo * add metadata * add metadata * Update README.md * typo fix * Small improvement	2020-10-26 14:48:26 -04:00
Lysandre Debut	3a10764574	Fix TF training arguments instantiation (#8063)	2020-10-26 14:39:25 -04:00
Sam Shleifer	bc9332b545	[TF] from_pt should respect authorized_unexpected_keys (#8056)	2020-10-26 13:53:27 -04:00
Stas Bekman	7ff7c4934b	fixing crash (#8057)	2020-10-26 13:19:10 -04:00
Lysandre Debut	cbad90d86d	Fix + Test (#8049)	2020-10-26 12:32:27 -04:00
Patrick von Platen	664c7ec453	[Seq2Seq Trainer] Make sure padding is implemented for models without pad_token (#8043) * make sure padding is implemented for non-padding tokens models as well * add better error message * add better warning * remove results files * Update examples/seq2seq/seq2seq_trainer.py * remove unnecessary copy line * correct usage of labels * delete test files	2020-10-26 17:28:16 +01:00
mohammadreza-Banaei73	098ddc2244	Update README.md (#8050) --wwm cant be used as an argument given run_language_modeling.py and should be changed to --whole_word_mask	2020-10-26 12:00:18 -04:00
Joe Davison	fbcddb8544	add mutliclass field to default zero shot example	2020-10-26 11:07:51 -04:00
Yusuke Mori	a9ac1db276	Minor error fix of 'bart-large-cnn' details in the pretrained_models doc (#8053)	2020-10-26 11:05:16 -04:00
Samuel	fc2d6eac3c	Minor typo fixes to the preprocessing tutorial in the docs (#8046) * Fix minor typos Fix minor typos in the docs. * Update docs/source/preprocessing.rst Clearer data structure description. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-26 10:22:29 -04:00
Joe Davison	b0a907615a	minor model card description updates (#8051)	2020-10-26 10:04:20 -04:00
noise-field	c48b16b8da	Mlflow integration callback (#8016) * Add MLflow integration class Add integration code for MLflow in integrations.py along with the code that checks that MLflow is installed. * Add MLflowCallback import Add import of MLflowCallback in trainer.py * Handle model argument Allow the callback to handle model argument and store model config items as hyperparameters. * Log parameters to MLflow in batches MLflow cannot log more than a hundred parameters at once. Code added to split the parameters into batches of 100 items and log the batches one by one. * Fix style * Add docs on MLflow callback * Fix issue with unfinished runs The "fluent" api used in MLflow integration allows only one run to be active at any given moment. If the Trainer is disposed off and a new one is created, but the training is not finished, it will refuse to log the results when the next trainer is created.	2020-10-26 09:41:58 -04:00
Lysandre Debut	8be9cb0aef	Tiny TF Bart fixes (#8023)	2020-10-26 09:29:56 -04:00
Sylvain Gugger	077478637d	Fix label name in DataCollatorForNextSentencePrediction test (#8048)	2020-10-26 09:23:12 -04:00
Sam Shleifer	8bbe8247f1	Cleanup pytorch tests (#8033)	2020-10-26 08:59:06 -04:00
suliuzh	20a0894d1a	update version for scipy (#7998)	2020-10-26 08:56:56 -04:00
Sam Shleifer	f20aec1de5	fsmt slow test uses lists (#8031)	2020-10-26 08:32:36 -04:00
Stas Bekman	101186bc1f	[docs] [testing] distributed training (#7993) * distributed training * fix * fix formatting * wording	2020-10-26 08:15:05 -04:00
luyug	c153bcc5c8	Add mixed precision evaluation (#8036) * Add mixed precision evaluation * use original flag	2020-10-26 08:12:31 -04:00
Samuel	9aa2826687	Minor typo fixes to the tokenizer summary (#8045) Minor typo fixes to the tokenizer summary	2020-10-26 08:08:33 -04:00
Lysandre	829b9f8cc3	Remove codecov.yml	2020-10-26 08:05:02 -04:00

5675 Commits