transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sam Shleifer	fb78a90d6a	PL: --adafactor option (#6776 )	2020-08-27 22:19:46 -04:00
Stas Bekman	92ac2fa7d1	[transformers-cli] fix logger getter (#6777 )	2020-08-27 20:01:17 -04:00
Lysandre	42fddacd1c	Format	2020-08-27 18:31:51 +02:00
Stas Bekman	70fccc5cf3	new Makefile target: docs (#6510 ) * [doc] multiple corrections to "Summary of the tasks" * add a new "docs" target to validate docs and document it * fix mixup	2020-08-27 12:25:16 -04:00
Stas Bekman	dbfe34f2f5	[test schedulers] adjust to test the first step's reading (#6429 ) * [test schedulers] small improvement * cleanup	2020-08-27 12:23:28 -04:00
Stas Bekman	e6b811f0a7	[testing] replace hardcoded paths to allow running tests from anywhere (#6523 ) * [testing] replace hardcoded paths to allow running tests from anywhere * fix the merge conflict	2020-08-27 12:22:18 -04:00
Sam Shleifer	9d1b4db2aa	add nlp install (#6767 )	2020-08-27 11:08:14 -04:00
Tom Grek	c225e872ed	Fix it to work with BART (#6756 )	2020-08-27 09:04:50 -04:00
Lysandre	0d2c111a0c	Format	2020-08-27 14:56:47 +02:00
Julien Plu	6f289dc97a	Fix the TF Trainer gradient accumulation and the TF NER example (#6713 ) * Align TF NER example over the PT one * Fix Dataset call * Fix gradient accumulation training * Apply style * Address Sylvain's comments * Address Sylvain's comments * Apply style	2020-08-27 08:45:34 -04:00
Lysandre Debut	41aa2b4ef1	Adafactor docs (#6765 )	2020-08-27 05:16:50 -04:00
Nikolai Yakovenko	971d1802d0	Add AdaFactor optimizer from fairseq (#6722 ) * AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM. * update PR fixes, add basic test * bug -- incorrect params in test * bugfix -- import Adafactor into test * bugfix -- removed accidental T5 include * resetting T5 to master * bugfix -- include Adafactor in __init__ * longer loop for adafactor test * remove double error class declare * lint * black * isort * Update src/transformers/optimization.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * single docstring * Cleanup docstring Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-27 04:58:13 -04:00
Sam Shleifer	4bd7be9a42	s2s distillation uses AutoModelForSeqToSeqLM (#6761 )	2020-08-26 23:25:11 -04:00
Ahmed Elnaggar	05e7150a53	create ProtBert-BFD model card. (#6724 )	2020-08-27 02:19:19 +02:00
Sam Shleifer	61518e2df3	[s2s] run_eval.py QOL improvements and cleanup(#6746 )	2020-08-26 18:59:20 -04:00
Igli Manaj	434936f34a	Model Card for Multilingual Passage Reranking BERT (#6755 )	2020-08-26 18:00:27 -04:00
Joe Davison	10a34501f1	add __init__.py to utils (#6754 )	2020-08-26 23:51:10 +02:00
Ali Safaya	61b9ed8074	Model card for kuisailab/albert-large-arabic (#6730 ) * Create README.md * Update README.md	2020-08-26 17:27:56 -04:00
Ali Safaya	8e0d51e4f2	Model card for kuisailab/albert-xlarge-arabic (#6731 ) * Create README.md * Update README.md	2020-08-26 17:27:42 -04:00
Ali Safaya	70c96a10e9	Model card for kuisailab/albert-base-arabic (#6729 ) * Create README.md * Update README.md	2020-08-26 17:27:34 -04:00
Sagor Sarker	cc4ba79f68	added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727 ) * added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * Update README.md	2020-08-26 17:26:32 -04:00
Tanmay Thakur	e10fb9cbe6	Create model card for lordtt13/COVID-SciBERT (#6718 )	2020-08-26 17:22:25 -04:00
Adam Montgomerie	baeba53e88	Adding model cards for 5 models (#6703 ) * Added model cards for 4 models Added model cards for: - roberta-base-bulgarian - roberta-base-bulgarian-pos - roberta-small-bulgarian - roberta-small-bulgarian-pos * fixed link text * Update README.md * Create README.md * removed trailing bracket * Add language metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-26 17:20:55 -04:00
Julien Chaumond	3242e4d942	[model_cards] Fix tiny typos	2020-08-26 23:16:06 +02:00
Joe Davison	99407f9d1e	add xlm-roberta-large-xnli model card (#6723 ) * add xlm-roberta-large-xnli model card * update pt example * typo	2020-08-26 16:05:59 -04:00
Patrick von Platen	858b7d5873	[TF Longformer] Improve Speed for TF Longformer (#6447 ) * add tf graph compile tests * fix conflict * remove more tf transpose statements * fix conflicts * fix comment typos * move function to class function * fix black * fix black * make style	2020-08-26 14:55:41 -04:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Lysandre	e78c110338	isort 5	2020-08-26 17:13:49 +02:00
Julien Plu	02e8cd5584	Fix optimizer (#6717 )	2020-08-26 11:12:44 -04:00
Lysandre Debut	77abd1e79f	Centralize logging (#6434 ) * Logging * Style * hf_logging > utils.logging * Address @thomwolf's comments * Update test * Update src/transformers/benchmark/benchmark_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Revert bad change Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-26 11:10:36 -04:00
Jay Yip	461ae86812	Fix tf boolean mask in graph mode (#6741 )	2020-08-26 05:15:35 -04:00
Patrick von Platen	925f34bbbd	Add "tie_word_embeddings" config param (#6692 ) * add tie_word_embeddings * correct word embeddings in modeling utils * make style * make config param only relevant for torch * make style * correct typo * delete deprecated arg in transo-xl	2020-08-26 04:58:21 -04:00
Patrick von Platen	fa8ee8e855	fix torchscript docs (#6740 )	2020-08-26 04:51:56 -04:00
Sylvain Gugger	64c7c2bc15	Install nlp for github actions test (#6728 )	2020-08-25 14:58:38 -04:00
Sam Shleifer	624495706c	T5Tokenizer adds EOS token if not already added (#5866 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-25 14:56:08 -04:00
Sam Shleifer	e11d923bfc	Fix pegasus-xsum integration test (#6726 )	2020-08-25 14:06:28 -04:00
Tomo Lazovich	7e6397a7d8	[squad] make examples and dataset accessible from SquadDataset object (#6710 ) * [squad] make examples and dataset accessible from SquadDataset object * [squad] add support for legacy cache files	2020-08-25 13:32:56 -04:00
Funtowicz Morgan	ac9702c284	Fix ONNX test_quantize unittest (#6716 )	2020-08-25 13:24:40 -04:00
Zane Lim	074340339a	Create README.md (#6721 ) add model card for singbert large	2020-08-26 00:11:24 +08:00
Patrick von Platen	d17cce2270	add missing keys (#6719 )	2020-08-25 11:38:51 -04:00
Arnav Sharma	a25c9fc8e1	Selected typo fix (#6687 )	2020-08-25 15:39:02 +02:00
Funtowicz Morgan	625318f525	tensor.nonzero() is deprecated in PyTorch 1.6 (#6715 ) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-08-25 08:12:54 -04:00
Sylvain Gugger	124c3d6adc	Add tokenizer to Trainer (#6689 )	2020-08-25 07:47:09 -04:00
Sylvain Gugger	abc0202194	More tests to Trainer (#6699 ) * More tests to Trainer * Add warning in the doc	2020-08-25 07:07:36 -04:00
Sylvain Gugger	f5bad031bc	Use generators tqdm progressbars (#6696 )	2020-08-25 07:06:58 -04:00
Sam Shleifer	a99d09c6f9	add new line to make examples run (#6706 )	2020-08-25 06:26:29 -04:00
Joel Hanson	4db2fa77d7	Allow tests in examples to use cuda or fp16,if they are available (#5512 ) * Allow tests in examples to use cuda or fp16,if they are available The tests in examples didn't use the cuda or fp16 even if they where available. - The text classification example (`run_glue.py`) didn't use the fp16 even if it was available but the device was take based on the availablity(cuda/cpu). - The language-modeling example (`run_language_modeling.py`) was having `--no_cuda` argument which made the test to work without cuda. This example is having issue when running with fp16 thus it not enabled (got an assertion error for perplexity due to it higher value). - The cuda and fp16 is not enabled for question-answering example (`run_squad.py`) as it is having a difference in the f1 score. - The text-generation example (`run_generation.py`) will take the cuda or fp16 whenever it is available. Resolves some of: #5057 * Unwanted import of is_apex_available was removed * Made changes to test examples file to have the pass --fp16 only if cuda and apex is avaliable - run_glue.py: Removed the check for cuda and fp16. - run_generation.py: Removed the check for cuda and fp16 also removed unwanted flag creation. * Incorrectly sorted imports fixed * The model needs to be converted to half precision * Formatted single line if condition statement to multiline * The torch_device also needed to be checked before running the test on examples - The tests in examples which uses cuda should also depend from the USE_CUDA flag, similarly to the rest of the test suite. Even if we decide to set USE_CUDA to True by default, setting USE_CUDA to False should result in the examples not using CUDA * Format some of the code in test_examples file * The improper import of is_apex_available was sorted * Formatted the code to keep the style standards * The comma at the end of list giving a flake8 issue was fixed * Import sort was fixed * Removed the clean_test_dir function as its not used right now	2020-08-25 06:02:07 -04:00
Yohei Tamura	841f071569	Add typing.overload for convert_ids_tokens (#6637 ) * add overload for type checker * black	2020-08-25 04:57:08 -04:00
Quentin Lhoest	0f16dd0ac2	Add DPR to models summary (#6690 ) * add dpr to models summary * minor * minor * Update docs/source/model_summary.rst qa -> question answering Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_summary.rst qa -> question ansering (cont'd) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-25 09:57:28 +02:00
Jay	4fca874ea9	Remove hard-coded uses of float32 to fix mixed precision use (#6648 )	2020-08-25 15:42:32 +08:00

1 2 3 4 5 ...

4993 Commits