transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Nikolai Yakovenko	971d1802d0	Add AdaFactor optimizer from fairseq (#6722 ) * AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM. * update PR fixes, add basic test * bug -- incorrect params in test * bugfix -- import Adafactor into test * bugfix -- removed accidental T5 include * resetting T5 to master * bugfix -- include Adafactor in __init__ * longer loop for adafactor test * remove double error class declare * lint * black * isort * Update src/transformers/optimization.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * single docstring * Cleanup docstring Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-27 04:58:13 -04:00
Sam Shleifer	4bd7be9a42	s2s distillation uses AutoModelForSeqToSeqLM (#6761 )	2020-08-26 23:25:11 -04:00
Ahmed Elnaggar	05e7150a53	create ProtBert-BFD model card. (#6724 )	2020-08-27 02:19:19 +02:00
Sam Shleifer	61518e2df3	[s2s] run_eval.py QOL improvements and cleanup (#6746 )	2020-08-26 18:59:20 -04:00
Igli Manaj	434936f34a	Model Card for Multilingual Passage Reranking BERT (#6755 )	2020-08-26 18:00:27 -04:00
Joe Davison	10a34501f1	add __init__.py to utils (#6754 )	2020-08-26 23:51:10 +02:00
Ali Safaya	61b9ed8074	Model card for kuisailab/albert-large-arabic (#6730 ) * Create README.md * Update README.md	2020-08-26 17:27:56 -04:00
Ali Safaya	8e0d51e4f2	Model card for kuisailab/albert-xlarge-arabic (#6731 ) * Create README.md * Update README.md	2020-08-26 17:27:42 -04:00
Ali Safaya	70c96a10e9	Model card for kuisailab/albert-base-arabic (#6729 ) * Create README.md * Update README.md	2020-08-26 17:27:34 -04:00
Sagor Sarker	cc4ba79f68	added model card for codeswitch-spaeng-sentiment-analysis-lince (#6727 ) * added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * Update README.md	2020-08-26 17:26:32 -04:00
Tanmay Thakur	e10fb9cbe6	Create model card for lordtt13/COVID-SciBERT (#6718 )	2020-08-26 17:22:25 -04:00
Adam Montgomerie	baeba53e88	Adding model cards for 5 models (#6703 ) * Added model cards for 4 models Added model cards for: - roberta-base-bulgarian - roberta-base-bulgarian-pos - roberta-small-bulgarian - roberta-small-bulgarian-pos * fixed link text * Update README.md * Create README.md * removed trailing bracket * Add language metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-08-26 17:20:55 -04:00
Julien Chaumond	3242e4d942	[model_cards] Fix tiny typos	2020-08-26 23:16:06 +02:00
Joe Davison	99407f9d1e	add xlm-roberta-large-xnli model card (#6723 ) * add xlm-roberta-large-xnli model card * update pt example * typo	2020-08-26 16:05:59 -04:00
Patrick von Platen	858b7d5873	[TF Longformer] Improve Speed for TF Longformer (#6447 ) * add tf graph compile tests * fix conflict * remove more tf transpose statements * fix conflicts * fix comment typos * move function to class function * fix black * fix black * make style	2020-08-26 14:55:41 -04:00
Lysandre	a75c64d80c	Black 20 release	2020-08-26 17:20:22 +02:00
Lysandre	e78c110338	isort 5	2020-08-26 17:13:49 +02:00
Julien Plu	02e8cd5584	Fix optimizer (#6717 )	2020-08-26 11:12:44 -04:00
Lysandre Debut	77abd1e79f	Centralize logging (#6434 ) * Logging * Style * hf_logging > utils.logging * Address @thomwolf's comments * Update test * Update src/transformers/benchmark/benchmark_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Revert bad change Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-26 11:10:36 -04:00
Jay Yip	461ae86812	Fix tf boolean mask in graph mode (#6741 )	2020-08-26 05:15:35 -04:00
Patrick von Platen	925f34bbbd	Add "tie_word_embeddings" config param (#6692 ) * add tie_word_embeddings * correct word embeddings in modeling utils * make style * make config param only relevant for torch * make style * correct typo * delete deprecated arg in transo-xl	2020-08-26 04:58:21 -04:00
Patrick von Platen	fa8ee8e855	fix torchscript docs (#6740 )	2020-08-26 04:51:56 -04:00
Sylvain Gugger	64c7c2bc15	Install nlp for github actions test (#6728 )	2020-08-25 14:58:38 -04:00
Sam Shleifer	624495706c	T5Tokenizer adds EOS token if not already added (#5866 ) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-25 14:56:08 -04:00
Sam Shleifer	e11d923bfc	Fix pegasus-xsum integration test (#6726 )	2020-08-25 14:06:28 -04:00
Tomo Lazovich	7e6397a7d8	[squad] make examples and dataset accessible from SquadDataset object (#6710 ) * [squad] make examples and dataset accessible from SquadDataset object * [squad] add support for legacy cache files	2020-08-25 13:32:56 -04:00
Funtowicz Morgan	ac9702c284	Fix ONNX test_quantize unittest (#6716 )	2020-08-25 13:24:40 -04:00
Zane Lim	074340339a	Create README.md (#6721 ) add model card for singbert large	2020-08-26 00:11:24 +08:00
Patrick von Platen	d17cce2270	add missing keys (#6719 )	2020-08-25 11:38:51 -04:00
Arnav Sharma	a25c9fc8e1	Selected typo fix (#6687 )	2020-08-25 15:39:02 +02:00
Funtowicz Morgan	625318f525	tensor.nonzero() is deprecated in PyTorch 1.6 (#6715 ) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>	2020-08-25 08:12:54 -04:00
Sylvain Gugger	124c3d6adc	Add tokenizer to Trainer (#6689 )	2020-08-25 07:47:09 -04:00
Sylvain Gugger	abc0202194	More tests to Trainer (#6699 ) * More tests to Trainer * Add warning in the doc	2020-08-25 07:07:36 -04:00
Sylvain Gugger	f5bad031bc	Use generators tqdm progressbars (#6696 )	2020-08-25 07:06:58 -04:00
Sam Shleifer	a99d09c6f9	add new line to make examples run (#6706 )	2020-08-25 06:26:29 -04:00
Joel Hanson	4db2fa77d7	Allow tests in examples to use cuda or fp16, if they are available (#5512 ) * Allow tests in examples to use cuda or fp16, if they are available The tests in examples didn't use cuda or fp16 even if they were available. - The text classification example (`run_glue.py`) didn't use fp16 even if it was available, but the device was chosen based on availability (cuda/cpu). - The language-modeling example (`run_language_modeling.py`) had a `--no_cuda` argument, which made the test run without cuda. This example has an issue when running with fp16, so fp16 is not enabled (got an assertion error for perplexity due to its higher value). - cuda and fp16 are not enabled for the question-answering example (`run_squad.py`), as there is a difference in the f1 score. - The text-generation example (`run_generation.py`) will use cuda or fp16 whenever they are available. Resolves some of: #5057 * Unwanted import of is_apex_available was removed * Made changes to test examples file to pass --fp16 only if cuda and apex are available - run_glue.py: Removed the check for cuda and fp16. - run_generation.py: Removed the check for cuda and fp16, also removed unwanted flag creation. * Incorrectly sorted imports fixed * The model needs to be converted to half precision * Formatted single line if condition statement to multiline * The torch_device also needed to be checked before running the test on examples - The tests in examples which use cuda should also depend on the USE_CUDA flag, similarly to the rest of the test suite. Even if we decide to set USE_CUDA to True by default, setting USE_CUDA to False should result in the examples not using CUDA * Format some of the code in test_examples file * The improper import of is_apex_available was sorted * Formatted the code to keep the style standards * The comma at the end of list giving a flake8 issue was fixed * Import sort was fixed * Removed the clean_test_dir function as it's not used right now	2020-08-25 06:02:07 -04:00
Yohei Tamura	841f071569	Add typing.overload for convert_ids_tokens (#6637 ) * add overload for type checker * black	2020-08-25 04:57:08 -04:00
Quentin Lhoest	0f16dd0ac2	Add DPR to models summary (#6690 ) * add dpr to models summary * minor * minor * Update docs/source/model_summary.rst qa -> question answering Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_summary.rst qa -> question answering (cont'd) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-25 09:57:28 +02:00
Jay	4fca874ea9	Remove hard-coded uses of float32 to fix mixed precision use (#6648 )	2020-08-25 15:42:32 +08:00
Sam Shleifer	0344428f79	[s2s] round bleu, rouge to 4 digits (#6704 )	2020-08-25 00:33:11 -04:00
Zane Lim	b6512d2357	Add model card for singbert. (#6674 ) * Add model card for singbert. Adding a model card for singbert- bert for singlish and manglish. * Update README.md Add additional tags and model name. * Update README.md Fix tag for malay. * Update model_cards/zanelim/singbert/README.md Fix language Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> * Add examples and custom widget input. Add examples and custom widget input. Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-08-25 10:09:13 +08:00
Sylvain Gugger	d20cbb886b	Fix hyperparameter_search doc (#6695 )	2020-08-24 21:04:08 -04:00
Sam Shleifer	0ebc9699fa	[fixdoc] Add import to pegasus usage doc (#6698 )	2020-08-24 15:54:57 -04:00
Sylvain Gugger	6b4c617666	Move unused args to kwargs (#6694 )	2020-08-24 13:20:03 -04:00
Stas Bekman	912a21ec78	remove BartForConditionalGeneration.generate (#6659 ) As suggested here: https://github.com/huggingface/transformers/issues/6651#issuecomment-678594233 this removes generic `generate` doc with examples not-relevant to bart.	2020-08-25 00:42:34 +08:00
Stas Bekman	a8d6716ecb	Create PULL_REQUEST_TEMPLATE.md (#6660 ) * Create PULL_REQUEST_TEMPLATE.md Proposing to copy this neat feature from pytorch. This is a small template that lets a PR submitter tell which issue the PR closes. * Update .github/PULL_REQUEST_TEMPLATE.md Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>	2020-08-25 00:30:38 +08:00
Sylvain Gugger	8f98faf934	Lat fix for Ray HP search (#6691 )	2020-08-24 12:15:00 -04:00
Sylvain Gugger	3a7fdd3f52	Add hyperparameter search to Trainer (#6576 ) * Add optuna hyperparameter search to Trainer * @julien-c suggestions Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Make compute_objective an arg function * Formatting * Rework to make it easier to add ray * Formatting * Initial support for Ray * Formatting * Polish and finalize * Add trial id to checkpoint with Ray * Smaller default * Use GPU in ray if available * Formatting * Fix test * Update install instruction Co-authored-by: Richard Liaw <rliaw@berkeley.edu> * Address review comments * Formatting post-merge Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-08-24 11:48:45 -04:00
vblagoje	dd522da004	Fix PL token classification examples (#6682 )	2020-08-24 11:30:06 -04:00
Sylvain Gugger	a573777901	Update repo to isort v5 (#6686 ) * Run new isort * More changes * Update CI, CONTRIBUTING and benchmarks	2020-08-24 11:03:01 -04:00


4982 Commits