Commit Graph

5522 Commits

Author SHA1 Message Date
Patrick von Platen
9f1544b9e0
Create README.md 2020-09-18 11:37:20 +02:00
Sameer Zahid
5c1d5ea667
Fixed typo in README (#7233) 2020-09-18 04:52:43 -04:00
Yuta Hayashibe
7719ecd19f
Fix a typo (#7225) 2020-09-18 04:23:33 -04:00
Manuel Romero
4a26e8ac5f
Create README.md (#7205) 2020-09-18 03:24:30 -04:00
Manuel Romero
94320c5b81
Add customized text to widget (#7204) 2020-09-18 03:24:23 -04:00
Manuel Romero
3aefb24b20
Create README.md (#7209) 2020-09-18 03:24:10 -04:00
Manuel Romero
a22e7a8dd4
Create README.md (#7210) 2020-09-18 03:23:58 -04:00
Manuel Romero
c028b26481
Create README.md (#7212) 2020-09-18 03:23:49 -04:00
Genta Indra Winata
c7cdd7b4fd
Create README.md for indobert-lite-base-p1 (#7182) 2020-09-18 03:22:32 -04:00
Genta Indra Winata
bfb9150b8f
Create README.md for indobert-lite-large-p1 (#7184)
* Create README.md

* Update README.md
2020-09-18 03:22:11 -04:00
Genta Indra Winata
d193593403
Create README.md (#7183) 2020-09-18 03:21:54 -04:00
Genta Indra Winata
e65d846674
Create README.md (#7185) 2020-09-18 03:21:39 -04:00
Genta Indra Winata
e27d86d48d
Create README.md for indobert-large-p2 model card (#7181) 2020-09-18 03:21:28 -04:00
Genta Indra Winata
881c0783e9
Create README.md for indobert-large-p1 model card (#7180) 2020-09-18 03:21:16 -04:00
Genta Indra Winata
e0d58a5c87
Create README.md (#7179) 2020-09-18 03:20:59 -04:00
Genta Indra Winata
1313a1d2a8
Create README.md for indobert-base-p2 (#7178) 2020-09-18 03:20:29 -04:00
tuner007
cf24f43e76
Create README.md (#7095)
Create model card for Pegasus QA
2020-09-18 03:19:45 -04:00
Sam Shleifer
67d9fc50d9
[s2s] remove double assert (#7223) 2020-09-17 18:32:31 -04:00
Stas Bekman
edbaad2c5c
[model cards] fix metadata - 3rd attempt (#7218) 2020-09-17 16:57:06 -04:00
Stas Bekman
999a1c957a
skip failing FSMT CUDA tests until investigated (#7220) 2020-09-17 16:53:14 -04:00
Stas Bekman
51c4adf54c
[model cards] fix dataset yaml (#7216) 2020-09-17 15:29:39 -04:00
Sam Shleifer
a5638b2b3a
[s2s] dynamic batch size with --max_tokens_per_batch (#7030) 2020-09-17 15:19:34 -04:00
Stas Bekman
efeab6a3f1
[s2s] run_eval/run_eval_search tweaks (#7192)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-17 14:26:38 -04:00
Stas Bekman
9c5bcab5b0
[model cards] fix yaml in cards (#7207) 2020-09-17 14:11:17 -04:00
Sohee Yang
e643a29722
Change to use relative imports in some files & Add Python prompt symbols to example code (#7202)
* Move 'from transformers' statements to relative imports in some files

* Add Python prompt symbols in front of the example code

* Reformat the code

* Add one missing space

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-09-17 12:30:45 -04:00
Stas Bekman
0fe6e435b6
[model cards] ported allenai Deep Encoder, Shallow Decoder models (#7153)
* [model cards] ported allenai Deep Encoder, Shallow Decoder models

* typo

* fix references

* add allenai/wmt19-de-en-6-6 model cards

* fill in the missing info for the build script as provided by the searcher.
2020-09-17 17:58:49 +02:00
Stas Bekman
1eeb206bef
[ported model] FSMT (FairSeq MachineTranslation) (#6940)
* ready for PR

* cleanup

* correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST

* fix

* perfectionism

* revert change from another PR

* odd, already committed this one

* non-interactive upload workaround

* backup the failed experiment

* store langs in config

* workaround for localizing model path

* doc clean up as in https://github.com/huggingface/transformers/pull/6956

* style

* back out debug mode

* document: run_eval.py --num_beams 10

* remove unneeded constant

* typo

* re-use bart's Attention

* re-use EncoderLayer, DecoderLayer from bart

* refactor

* send to cuda and fp16

* cleanup

* revert (moved to another PR)

* better error message

* document run_eval --num_beams

* solve the problem of tokenizer finding the right files when model is local

* polish, remove hardcoded config

* add a note that the file is autogenerated to avoid losing changes

* prep for org change, remove unneeded code

* switch to model4.pt, update scores

* s/python/bash/

* missing init (but doesn't impact the finetuned model)

* cleanup

* major refactor (reuse-bart)

* new model, new expected weights

* cleanup

* cleanup

* full link

* fix model type

* merge porting notes

* style

* cleanup

* have to create a DecoderConfig object to handle vocab_size properly

* doc fix

* add note (not a public class)

* parametrize

* add bleu scores integration tests

* skip test if sacrebleu is not installed

* cache heavy models/tokenizers

* some tweaks

* remove tokens that aren't used

* more purging

* simplify code

* switch to using decoder_start_token_id

* add doc

* Revert "major refactor (reuse-bart)"

This reverts commit 226dad15ca.

* decouple from bart

* remove unused code #1

* remove unused code #2

* remove unused code #3

* update instructions

* clean up

* move bleu eval to examples

* check import only once

* move data+gen script into files

* reuse via import

* take less space

* add prepare_seq2seq_batch (auto-tested)

* cleanup

* recode test to use json instead of yaml

* ignore keys not needed

* use the new -y in transformers-cli upload -y

* [xlm tok] config dict: fix str into int to match definition (#7034)

* [s2s] --eval_max_generate_length (#7018)

* Fix CI with change of name of nlp (#7054)

* nlp -> datasets

* More nlp -> datasets

* Woopsie

* More nlp -> datasets

* One last

* extending to support allen_nlp wmt models

- allow a specific checkpoint file to be passed
- more arg settings
- scripts for allen_nlp models

* sync with changes

* s/fsmt-wmt/wmt/ in model names

* s/fsmt-wmt/wmt/ in model names (p2)

* s/fsmt-wmt/wmt/ in model names (p3)

* switch to a better checkpoint

* typo

* make non-optional args truly non-optional - adjust tests where possible or skip when there is no other choice

* consistency

* style

* adjust header

* cards moved (model rename)

* use best custom hparams

* update info

* remove old cards

* cleanup

* s/stas/facebook/

* update scores

* s/allen_nlp/allenai/

* url maps aren't needed

* typo

* move all the doc / build / eval generators to their own scripts

* cleanup

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* fix indent

* duplicated line

* style

* use the correct add_start_docstrings

* oops

* resizing can't be done with the core approach, due to 2 dicts

* check that the arg is a list

* style

* style

Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-09-17 11:31:29 -04:00
Sylvain Gugger
492bb6aa48
Trainer multi label (#7191)
* Trainer accepts multiple labels

* Missing import

* Fix docstrings
2020-09-17 08:15:37 -04:00
RafaelWO
709745927b
Transformer-XL: Remove unused parameters (#7087)
* Removed 'tgt_len' and 'ext_len' from Transformer-XL

* Some changes are still to be done

* Removed 'tgt_len' and 'ext_len' from Transformer-XL (2)

* Removed comments
* Fixed quality

* Changed warning to info
2020-09-17 06:10:34 -04:00
Dhaval Taunk
c183d81e27
added multilabel text classification notebook using distilbert to community notebooks (#7201)
* added multilabel classification using distilbert notebook to community notebooks

* added multilabel classification using distilbert notebook to community notebooks
2020-09-17 05:58:57 -04:00
Stas Bekman
79111b77d2
remove deprecated flag (#7171)
```
/home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive!
  "W0501: The following deprecated CLI flags were used and ignored: "
```
2020-09-17 05:52:12 -04:00
Stas Bekman
0cdafbf7ec
remove duplicated code (#7173) 2020-09-17 05:51:40 -04:00
Sam Shleifer
45b0b1ff2f
[s2s] fix kwarg typo (#7196) 2020-09-16 21:58:57 -04:00
Sam Shleifer
0203ad43bc
[s2s] distributed eval cleanup (#7186) 2020-09-16 15:38:37 -04:00
sgugger
3babef815c
Formatting 2020-09-16 14:57:09 -04:00
Stas Bekman
42049b8e12
use the correct add_start_docstrings (#7174) 2020-09-16 14:40:35 -04:00
Stas Bekman
fdaf8ab349
[s2s run_eval] new features (#7109)
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
2020-09-16 13:59:57 -04:00
Antoine Louis
df165065c3
[model_cards] antoiloui/belgpt2 🇧🇪 (#7166)
* Create README.md

* Update README.md
2020-09-16 12:16:01 -04:00
Sylvain Gugger
108c9aefcc
Update README (#7133)
* Rewrite and update README

* Typo and migration guide

* Apply suggestions from code review

Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Address Clem's comments

Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
2020-09-16 12:12:12 -04:00
Donna Choi
9e376e156a
Add condition (#7161) 2020-09-16 09:15:10 -04:00
Stas Bekman
f8590c56e6
[doc] improve/expand the Parametrization section (#7156) 2020-09-16 08:45:50 -04:00
Stas Bekman
d3391c87fe
build/eval/gen-card scripts for fsmt (#7155)
* build/eval/gen-card scripts for fsmt

* adjust for model renames
2020-09-16 08:41:26 -04:00
Xi Ye
08bfc1718a
fix the warning message of overflowed sequence (#7151) 2020-09-16 07:40:57 -04:00
Julien Plu
af8425b749
Refactoring the TF activation functions (#7150)
* Refactoring the activation functions into a common file

* Apply style

* remove unused import

* fix tests

* Fix tests.
2020-09-16 07:03:47 -04:00
Stas Bekman
b00cafbde5
[docs] add testing documentation (#7101)
* [docs] add testing documentation

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweaks as suggested

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweaks

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/testing.rst

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more tweaks

* suggestions from @LysandreJik

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-09-15 19:25:25 -04:00
Patrick von Platen
85ffda96fc
fix encoder decoder kwargs (#7131) 2020-09-15 21:10:07 +02:00
Yih-Dar
4c62c6021a
fix ZeroDivisionError and epoch counting (#7125)
* fix ZeroDivisionError and epoch counting

* Add test for num_train_epochs calculation in trainer.py

* Remove @require_non_multigpu for test_num_train_epochs_in_training
2020-09-15 11:51:50 -04:00
Patrick von Platen
7af2791d77
Create README.md 2020-09-15 16:47:36 +02:00
Sylvain Gugger
153ec2f154
Funnel model cards (#7147) 2020-09-15 10:40:57 -04:00
Sylvain Gugger
7186ca6240
Multi predictions trainer (#7126)
* Allow multiple outputs

* Formatting

* Move the unwrapping before metrics

* Fix typo

* Add test for non-supported config options
2020-09-15 10:27:24 -04:00