transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

Author	SHA1	Message	Date
Matt	7f20bf0d43	Fixing requirements for TF LM models and use correct model mappings (#14372) * Fixing requirements for TF LM models and use correct model mappings * make style	2021-11-11 15:34:00 +00:00
Christopher Akiki	f9c16b02e3	Replace "Masked" with "Causal" in TF CLM example (#14014)	2021-10-21 16:19:30 +01:00
Matt	702f4a49cd	Fixed CLM model still using MODEL_FOR_MASKED_LM_MAPPING (#13002)	2021-08-31 13:21:39 +01:00
Stefan Schweter	4046e66e40	examples: only use keep_linebreaks when reading TXT files (#13320) * examples: only use keep_linebreaks when reading TXT files for all CLM examples	2021-08-28 16:22:29 +02:00
Stefan Schweter	319d840b46	examples: add keep_linebreaks option to CLM examples (#13150) * examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples	2021-08-27 11:35:45 +02:00
Elysium1436	f3d0866ed9	Correct validation_split_percentage argument from int (ex:5) to float (0.05) (#12897) * Fixed train_test_split test_size argument * `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899) * set max_length and num_beams only when non None * fix instance variables * fix code style * [FLAX] Minor fixes in CLM example (#12914) * readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files * Fix module path for symbolic_trace example Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com> Co-authored-by: Stefan Schweter <stefan@schweter.it> Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-07-27 21:01:40 -04:00
Sylvain Gugger	6f1adc4334	Fix group_lengths for short datasets (#12558)	2021-07-08 07:23:41 -04:00
Souvic Chakraborty	d5b8fe3b90	Validation split added: custom data files @sgugger, @patil-suraj (#12407) * Validation split added: custom data files Validation split added in case of no validation file and loading custom data * Updated documentation with custom file usage Updated documentation with custom file usage * Update README.md * Update README.md * Update README.md * Made some suggested stylistic changes * Used logger instead of print. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Made similar changes to add validation split In case of a missing validation file, a validation split will be used now. * max_train_samples to be used for training only max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data. * styled * changed ordering * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improved language of documentation Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixed styling issue * Update run_mlm.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-01 13:22:42 -04:00
Matt	7e22609e0f	Tensorflow LM examples (#12358) * Tensorflow MLM example * Add CLM example * Style fixes, adding missing checkpoint code from the CLM example * Fix TPU training, avoid massive dataset warnings * Fix incorrect training length calculation for multi-GPU training * Refactors and nitpicks from the review * Style pass * Adding README	2021-06-28 19:31:44 +01:00

9 Commits