transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Sylvain Gugger	c67d1a0259	Tf model outputs (#6247 ) * TF outputs and test on BERT * Albert to DistilBert * All remaining TF models except T5 * Documentation * One file forgotten * TF outputs and test on BERT * Albert to DistilBert * All remaining TF models except T5 * Documentation * One file forgotten * Add new models and fix issues * Quality improvements * Add T5 * A bit of cleanup * Fix for slow tests * Style	2020-08-05 11:34:39 -04:00
Teven	bd0eab351a	Trainer + wandb quality of life logging tweaks (#6241 ) * added `name` argument for wandb logging, also logging model config with trainer arguments * Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * added tf, post-review changes Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-08-05 09:05:52 -04:00
Julien Plu	33966811bd	Add SequenceClassification and MultipleChoice TF models to Electra (#6227 ) * Add SequenceClassification and MultipleChoice TF models to Electra * Apply style * Add summary_proj_to_labels to Electra config * Finally mirroring the PT version of these models * Apply style * Fix Electra test	2020-08-05 09:04:27 -04:00
Stas Bekman	376c02e9a9	[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232 ) * support --lr_scheduler with multiple possibilities * correct the error message * add a note about supported schedulers * cleanup * cleanup2 * needs the argument default * style * add another assert in the test * implement requested changes * cleanups * fix relative import * cleanup	2020-08-05 09:01:17 -04:00
Zhu Baohe	d89acd07cc	fix (#6257 )	2020-08-05 07:37:57 -04:00
Ninnart Fuengfusin	24c5a6e351	Update optimization.py (#6261 )	2020-08-05 07:34:57 -04:00
Lilian Bordeau	ed6b8f3128	Update to match renamed attributes in fairseq master (#5972 ) * Update to match renamed attributes in fairseq master RobertaModel no longer have model.encoder and args.num_classes attributes as of 5/28/20. * Quality Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-08-05 07:23:55 -04:00
Ali Safaya	d9149f00d1	Update README.md (#6201 )	2020-08-04 17:44:14 -04:00
Ali Safaya	ddfdbb86c1	Update README.md (#6200 )	2020-08-04 17:44:05 -04:00
Ali Safaya	4f67955662	Update README.md (#6199 )	2020-08-04 17:43:48 -04:00
Ali Safaya	869ec441c9	Update README.md (#6198 )	2020-08-04 17:43:38 -04:00
Adam Montgomerie	5177dca634	Create README.md (#6123 )	2020-08-04 17:42:53 -04:00
Manuel Romero	3f30ebe6ca	Create README.md (#6075 )	2020-08-04 17:41:23 -04:00
Binny Mathew	aa7c22a283	Update Model Card (#6246 ) Added citation and paper links.	2020-08-04 17:40:47 -04:00
Joe Davison	972535ea74	fix zero shot pipeline docs (#6245 )	2020-08-04 16:37:49 -04:00
Timo Moeller	5920a37a4c	Add license info to German Bert models (#6242 ) * Add xlm-r QA model card * Add tags * Add license info to german bert	2020-08-04 13:40:49 -04:00
Patrick von Platen	6c9ba1d8fc	[Reformer] Make random seed generator available on random seed and not on model device (#6244 ) * improve if else statement random seeds * Apply suggestions from code review * Update src/transformers/modeling_reformer.py	2020-08-04 13:22:43 -04:00
Sam Shleifer	d5b0a0e235	mBART Conversion script (#6230 )	2020-08-04 09:53:51 -04:00
Stas Bekman	268bf34630	typo (#6225 )	2020-08-04 09:31:49 -04:00
Patrick von Platen	7f65daa2e1	fix reformer fp16 (#6237 )	2020-08-04 13:02:25 +02:00
Andrés Felipe Cruz	7ea9b2db37	Encoder decoder config docs (#6195 ) * Adding docs for how to load encoder_decoder pretrained model with individual config objects * Adding docs for loading encoder_decoder config from pretrained folder * Fixing W293 blank line contains whitespace * Update src/transformers/modeling_encoder_decoder.py * Update src/transformers/modeling_encoder_decoder.py * Update src/transformers/modeling_encoder_decoder.py * Apply suggestions from code review model file should only show examples for how to load save model * Update src/transformers/configuration_encoder_decoder.py * Update src/transformers/configuration_encoder_decoder.py * fix space Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2020-08-04 09:23:28 +02:00
Lysandre Debut	1d5c3a3d96	Test with --no-cache-dir (#6235 )	2020-08-04 03:20:19 -04:00
Sam Shleifer	6730ecdd3c	Remove redundant coverage (#6224 )	2020-08-04 02:59:21 -04:00
Stas Bekman	5deed37f9f	cleanup torch unittests (#6196 ) * improve unit tests this is a sample of one test according to the request in https://github.com/huggingface/transformers/issues/5973 before I apply it to the rest * batch 1 * batch 2 * batch 3 * batch 4 * batch 5 * style * non-tf template * last deletion of check_loss_output	2020-08-04 02:42:56 -04:00
Gong Linyuan	b390a5672a	Make the order of additional special tokens deterministic (#5704 ) * Make the order of additional special tokens deterministic regardless of hash seeds * Fix	2020-08-04 02:38:30 -04:00
Lysandre Debut	d740351f7d	Upgrade pip when doing CI (#6234 ) * Upgrade pip when doing CI * Don't forget Github CI	2020-08-04 02:37:12 -04:00
Sam Shleifer	57eb1cb68d	[s2s] Document better mbart finetuning command (#6229 ) * Document better MT command * improve multigpu command	2020-08-03 18:22:31 -04:00
Victor SANH	0513f8d275	correct label extraction + add note on discrepancies on trained MNLI model and HANS (#6221 )	2020-08-03 15:02:51 -04:00
Kevin Canwen Xu	3c289fb38c	Remove outdated BERT tips (#6217 ) * Remove out-dated BERT tips * Update modeling_outputs.py * Update bert.rst * Update bert.rst	2020-08-04 01:17:56 +08:00
Sylvain Gugger	e4920c92d6	Doc pipelines (#6175 ) * Init work on pipelines doc * Work in progress * Work in progress * Doc pipelines * Rm unwanted default * Apply suggestions from code review Lysandre comments Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-08-03 11:44:46 -04:00
Sam Shleifer	b6b2f2270f	s2s: fix LR logging, remove some dead code. (#6205 )	2020-08-03 10:36:26 -04:00
Maurice Gonzenbach	06f1692b02	Fix _shift_right function in TFT5PreTrainedModel (#6214 )	2020-08-03 16:21:23 +02:00
Suraj Patil	0b41867357	fix labels (#6213 )	2020-08-03 10:19:35 -04:00
Jay Mody	cedc547e7e	Adds train_batch_size, eval_batch_size, and n_gpu to to_sanitized_dict output for logging. (#5331 ) * Adds train_batch_size, eval_batch_size, and n_gpu to to_sanitized_dict() output * Update wandb config logging to use to_sanitized_dict * removed n_gpu from sanitized dict * fix quality check errors	2020-08-03 09:00:39 -04:00
Julien Plu	9996f697e3	Fix saved model creation (#5468 ) * Fix TF Serving when output_hidden_states and output_attentions are True * Add tests for saved model creation + bug fix for multiple choices models * remove unused import * Fix the input for several layers * Fix test * Fix conflict printing * Apply style * Fix XLM and Flaubert for TensorFlow * Apply style * Fix TF check version * Apply style * Trigger CI	2020-08-03 08:10:40 -04:00
Teven	5a0dac53bf	Empty assert hunt (#6056 ) * Fixed empty asserts * black-reformatted stragglers in templates * More code quality checks * Update src/transformers/convert_marian_to_pytorch.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/convert_marian_to_pytorch.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * removed unused line as per @sshleifer Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-03 10:19:03 +02:00
Martin Müller	16c2240164	Add script to convert tf2.x checkpoint to PyTorch (#5791 ) * Add script to convert tf2.x checkpoint to pytorch The script converts the newer TF2.x checkpoints (as published on their official GitHub: https://github.com/tensorflow/models/tree/master/official/nlp/bert) to Pytorch. * rename file in order to stay consistent with naming convention	2020-08-03 03:53:38 -04:00
Philip May	82a0e2b67e	Fix docstring for BertTokenizerFast (#6185 ) - remove duplicate doc-entry for tokenize_chinese_chars - add doc for strip_accents and wordpieces_prefix	2020-08-02 15:58:26 +08:00
Stas Bekman	d8dbf3b75d	[s2s] clean up + doc (#6184 ) Co-authored-by: Sam Shleifer <sshleifer@gmail.com>	2020-08-01 14:51:07 -04:00
Faiaz Rahman	a39dfe4fb1	Fixed typo in Longformer (#6180 )	2020-08-01 18:20:48 +08:00
Joe Davison	8edfaaa81b	bart-large-mnli-yahoo-answers model card (#6133 ) * Add bart-large-mnli-yahoo-answers model card * Add examples * Add widget example * Rm bart tag Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-31 10:56:32 -04:00
Sylvain Gugger	d951c14ae4	Model output test (#6155 ) * Use return_dict=True in all tests * Formatting	2020-07-31 09:44:37 -04:00
Sylvain Gugger	86caab1e0b	Harmonize both Trainers API (#6157 ) * Harmonize both Trainers API * Fix test * main_prcess -> process_zero	2020-07-31 09:43:23 -04:00
Mehrdad Farahani	603cd81a01	readme m3hrdadfi/albert-fa-base-v2 (#6153 ) * readme m3hrdadfi/albert-fa-base-v2 model_card readme for m3hrdadfi/albert-fa-base-v2 * Update model_cards/m3hrdadfi/albert-fa-base-v2/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-07-31 06:19:06 -04:00
Suraj Patil	838dc06ff5	parse arguments from dict (#4869 ) * add parse_dict to parse arguments from dict * add unit test for parse_dict	2020-07-31 04:44:23 -04:00
Paul O'Leary McCann	cf3cf304ca	Replace mecab-python3 with fugashi for Japanese tokenization (#6086 ) * Replace mecab-python3 with fugashi This replaces mecab-python3 with fugashi for Japanese tokenization. I am the maintainer of both projects. Both projects are MeCab wrappers, so the underlying C++ code is the same. fugashi is the newer wrapper and doesn't use SWIG, so for basic use of the MeCab API it's easier to use. This code insures the use of a version of ipadic installed via pip, which should make versioning and tracking down issues easier. fugashi has wheels for Windows, OSX, and Linux, which will help with issues with installing old versions of mecab-python3 on Windows. Compared to mecab-python3, because fugashi doesn't use SWIG, it doesn't require a C++ runtime to be installed on Windows. In adding this change I removed some code dealing with `cursor`, `token_start`, and `token_end` variables. These variables didn't seem to be used for anything, it is unclear to me why they were there. I ran the tests and they passed, though I couldn't figure out how to run the slow tests (`--runslow` gave an error) and didn't try testing with Tensorflow. * Style fix * Remove unused variable Forgot to delete this... * Adapt doc with install instructions * Fix typo Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-07-31 04:41:14 -04:00
Stas Bekman	f250beb8aa	enable easy checkout switch (#5645 ) * enable easy checkout switch allow having multiple repository checkouts and not needing to remember to rerun 'pip install -e .[dev]' when switching between checkouts and running tests. * make isort happy * examples needs one too	2020-07-31 04:34:46 -04:00
kolk	7d50af4b02	Create README.md (#6169 )	2020-07-31 04:28:35 -04:00
Prajjwal Bhargava	0034a1d248	Add Pytorch Native AMP support in Trainer (#6151 ) * fixed type; add Pytorch Native CUDA AMP support * reverted commit on modeling_utils * confirming to HF black formatting rule * changed bool value of _use_apex * scaler support for gradient clipping * fix inplace operation of clip_grad_norm * removed not while version comparison	2020-07-31 04:23:29 -04:00
Funtowicz Morgan	7231f7b503	Enable ONNX/ONNXRuntime optimizations through converter script (#6131 ) * Add onnxruntime transformers optimization support Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added Optimization section in ONNX/ONNXRuntime documentation. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Improve note reference Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Fixing imports order. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Add warning about different level of optimization between torch and tf export. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Address @LysandreJik wording suggestion Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address @LysandreJik wording suggestion Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Always optimize model before quantization for maximum performances. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Address comments on the documentation. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Improve TensorFlow optimization message as suggested by @yufenglee Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Removed --optimize parameter Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Warn the user about current quantization limitation when model is larger than 2GB. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Trigger CI for last check * Small change in print for the optimization section. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-07-31 09:45:13 +02:00

4741 Commits