transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-19 20:48:22 +06:00

Author	SHA1	Message	Date
ahotrod	ba498eac38	Create README.md (#2785 ) * Create README.md * Update README.md * Update README.md * Update README.md * [model_cards] Use code fences for consistency Co-authored-by: Julien Chaumond <chaumond@gmail.com>	2020-02-10 17:27:59 -05:00
Malte Pietsch	68ccc04ee6	Add model readme for deepset/roberta-base-squad2 (#2797 ) * Add readme for deepset/roberta-base-squad2 * update model readme	2020-02-10 15:21:48 -05:00
Lysandre	539f601be7	intermediate_size > hidden_dim in distilbert config docstrings	2020-02-10 13:45:57 -05:00
Lysandre	cfb7d108bd	FlauBERT lang embeddings only when n_langs > 1	2020-02-10 13:24:04 -05:00
Julien Chaumond	b4691a438d	[model_cards] BERT-of-Theseus: use the visual as thumbnail cc @jetrunner Co-Authored-By: Kevin Canwen Xu <canwenxu@outlook.com>	2020-02-10 11:27:08 -05:00
Julien Chaumond	fc325e97cd	[model_cards] Showcase model tag syntax	2020-02-10 11:27:08 -05:00
Lysandre	fd639e5be3	Correct quickstart example when using the past	2020-02-10 11:25:56 -05:00
Julien Chaumond	63a5399bc4	[model_cards] Specify language meta + thumbnail cc @tholor see #2799	2020-02-10 11:20:05 -05:00
Lysandre	125a75a121	Correctly compute tokens when padding on the left	2020-02-10 10:47:42 -05:00
Malte Pietsch	9c64d1da35	Add model readme for bert-base-german-cased (#2799 ) * add readme for bert-base-german-cased * update readme	2020-02-10 10:27:29 -05:00
Kevin Canwen Xu	bf99014c46	Create BERT-of-Theseus model card	2020-02-10 09:58:40 -05:00
Thomas Wolf	92e974196f	Merge pull request #2765 from huggingface/extract-cached-archives Add option to `cached_path` to automatically extract archives	2020-02-10 14:05:16 +01:00
Morgan Funtowicz	6aa7973aec	Fix circleci cuInit error on Tensorflow >= 2.1.0. Tensorflow 2.1.0 introduce a new dependency model where pip install tensorflow would install tf with GPU support. Before it would just install with CPU support, thus CircleCI is looking for NVidia driver version at initialization of the tensorflow related tests but fails as their is no NVidia Driver running. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-10 13:24:37 +01:00
Lysandre	520e7f2119	Correct docstring for xlnet	2020-02-07 16:42:35 -05:00
Lysandre	dd28830327	Update RoBERTa tips	2020-02-07 16:42:35 -05:00
Lysandre	db97930122	Update XLM-R tips	2020-02-07 16:42:35 -05:00
Lysandre	7046de2991	E231	2020-02-07 15:28:13 -05:00
VictorSanh	0d3aa3c04c	styling	2020-02-07 15:28:13 -05:00
VictorSanh	d8b43600fd	omission	2020-02-07 15:28:13 -05:00
VictorSanh	ee5a6856ca	distilbert-base-cased weights + Readmes + omissions	2020-02-07 15:28:13 -05:00
monologg	73368963b2	Fix importing unofficial TF models with extra optimizer weights	2020-02-07 10:25:31 -05:00
Ari	d7dabfeff5	Fix documentation in ProjectedAdaptiveLogSoftmax	2020-02-07 10:14:58 -05:00
Julien Chaumond	42f08e596f	[examples] rename run_lm_finetuning to run_language_modeling	2020-02-07 09:15:28 -05:00
Julien Chaumond	4f7bdb0958	[examples] Fix broken markdown	2020-02-07 09:15:28 -05:00
thomwolf	c6c5c3fd4e	style and quality	2020-02-07 08:58:06 +01:00
thomwolf	961c69776f	@julien-c proposal for TF/PT compat in hf_buckets	2020-02-07 08:53:17 +01:00
thomwolf	d311f87bca	cleanup	2020-02-07 00:05:28 +01:00
thomwolf	7d99e05f76	file_cache has options to extract archives	2020-02-07 00:03:12 +01:00
dchurchwell	2c12464a20	Changed vocabulary save function. Variable name was inconsistent, causing an error to be thrown when passing a file name instead of a directory.	2020-02-06 16:40:07 -05:00
Peter Izsak	6fc3d34abd	Fix multi-gpu evaluation in run_glue.py	2020-02-06 16:38:55 -05:00
Julien Chaumond	7748cbbe7d	Oopsie	2020-02-06 15:30:02 -05:00
Julien Chaumond	432c12521e	[docs] Add menu w/ links to other pages on hf.co	2020-02-06 15:30:02 -05:00
Clement	c069932f5d	Add contributors snapshot powered by https://github.com/sourcerer-io/hall-of-fame	2020-02-06 15:25:47 -05:00
Lysandre Debut	33d3072e1c	Arxiv README (#2747 ) * Arxiv README * ArXiv-NLP readme	2020-02-05 15:26:28 -05:00
Julien Chaumond	eae8ee0389	[doc] model sharing: mention README.md + tweaks cc @lysandrejik @thomwolf	2020-02-05 14:20:03 -05:00
James Betker	6bb6a01765	Fix GPT2 config set to trainable This prevents the model from being saved, and who knows what else.	2020-02-05 13:55:41 -05:00
Julien Chaumond	ada24def22	[run_lm_finetuning] Tweak fix for non-long tensor, close #2728 see `1ebfeb7946` and #2728 Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2020-02-05 12:49:18 -05:00
Lysandre	2184f87003	RoBERTa TensorFlow Tests	2020-02-04 18:05:35 -05:00
Lysandre	e615269cb8	Correct slow test	2020-02-04 18:05:35 -05:00
Lysandre	5f96ebc0be	Style	2020-02-04 18:05:35 -05:00
Lysandre	950c6a4f09	Flaubert PyTorch tests	2020-02-04 18:05:35 -05:00
Lysandre	d28b81dc29	RoBERTa Pytorch tests	2020-02-04 18:05:35 -05:00
Yuval Pinter	d1ab1fab1b	pass langs parameter to certain XLM models (#2734 ) * pass langs parameter to certain XLM models Adding an argument that specifies the language the SQuAD dataset is in so language-sensitive XLMs (e.g. `xlm-mlm-tlm-xnli15-1024`) don't default to language `0`. Allows resolution of issue #1799 . * fixing from `make style` * fixing style (again)	2020-02-04 17:12:42 -05:00
sshleifer	9e5b549b4d	fix default getattr	2020-02-04 16:38:52 -05:00
sshleifer	25848a6094	double quotes	2020-02-04 16:38:52 -05:00
sshleifer	cbcb83f21d	minor cleanup of test_attention_outputs	2020-02-04 16:38:52 -05:00
Lysandre	3bf5417258	Revert erroneous fix	2020-02-04 16:31:07 -05:00
Lysandre	1ebfeb7946	Cast to long when masking tokens	2020-02-04 15:56:16 -05:00
Lysandre	9c67196b83	Update quickstart	2020-02-04 11:11:37 -05:00
Lysandre	90ab15cb7a	Remove redundant hidden states	2020-02-04 10:59:32 -05:00

... 236 237 238 239 240 ...

15053 Commits