Manuel Romero
e99af3b17b
Create model card for bert-small-finetuned-squadv2
2020-03-19 15:07:55 -04:00
Manuel Romero
39db055268
Merge pull request #3348 from mrm8488/patch-28
...
Create card for BERT-Mini finetuned on SQuAD v2
2020-03-19 15:07:39 -04:00
Manuel Romero
dedc7a8fdb
Create card for BERT-Tiny fine-tuned on SQuAD v2
...
- Only 17MB of Model weights!!
2020-03-19 15:07:22 -04:00
Manuel Romero
676adf8625
Created card for spanbert-finetuned-squadv1
2020-03-19 15:06:35 -04:00
Antti Virtanen
11d8bcc9d7
Add model cards for FinBERT. ( #3331 )
...
* Add a model card for FinBERT
This is a copy of https://github.com/TurkuNLP/FinBERT/blob/master/README.md .
* Added a file for uncased.
* Add metadata for cased.
* Added metadata for uncased.
2020-03-19 15:06:01 -04:00
Lysandre Debut
f049be7ad4
Export ALBERT main layer in TensorFlow ( #3354 )
2020-03-19 13:53:05 -04:00
Kyeongpil Kang
3bedfd3347
Fix wrong link for the notebook file ( #3344 )
...
For the tutorial of "How to generate text", the URL link was wrong (it was linked to the tutorial of "How to train a language model").
I fixed the URL.
2020-03-19 17:22:47 +01:00
Serkan Karakulak
b2c2c31c60
Minor Bug Fix for Running Roberta on Glue ( #3240 )
...
* added return_token_type_ids argument for tokenizers which do not generate return_type_ids by default
* fixed styling
* Style
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
2020-03-19 12:08:31 -04:00
Sam Shleifer
4e4403c9b4
[BART] torch 1.0 compatibility ( #3322 )
...
* config.activation_function
2020-03-19 11:56:54 -04:00
mataney
c44a17db1b
[FIX] not training when epoch is small ( #3006 )
...
* solving bug where for small epochs and large gradient_accumulation_steps we never train
* black formatting
* no need to change these files
2020-03-19 11:21:21 -04:00
Sam Shleifer
ad7233fc01
[BART] cleanup: remove redundant kwargs, improve docstrings ( #3319 )
2020-03-19 11:16:51 -04:00
Mohamed El-Geish
cd21d8bc00
Typo in warning message ( #3219 )
...
`T5Tokenizer` instead of `XLNetTokenizer`
2020-03-19 09:49:25 -04:00
Matthew Goldey
8d3e218ea6
fix typo in docstring demonstrating usage ( #3213 )
2020-03-19 09:47:54 -04:00
Patrick von Platen
cec3cdda15
Fix input ids can be none attn mask ( #3345 )
...
* fix issue 3289
* fix attention mask if input_ids None behavior
2020-03-19 09:55:17 +01:00
Junyi_Li
f6d813aaaa
Create README.md
2020-03-18 23:45:02 -04:00
Junyi_Li
939328111b
Create README.md
...
roberta_chinese_base card
2020-03-18 23:44:12 -04:00
Junyi_Li
29442d2edf
Create README.md
...
albert_chinese_tiny card
2020-03-18 23:43:49 -04:00
Kyle Lo
20139b7c8d
Added model cards for SciBERT models uploaded under AllenAI org ( #3330 )
...
* Create README.md
* model card
* add model card for cased
2020-03-18 15:45:11 -04:00
Morgan Funtowicz
cae334c43c
Improve fill-mask pipeline example in 03-pipelines notebook.
...
Remove hardcoded mask_token and use the value provided by the tokenizer.
2020-03-18 17:11:42 +01:00
Branden Chan
4b1970bb4c
Create README.md
2020-03-18 11:37:17 -04:00
Lysandre Debut
d6afbd323d
XLM-R Tokenizer now passes common tests + Integration tests ( #3198 )
...
* XLM-R now passes common tests + Integration tests
* Correct mask index
* Model input names
* Style
* Remove text preprocessing
* Unneccessary import
2020-03-18 09:52:49 -04:00
Patrick von Platen
292186a3e7
Adding LM Head to Transfo-XL and first step to fixing problem with Adaptive Embeddings in TransfoXL ( #3286 )
...
* first commit
* work in progress
* make language generation task pass
* update to working version for LM
* delete print
* remove dead code
* make style
2020-03-18 09:24:27 -04:00
Patrick von Platen
efdb46b6e2
add link to blog post ( #3326 )
2020-03-18 13:24:28 +01:00
Patrick von Platen
ddb10c6447
improve doctstring ( #3327 )
2020-03-18 13:24:09 +01:00
Junyi_Li
d7f98cd3ef
Init card for model
2020-03-18 07:55:27 -04:00
Sam Shleifer
38a555a83c
Add Summarization to Pipelines ( #3128 )
...
* passing
* Undo stupid chg
* docs
* undo rename
* delete-cruft
* only import if you have torch
* Dont rely on dict ordering
* Fix dict ordering upstream
* docstring link
* docstring link
* remove trailing comma for 3.5 compat
* new name
* delegate kwarging
* Update kwargs
2020-03-17 18:04:21 -04:00
J.P Lee
2b60a26b46
Update examples/ner/run_ner.py to use AutoModel ( #3305 )
...
* Update examples/ner/run_ner.py to use AutoModel
* Fix missing code and apply `make style` command
2020-03-17 12:30:10 -04:00
Manuel Romero
e41212c715
Create model card for CodeBERTaPy ( #3309 )
2020-03-17 12:29:11 -04:00
Julien Chaumond
0f1bc0d68e
[model_cards] Add google thumbnail
2020-03-17 12:02:51 -04:00
Nathan Raw
930c9412b4
[WIP] Lightning glue example ( #3290 )
...
* ✨ Alter base pl transformer to use automodels
* 🐛 Add batch size env variable to function call
* 💄 Apply black code style from Makefile
* 🚚 Move lightning base out of ner directory
* ✨ Add lightning glue example
* 💄 self
* move _feature_file to base class
* ✨ Move eval logging to custom callback
* 💄 Apply black code style
* 🐛 Add parent to pythonpath, remove copy command
* 🐛 Add missing max_length kwarg
2020-03-17 11:46:42 -04:00
Patrick von Platen
e8f44af5bf
[generate] do_sample default back to False ( #3298 )
...
* change do_samples back
* None better default as boolean
* adapt do_sample to True in test example
* make style
2020-03-17 10:52:37 -04:00
Thomas Wolf
2187c49f5c
CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) ( #3186 )
...
* memory benchmark rss
* have both forward pass and line-by-line mem tracing
* cleaned up tracing
* refactored and cleaning up API
* no f-strings yet...
* add GPU mem logging
* fix GPU memory monitoring
* style and quality
* clean up and doc
* update with comments
* Switching to python 3.6+
* fix quality
2020-03-17 10:17:11 -04:00
Jannes
bd3feddf67
Create README.md ( #3306 )
...
* Create README.md
* Updated README.md
2020-03-17 09:05:11 -04:00
Julien Chaumond
68ef0a111f
[model_cards] Symlink all Google AI's BERT Miniatures to source model card
2020-03-16 23:37:42 -04:00
Sam Shleifer
b2c1a447fe
[BART] Delete redundant unit test ( #3302 )
2020-03-16 23:09:10 -04:00
iuliaturc-google
b2028cc26b
Add model card for Google AI's BERT Miniatures ( #3301 )
...
This model card is intended to be shared among all models under google/bert_uncased_*
(We'll need some support from HuggingFace to get this card cross-linked from all models)
2020-03-16 21:51:46 -04:00
Patrick von Platen
4759176313
add camembert for Question answering for examples
2020-03-16 14:42:11 -04:00
Sam Shleifer
11573231c6
[BART] generation_mode as a kwarg not a class attribute ( #3278 )
2020-03-16 12:47:53 -04:00
Manuel Romero
de697935a2
Create model card for spanbert-finetuned-squadv2
2020-03-16 12:32:46 -04:00
Manuel Romero
3ddd2029bc
Create CodeBERTaJS model card
2020-03-16 12:23:01 -04:00
Julien Plu
879e1d3234
Add TF2 version of FlauBERT ( #2700 )
...
* Add TF2 version of FlauBERT
* Add TF2 version of FlauBERT
* Add documentation
* Apply style and quality
* Apply style once again
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-03-16 09:29:21 -04:00
Patrick von Platen
af471ce5e8
Improved Error message when loading config/model with .from_pretrained() ( #3247 )
...
* better error message
* better error message
* update to model identifier instead of url
* update to model identifier instead of ur
2020-03-16 09:48:30 +01:00
Sam Shleifer
5ea8ba67b4
[BART] Remove unused kwargs ( #3279 )
...
* Remove unused kwargs
* dont call forward in tests
2020-03-15 23:00:44 -04:00
Thomas Wolf
3814e167d9
Merge pull request #3225 from patrickvonplaten/finalize_merge_bart_generate_into_default_generate
...
Complete merge Seq-2-Seq generation into default generation
2020-03-14 15:08:59 +01:00
Sam Shleifer
2bd79e23de
[BART] FP16 testing fixes ( #3266 )
2020-03-13 19:48:26 -04:00
Julien Chaumond
8320feec09
[model_cards] CodeBERTa
2020-03-13 18:28:09 -04:00
Patrick von Platen
ab756f713c
add gpt2-xl for tf
2020-03-13 16:40:35 -04:00
Patrick von Platen
4f75d380a4
make style
2020-03-13 16:35:52 +01:00
Patrick von Platen
c2ee3840ae
update file to new starting token logic
2020-03-13 16:34:44 +01:00
Benjamin Muller
cc4c37952a
Create camembert-base-README.md
2020-03-13 09:35:53 -04:00