Lysandre Debut
d5d7d88612
ELECTRA ( #3257 )
...
* Electra wip
* helpers
* Electra wip
* Electra v1
* ELECTRA may be saved/loaded
* Generator & Discriminator
* Embedding size instead of halving the hidden size
* ELECTRA Tokenizer
* Revert BERT helpers
* ELECTRA Conversion script
* Archive maps
* PyTorch tests
* Start fixing tests
* Tests pass
* Same configuration for both models
* Compatible with base + large
* Simplification + weight tying
* Archives
* Auto + Renaming to standard names
* ELECTRA is uncased
* Tests
* Slight API changes
* Update tests
* wip
* ElectraForTokenClassification
* temp
* Simpler arch + tests
Removed ElectraForPreTraining which will be in a script
* Conversion script
* Auto model
* Update links to S3
* Split ElectraForPreTraining and ElectraForTokenClassification
* Actually test PreTraining model
* Remove num_labels from configuration
* wip
* wip
* From discriminator and generator to electra
* Slight API changes
* Better naming
* TensorFlow ELECTRA tests
* Accurate conversion script
* Added to conversion script
* Fast ELECTRA tokenizer
* Style
* Add ELECTRA to README
* Modeling Pytorch Doc + Real style
* TF Docs
* Docs
* Correct links
* Correct model initialized
* random fixes
* style
* Addressing Patrick's and Sam's comments
* Correct links in docs
2020-04-03 14:10:54 -04:00
Patrick von Platen
83d1fbcff6
[Docs] Add usage examples for translation and summarization ( #3538 )
2020-03-31 09:36:03 -04:00
Patrick von Platen
42e1e3c67f
Update usage doc regarding generate fn ( #3504 )
2020-03-31 09:31:46 -04:00
LysandreJik
6f5a12a583
Release: v2.7.0
2020-03-30 08:49:24 -04:00
Patrick von Platen
5b44e0a31b
[T5] Add training documentation ( #3507 )
...
* Add clear description of how to train T5
* correct docstring in T5
* correct typo
* correct docstring format
* update t5 model docs
* implement collins feedback
* fix typo and add more explanation for sentinel tokens
* delete unnecessary todos
2020-03-30 13:35:53 +02:00
Patrick von Platen
fa9af2468a
Add T5 to docs ( #3461 )
...
* add t5 docs basis
* improve docs
* add t5 docs
* improve t5 docstring
* add t5 tokenizer docstring
* finish docstring
* make style
* add pretrained models
* correct typo
* make examples work
* finalize docs
2020-03-27 10:57:16 -04:00
LysandreJik
471cce24b3
Release: v2.6.0
2020-03-24 10:37:32 -04:00
Sam Shleifer
38a555a83c
Add Summarization to Pipelines ( #3128 )
...
* passing
* Undo stupid chg
* docs
* undo rename
* delete-cruft
* only import if you have torch
* Don't rely on dict ordering
* Fix dict ordering upstream
* docstring link
* docstring link
* remove trailing comma for 3.5 compat
* new name
* delegate kwarging
* Update kwargs
2020-03-17 18:04:21 -04:00
Thomas Wolf
2187c49f5c
CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) ( #3186 )
...
* memory benchmark rss
* have both forward pass and line-by-line mem tracing
* cleaned up tracing
* refactored and cleaning up API
* no f-strings yet...
* add GPU mem logging
* fix GPU memory monitoring
* style and quality
* clean up and doc
* update with comments
* Switching to python 3.6+
* fix quality
2020-03-17 10:17:11 -04:00
Julien Chaumond
d6de6423ba
[doc] --organization tweak
...
Co-Authored-By: Thomas Wolf <thomwolf@users.noreply.github.com>
2020-03-10 16:52:44 -04:00
Julien Chaumond
0e56dc3078
[doc] Document the new --organization flag of CLI
2020-03-10 16:42:01 -04:00
Sam Shleifer
857e0a0d3b
Rename BartForMaskedLM -> BartForConditionalGeneration ( #3114 )
...
* improved documentation
2020-03-05 17:41:18 -05:00
Lysandre
07a79db505
Fix failing doc samples
2020-03-04 19:11:31 -05:00
Lysandre Debut
d3eb7d23a4
Pipeline doc ( #3055 )
...
* Pipeline doc initial commit
* pipeline abstraction
* Remove modelcard argument from pipeline
* Task-specific pipelines can be instantiated with no model or tokenizer
* All pipelines doc
2020-03-02 14:07:10 -05:00
Sam Shleifer
b54ef78d0c
Bart-CNN ( #3059 )
...
`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.
2020-03-02 10:35:53 -05:00
Sam Shleifer
9df74b8bc4
Delete all mentions of Model2Model ( #3019 )
2020-02-26 11:36:27 -05:00
Lysandre Debut
bb7c468520
Documentation ( #2989 )
...
* All Tokenizers
BertTokenizer + few fixes
RobertaTokenizer
OpenAIGPTTokenizer + Fixes
GPT2Tokenizer + fixes
TransfoXLTokenizer
Correct rst for TransformerXL
XLMTokenizer + fixes
XLNet Tokenizer + Style
DistilBERT + Fix XLNet RST
CTRLTokenizer
CamemBERT Tokenizer
FlaubertTokenizer
XLMRobertaTokenizer
cleanup
* cleanup
2020-02-25 18:43:36 -05:00
Lysandre Debut
65e7c90a77
Adding usage examples for common tasks ( #2850 )
...
* Usage: Sequence Classification & Question Answering
* Pipeline example
* Language modeling
* TensorFlow code for Sequence classification
* Custom TF/PT toggler in docs
* QA + LM for TensorFlow
* Finish Usage for both PyTorch and TensorFlow
* Addressing Julien's comments
* More assertive
* cleanup
* Favicon
- added favicon option in conf.py along with the favicon image
- updated 🤗 logo: slightly smaller, and should appear more consistent across editing programs (no more tongue on the outside of the mouth)
Co-authored-by: joshchagani <joshua@joshuachagani.com>
2020-02-25 13:48:24 -05:00
Lysandre
f9ec5ca90b
Release: v2.5.1
2020-02-24 18:22:54 -05:00
Sam Shleifer
53ce3854a1
New BartModel ( #2745 )
...
* Results same as fairseq
* Wrote a ton of tests
* Struggled with api signatures
* added some docs
2020-02-20 18:11:13 -05:00
Lysandre
fb560dcb07
Release: v2.5.0
...
Welcome Rust Tokenizers
2020-02-19 11:46:19 -05:00
Lysandre
fd639e5be3
Correct quickstart example when using the past
2020-02-10 11:25:56 -05:00
Lysandre
dd28830327
Update RoBERTa tips
2020-02-07 16:42:35 -05:00
Lysandre
db97930122
Update XLM-R tips
2020-02-07 16:42:35 -05:00
VictorSanh
ee5a6856ca
distilbert-base-cased weights + Readmes + omissions
2020-02-07 15:28:13 -05:00
Julien Chaumond
42f08e596f
[examples] rename run_lm_finetuning to run_language_modeling
2020-02-07 09:15:28 -05:00
Julien Chaumond
7748cbbe7d
Oopsie
2020-02-06 15:30:02 -05:00
Julien Chaumond
432c12521e
[docs] Add menu w/ links to other pages on hf.co
2020-02-06 15:30:02 -05:00
Julien Chaumond
eae8ee0389
[doc] model sharing: mention README.md + tweaks
...
cc @lysandrejik @thomwolf
2020-02-05 14:20:03 -05:00
Lysandre
9c67196b83
Update quickstart
2020-02-04 11:11:37 -05:00
Lysandre
d426b58b9e
Patch: v2.4.1
2020-01-31 14:55:33 -05:00
Lysandre
6664ea943d
Release: v2.4.0
2020-01-31 09:40:32 -05:00
Hang Le
b43cb09aaa
Add layerdrop
2020-01-30 12:05:01 -05:00
Lysandre
93dccf527b
Pretrained models
2020-01-30 10:04:18 -05:00
Lysandre
73306d028b
FlauBERT documentation
2020-01-30 10:04:18 -05:00
Lysandre
c69b082601
Update documentation
2020-01-29 12:06:13 -05:00
Lysandre
44a5b4bbe7
Update documentation
2020-01-29 11:47:49 -05:00
Wietse de Vries
f5a236c3ca
Add Dutch pre-trained BERT model
2020-01-27 21:00:34 -05:00
thomwolf
e0849a66ac
adding in the doc
2020-01-27 14:27:07 -05:00
Lysandre
983fef469c
AutoModels doc
2020-01-24 16:37:30 -05:00
Lysandre
24d5ad1dcc
Run the examples in slow
2020-01-23 09:38:45 -05:00
Lysandre
9ddf60b694
Tips + whitespaces
2020-01-23 09:38:45 -05:00
Lysandre
0e9899f451
Fixes
2020-01-23 09:38:45 -05:00
Lysandre
7511f3dd89
PyTorch CTRL + Style
2020-01-23 09:38:45 -05:00
Lysandre
980211a63a
XLM-RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
db1a7f27a1
PyTorch DistilBERT
2020-01-23 09:38:45 -05:00
Lysandre
b28020f590
TF RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
3e1bc27e1b
Pytorch RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
f44ff574d3
Camembert
2020-01-23 09:38:45 -05:00
Lysandre
ccebcae75f
PyTorch XLM
2020-01-23 09:38:45 -05:00
Lysandre
cd656fb21a
PyTorch XLNet
2020-01-23 09:38:45 -05:00
Lysandre
98edad418e
PyTorch Transformer-XL
2020-01-23 09:38:45 -05:00
Lysandre
850795c487
Pytorch GPT
2020-01-23 09:38:45 -05:00
Lysandre
1487b840d3
TF GPT2
2020-01-23 09:38:45 -05:00
Lysandre
bd0d3fd76e
GPT-2 PyTorch models + better tips for BERT
2020-01-23 09:38:45 -05:00
Lysandre
cd77c750c5
BERT PyTorch models
2020-01-23 09:38:45 -05:00
Lysandre
3922a2497e
TF ALBERT + TF Utilities + Fix warnings
2020-01-23 09:38:45 -05:00
Lysandre
00df3d4de0
ALBERT Modeling + required changes to utilities
2020-01-23 09:38:45 -05:00
Lysandre
632675ea88
Can test examples spread over multiple blocks
2020-01-23 09:38:45 -05:00
Lysandre
9bab9b83d2
Glossary
2020-01-23 09:38:45 -05:00
Julien Chaumond
119dc50e2a
Doc tweak on model sharing
2020-01-22 22:40:38 -05:00
Lysandre
387217bd3e
Added example usage
2020-01-14 14:09:09 +01:00
Lysandre
7d1bb7f256
Add missing XLNet and XLM models
2020-01-14 14:09:09 +01:00
Lysandre Debut
632682726f
Updated Configurations
2020-01-14 14:09:09 +01:00
alberduris
81d6841b4b
GPU text generation: moved the encoded_prompt to correct device
2020-01-06 15:11:12 +01:00
alberduris
dd4df80f0b
Moved the encoded_prompts to correct device
2020-01-06 15:11:12 +01:00
Morgan Funtowicz
80faf22b4a
Updating documentation for converting tensorflow model to reflect the new cli convert format.
...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-01-04 13:41:18 +01:00
Julien Chaumond
9b2badf3c9
[cli] Update doc
2019-12-27 22:54:29 -05:00
Aymeric Augustin
a8d34e534e
Remove [--editable] in install instructions.
...
Use -e only in docs targeted at contributors.
If a user copy-pastes a command line with [--editable], they will hit
an error. If they don't know the --editable option, we're giving them
a choice to make before they can move forwards, but this isn't a choice
they need to make right now.
2019-12-24 08:46:08 +01:00
Aymeric Augustin
70373a5f7c
Update contribution instructions.
...
Also provide shortcuts in a Makefile.
2019-12-23 21:05:30 +01:00
Aymeric Augustin
d8e33dbd67
Fix path to source code in docs config.
...
This should fix API docs, which went AWOL with yesterday's changes.
2019-12-23 16:49:35 +01:00
Aymeric Augustin
45841eaf7b
Remove references to Python 2 in documentation.
2019-12-22 18:38:56 +01:00
Aymeric Augustin
ced0a94204
Switch test files to the standard test_*.py scheme.
2019-12-22 14:15:13 +01:00
Aymeric Augustin
067395d5c5
Move tests outside of library.
2019-12-22 13:47:17 +01:00
Julien Chaumond
ac1b449cc9
[doc] move distilroberta to more appropriate place
...
cc @lysandrejik
2019-12-21 00:09:01 -05:00
Lysandre
a436574bfd
Release: v2.3.0
2019-12-20 16:22:20 -05:00
Rémi Louf
4e3f745ba4
add example for Model2Model in quickstart
2019-12-20 09:12:31 -05:00
Stefan Schweter
f09d999641
docs: fix numbering 😅
2019-12-18 19:49:33 +01:00
Stefan Schweter
dd7a958fd6
docs: add XLM-RoBERTa to pretrained model list (incl. all parameters)
2019-12-18 19:45:46 +01:00
Stefan Schweter
d35405b7a3
docs: add XLM-RoBERTa to index page
2019-12-18 19:45:10 +01:00
Antti Virtanen
abc43ffbff
Add pretrained model documentation for FinBERT.
2019-12-17 20:35:25 -05:00
Julien Chaumond
3f5ccb183e
[doc] Clarify uploads
...
cf 855ff0e91d (commitcomment-36452545)
2019-12-16 18:20:29 -05:00
Julien Chaumond
855ff0e91d
[doc] Model upload and sharing
...
ping @lysandrejik @thomwolf
Is this clear enough? Anything we should add?
2019-12-16 12:42:22 -05:00
Thomas Wolf
e92bcb7eb6
Merge pull request #1739 from huggingface/t5
...
[WIP] Adding Google T5 model
2019-12-14 09:40:43 +01:00
Lysandre
7bd11dda6f
Release: v2.2.2
2019-12-13 16:45:30 -05:00
thomwolf
5c00e344c1
update model doc - switch 3B/11B to 3b/11b
2019-12-13 16:33:29 +01:00
Thomas Wolf
110394b2ba
Merge branch 'master' into t5
2019-12-13 16:03:32 +01:00
Julien Chaumond
1748fdf657
[doc] Fix rst table
2019-12-11 18:32:27 -05:00
Masatoshi Suzuki
c03c0dfd23
Add support for Japanese BERT models by cl-tohoku
2019-12-11 18:32:27 -05:00
Stefan Schweter
030faccb8d
doc: fix pretrained models table
2019-12-11 12:19:21 -05:00
thomwolf
0558c9cb9b
Merge branch 'master' into t5
2019-12-10 12:58:48 +01:00
Thomas Wolf
e57d00ee10
Merge pull request #1984 from huggingface/squad-refactor
...
[WIP] Squad refactor
2019-12-10 11:07:26 +01:00
Pierric Cistac
5c877fe94a
fix albert links
2019-12-09 18:53:00 -05:00
Lysandre Debut
00c4e39581
Merge branch 'master' into squad-refactor
2019-12-09 10:41:15 -05:00
Aymeric Augustin
35401fe50f
Remove dependency on pytest for running tests ( #2055 )
...
* Switch to plain unittest for skipping slow tests.
Add a RUN_SLOW environment variable for running them.
* Switch to plain unittest for PyTorch dependency.
* Switch to plain unittest for TensorFlow dependency.
* Avoid leaking open files in the test suite.
This prevents spurious warnings when running tests.
* Fix unicode warning on Python 2 when running tests.
The warning was:
UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
* Support running PyTorch tests on a GPU.
Reverts 27e015bd.
* Tests no longer require pytest.
* Make tests pass on cuda
2019-12-06 13:57:38 -05:00
Thomas Wolf
5482822a2b
Merge pull request #2046 from jplu/tf2-ner-example
...
Add NER TF2 example.
2019-12-06 12:12:22 +01:00
LysandreJik
9ecd83dace
Patch evaluation for impossible values + cleanup
2019-12-05 14:44:57 -05:00
VictorSanh
552c44a9b1
release distilm-bert
2019-12-05 10:14:58 -05:00
Julien Plu
9200a759d7
Add a few tests on the TF optimization file with some info in the documentation. Complete the README.
2019-12-05 12:56:43 +01:00
Thomas Wolf
1f179f095f
Merge pull request #2011 from AdityaSoni19031997/patch-1
...
typo fix on the docs as per Pytorch v1.1+
2019-12-05 12:39:04 +01:00