Stas Bekman
|
fbe278c76c
|
[doc] bf16/tf32 guide (#14579)
* [doc] bf16/tf32 guide
* expand
* expand
* Update docs/source/performance.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-12-01 14:18:58 -08:00 |
|
NielsRogge
|
0490b98877
|
[ImageGPT] Small fixes (#14460)
* Add integration test
* Fix typo
|
2021-11-19 15:15:02 +01:00 |
|
NielsRogge
|
da36c557f7
|
Add ImageGPT (#14240)
* First draft
* More improvements
* Improve conversion script
* Fix init weights for layer norm
* Fix correct model for conversion script
* Don't tie input and output embeddings
* Add print statements for debugging
* Add print statements for debugging
* Fix vocab size of model
* Improve documentation, remove fast tokenizer
* Add ImageGPTForImageClassification, improve docs
* Fix docs issue
* Set verbosity level back to info
* Improve tests
* Fix tests and add figure
* Delete tokenizer file
* Remove ImageGPTTokenizer from init files
* Remove ImageGPTLayer from init files
* Remove ImageGPT tokenizer from docs
* First draft of ImageGPTFeatureExtractor
* Fix typo
* Fix bug
* More improvements
* Apply suggestions from code review, add tests for feature extractor
* Fix layernorm
* Update save_pretrained method
* Fix issue
* Make all tests of ImageGPTFeatureExtractor pass
* Update code examples
* Rename model inputs to pixel_values
* Improve code examples
* Update init_weights to post_init
* Fix post_init
|
2021-11-18 16:24:34 +01:00 |
|
Kevin Ko
|
95bab53868
|
Update TP parallel GEMM image (#14112)
* Update TP parallel GEMM image
* Delete parallelism-tp-parallel_gemm.png
* Update parallelism-tp-parallel_gemm.png
|
2021-10-22 12:57:48 -07:00 |
|
Stas Bekman
|
0dcc3c86e4
|
[doc] DP/PP/TP/etc parallelism (#12524)
* wip
* complete the doc
* missing img
* improve
* correction
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-07-09 17:39:09 -07:00 |
|
Sylvain Gugger
|
a8694b8850
|
Adjust banner width
|
2021-06-15 09:37:15 -04:00 |
|
Sylvain Gugger
|
60b1d6b45b
|
Add course banner (#12157)
* Add course banner
* Update course banner
|
2021-06-15 09:25:49 -04:00 |
|
Patrick von Platen
|
89be094e29
|
[Templates] Add template "call-for-model" markdown and "call-for-big-bird" markdown (#9921)
* add big bird
* change teacher to mentor
* add proposal template
* adapt template
* delete old template
* correct some links
* finish template
* create big bird from template
* add big bird
* improve boxes
* finish boxes
* add pointers for BigBird
* finish big bird
* up
* up
* up
* up
* apply lysandres and sylvains suggestions
* delete bogus file
* correct markdown
* try different style
* try different style
* finalize
|
2021-02-05 15:47:54 +03:00 |
|
Patrick von Platen
|
0e3be1ac8f
|
Add new model docs (#9667)
* add new model logic
* fix docs
* change structure
* improve add_new_model
* push new changes
* up
* up
* correct spelling
* improve docstring
* correct line length
* update readme
* correct links
* correct typos
* only add rst file for now
* Apply suggestions from code review 1
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
* Apply suggestions from code review
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>
* finish adding all suggestions
* make style
* apply Niels feedback
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* apply sylvains suggestions
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-02-01 17:55:10 +03:00 |
|
Joe Davison
|
b4b33fdf25
|
Guide to fixed-length model perplexity evaluation (#5449)
* add first draft ppl guide
* upload imgs
* expand on strides
* ref typo
* rm superfluous past var
* add tokenization disclaimer
|
2020-07-07 16:04:15 -06:00 |
|
Sylvain Gugger
|
fa661ce749
|
Add model summary (#4789)
* Add model summary
* Add link to pretrained models
|
2020-06-05 12:22:50 -04:00 |
|
alberduris
|
81d6841b4b
|
GPU text generation: mMoved the encoded_prompt to correct device
|
2020-01-06 15:11:12 +01:00 |
|
alberduris
|
dd4df80f0b
|
Moved the encoded_prompts to correct device
|
2020-01-06 15:11:12 +01:00 |
|
thomwolf
|
f47f7f4611
|
add logo
|
2019-09-26 11:28:44 +02:00 |
|
LysandreJik
|
ab30651802
|
Hugging Face theme.
|
2019-07-08 16:05:26 -04:00 |
|