transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 21:00:08 +06:00

Author	SHA1	Message	Date
NielsRogge	30677dc743	Add Vision Transformer and ViTFeatureExtractor (#10950 ) * Squash all commits into one * Update ViTFeatureExtractor to use image_utils instead of torchvision * Remove torchvision and add Pillow * Small docs improvement * Address most comments by @sgugger * Fix tests * Clean up conversion script * Pooler first draft * Fix quality * Improve conversion script * Make style and quality * Make fix-copies * Minor docs improvements * Should use fix-copies instead of manual handling * Revert "Should use fix-copies instead of manual handling" This reverts commit `fd4e591bce`. * Place ViT in alphabetical order Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-04-01 11:16:05 -04:00
Sylvain Gugger	d0b3797a3b	Add more metadata to the user agent (#10972 ) * Add more metadata to the user agent * Fix typo * Use DISABLE_TELEMETRY * Address review comments * Use global env * Add clean envs on circle CI	2021-03-31 09:36:07 -04:00
Lysandre	3f48b2bc3e	Update stable docs	2021-03-23 11:01:16 -04:00
Sylvain Gugger	21e86f99e6	Sort init import (#10801 ) * Initial script * Add script to properly sort imports in init. * Add to the CI * Update utils/custom_init_isort.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Separate scripts that change content from quality * Move class_mapping_update to style_checks Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-19 16:17:13 -04:00
Sylvain Gugger	dcebe254fa	Document v4.4.2	2021-03-18 15:19:25 -04:00
Lysandre	73fe40898d	Docs for v4.4.1	2021-03-16 15:41:49 -04:00
Lysandre	1b5ce1e63b	Development on v4.5.0dev0	2021-03-16 11:41:15 -04:00
Patrick von Platen	9f8619c6aa	Flax testing should not run the full torch test suite (#10725 ) * make flax tests pytorch independent * fix typo * finish * improve circle ci * fix return tensors * correct flax test * re-add sentencepiece * last tokenizer fixes * finish maybe now	2021-03-16 08:05:37 +03:00
Suraj Patil	d26b37e744	Speech2TextTransformer (#10175 ) * s2t * fix config * conversion script * fix import * add tokenizer * fix tok init * fix tokenizer * first version working * fix embeds * fix lm head * remove extra heads * fix convert script * handle encoder attn mask * style * better enc attn mask * override _prepare_attention_mask_for_generation * handle attn_maks in encoder and decoder * input_ids => input_features * enable use_cache * remove old code * expand embeddings if needed * remove logits bias * masked_lm_loss => loss * hack tokenizer to support feature processing * fix model_input_names * style * fix error message * doc * remove inputs_embeds * remove input_embeds * remove unnecessary docstring * quality * SpeechToText => Speech2Text * style * remove shared_embeds * subsample => conv * remove Speech2TextTransformerDecoderWrapper * update output_lengths formula * fix table * remove max_position_embeddings * update conversion scripts * add possibility to do upper case for now * add FeatureExtractor and Processor * add tests for extractor * require_torch_audio => require_torchaudio * add processor test * update import * remove classification head * attention mask is now 1D * update docstrings * attention mask should be of type long * handle attention mask from generate * alwyas return attention_mask * fix test * style * doc * Speech2TextTransformer => Speech2Text * Speech2TextTransformerConfig => Speech2TextConfig * remove dummy_inputs * nit * style * multilinguial tok * fix tokenizer * add tgt_lang setter * save lang_codes * fix tokenizer * add forced_bos_token_id to tokenizer * apply review suggestions * add torchaudio to extra deps * add speech deps to CI * fix dep * add libsndfile to ci * libsndfile1 * add speech to extras all * libsndfile1 -> libsndfile1 * libsndfile * libsndfile1-dev * apt update * add sudo to install * update deps table * install libsndfile1-dev on CI * tuple to list * init conv layer * add model tests * quality * add integration tests * skip_special_tokens * add speech_to_text_transformer in toctree * fix tokenizer * fix fp16 tests * add tokenizer tests * fix copyright * input_values => input_features * doc * add model in readme * doc * change checkpoint names * fix copyright * fix code example * add max_model_input_sizes in tokenizer * fix integration tests * add do_lower_case to tokenizer * remove clamp trick * fix "Add modeling imports here" * fix copyrights * fix tests * SpeechToTextTransformer => SpeechToText * fix naming * fix table formatting * fix typo * style * fix typos * remove speech dep from extras[testing] * fix copies * rename doc file, * put imports under is_torch_available * run feat extract tests when torch is available * dummy objects for processor and extractor * fix imports in tests * fix import in modeling test * fxi imports * fix torch import * fix imports again * fix positional embeddings * fix typo in import * adapt new extractor refactor * style * fix torchscript test * doc * doc * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix docs, copied from, style * fix docstring * handle imports * remove speech from all extra deps * remove s2t from seq2seq lm mapping * better names * skip training tests * add install instructions * List => Tuple * doc * fix conversion script * fix urls * add instruction for libsndfile * fix fp16 test Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-03-10 21:42:04 +05:30
Sylvain Gugger	7da995c00c	Fix embeddings for PyTorch 1.8 (#10549 ) * Fix embeddings for PyTorch 1.8 * Try with PyTorch 1.8.0 * Fix embeddings init * Fix copies * Typo * More typos	2021-03-05 16:18:48 -05:00
Lysandre	dc9aaa3848	Pin torch to 1.7.1 in tests while we resolve issues	2021-03-05 07:57:35 -05:00
Lysandre	093b88f4e9	Update scatter to use torch 1.8.0	2021-03-05 07:31:51 -05:00
Lysandre	3591844306	v4.3.3 docs	2021-02-24 15:19:01 -05:00
Stas Bekman	d478257d9b	[CI] build docs faster (#10115 ) I assume the CI machine should have at least 4 cores, so let's build docs faster	2021-02-10 03:02:39 -05:00
Sylvain Gugger	0c3d23dff7	Add patch releases to the doc	2021-02-09 14:17:09 -05:00
Lysandre	bf1a06a437	Docs for v4.3.1 release	2021-02-09 10:02:50 +01:00
Lysandre	ba542ffb49	Fix deployment script	2021-02-09 08:43:00 +01:00
Lysandre	0dd579c9cf	Docs for v4.3.0	2021-02-08 18:53:24 +01:00
Sylvain Gugger	45aaf5f7ab	A few fixes in the documentation (#10033 )	2021-02-08 05:02:01 -05:00
Lysandre	ad2c431097	Update doc deployment script path	2021-02-05 11:18:59 +01:00
Lysandre	95a5f271e5	Update doc deployment script	2021-02-05 11:10:29 +01:00
Sylvain Gugger	3be965c5db	Update doc for pre-release (#10014 ) * Update doc for pre-release * Use stable as default * Use the right commit :facepalms:	2021-02-04 16:52:27 -05:00
Lysandre Debut	910aa89671	Temporarily deactivate TPU tests while we work on fixing them (#9720 )	2021-01-21 04:17:39 -05:00
Lysandre	33a8497db8	v4.2.0 documentation	2021-01-13 16:15:40 +01:00
Lysandre	bd40345d3e	v4.1.1 docs	2020-12-17 11:28:38 -05:00
Lysandre	f83d9c8da7	v4.1.0 docs	2020-12-17 10:16:07 -05:00
NielsRogge	1551e2dc6d	[WIP] Tapas v4 (tres) (#9117 ) * First commit: adding all files from tapas_v3 * Fix multiple bugs including soft dependency and new structure of the library * Improve testing by adding torch_device to inputs and adding dependency on scatter * Use Python 3 inheritance rather than Python 2 * First draft model cards of base sized models * Remove model cards as they are already on the hub * Fix multiple bugs with integration tests * All model integration tests pass * Remove print statement * Add test for convert_logits_to_predictions method of TapasTokenizer * Incorporate suggestions by Google authors * Fix remaining tests * Change position embeddings sizes to 512 instead of 1024 * Comment out positional embedding sizes * Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES * Added more model names * Fix truncation when no max length is specified * Disable torchscript test * Make style & make quality * Quality * Address CI needs * Test the Masked LM model * Fix the masked LM model * Truncate when overflowing * More much needed docs improvements * Fix some URLs * Some more docs improvements * Test PyTorch scatter * Set to slow + minify * Calm flake8 down * First commit: adding all files from tapas_v3 * Fix multiple bugs including soft dependency and new structure of the library * Improve testing by adding torch_device to inputs and adding dependency on scatter * Use Python 3 inheritance rather than Python 2 * First draft model cards of base sized models * Remove model cards as they are already on the hub * Fix multiple bugs with integration tests * All model integration tests pass * Remove print statement * Add test for convert_logits_to_predictions method of TapasTokenizer * Incorporate suggestions by Google authors * Fix remaining tests * Change position embeddings sizes to 512 instead of 1024 * Comment out positional embedding sizes * Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES * Added more model names * Fix truncation when no max length is specified * Disable torchscript test * Make style & make quality * Quality * Address CI needs * Test the Masked LM model * Fix the masked LM model * Truncate when overflowing * More much needed docs improvements * Fix some URLs * Some more docs improvements * Add add_pooling_layer argument to TapasModel Fix comments by @sgugger and @patrickvonplaten * Fix issue in docs + fix style and quality * Clean up conversion script and add task parameter to TapasConfig * Revert the task parameter of TapasConfig Some minor fixes * Improve conversion script and add test for absolute position embeddings * Improve conversion script and add test for absolute position embeddings * Fix bug with reset_position_index_per_cell arg of the conversion cli * Add notebooks to the examples directory and fix style and quality * Apply suggestions from code review * Move from `nielsr/` to `google/` namespace * Apply Sylvain's comments Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Rogge Niels <niels.rogge@howest.be> Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: sgugger <sylvain.gugger@gmail.com>	2020-12-15 17:08:49 -05:00
Lysandre Debut	91fa707217	Remove docs only check (#9065 )	2020-12-11 10:27:31 -05:00
Sylvain Gugger	783d7d2629	Reorganize examples (#9010 ) * Reorganize example folder * Continue reorganization * Change requirements for tests * Final cleanup * Finish regroup with tests all passing * Copyright * Requirements and readme * Make a full link for the documentation * Address review comments * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add symlink * Reorg again * Apply suggestions from code review Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com> * Adapt title * Update to new strucutre * Remove test * Update READMEs Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>	2020-12-11 10:07:02 -05:00
Stas Bekman	5e637e6c69	[wip] [ci] doc-job-skip take #4 dry-run (#8980 ) * ci-doc-job-skip-take-4 * wip * wip * wip * wip * skip yaml * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * wip * ready to test * yet another way * trying with HEAD * trying with head.sha * trying with head.sha fix * trying with head.sha fix wip * undo * try to switch to sha * current branch * current branch * PR number check * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride * joy ride	2020-12-09 15:36:36 -05:00
Lysandre Debut	2ae7388eee	Check table as independent script (#8976 )	2020-12-07 19:55:12 -05:00
Julien Chaumond	28fa014a1f	transformers-cli: LFS multipart uploads (> 5GB) (#8663 ) * initial commit * [cli] lfs commands * Fix FileSlice * Tweak to FileSlice * [hf_api] Backport filetype arg from `datasets` cc @lhoestq * Silm down the CI while i'm working * Ok let's try this in CI * Update config.yml * Do not try this at home * one more try * Update lfs.py * Revert "Tweak to FileSlice" This reverts commit `d7e32c4b35`. * Update test_hf_api.py * Update test_hf_api.py * Update test_hf_api.py * CI still green? * make CI green again? * Update test_hf_api.py * make CI red again? * Update test_hf_api.py * add CI style back * Fix CI? * oh my * doc + switch back to real staging endpoint * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com> * Fix docblock + f-strings Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>	2020-12-07 16:38:39 -05:00
Stas Bekman	37f4c24f10	> 30 files leads to hanging on --More-- cancel debug printing for now. As it can be seen lead to a failing test here: https://app.circleci.com/pipelines/github/huggingface/transformers/16894/workflows/cc86f7a9-4020-45af-8ab3-c22f79b427cf/jobs/131924	2020-12-07 12:18:05 -08:00
Stas Bekman	73c51f7fcd	[ci] skip doc jobs - circleCI is not reliable - disable skip for now (#8926 ) * disable skipping, but leave logging for the future	2020-12-04 10:13:42 -08:00
Stas Bekman	24f0c2fe33	[ci] skip doc jobs take #3 (#8885 ) * check that we get any match first * docs only * 2 docs only * add code * restore	2020-12-02 10:06:45 -05:00
Stas Bekman	693ac3594b	disable job skip - need more work reference: https://github.com/huggingface/transformers/pull/8853#issuecomment-736779863	2020-12-01 12:03:29 -08:00
Stas Bekman	21db560df3	[CI] skip docs-only jobs take #2 (#8853 ) * restore skip * Revert "Remove deprecated `evalutate_during_training` (#8852)" This reverts commit `5530299096`. * check that pipeline.git.base_revision is defined before proceeding * Revert "Revert "Remove deprecated `evalutate_during_training` (#8852)"" This reverts commit `dfec84db3f`. * check that pipeline.git.base_revision is defined before proceeding * doc only * doc + code * restore * restore * typo	2020-12-01 13:15:25 -05:00
LysandreJik	9995a341c9	Update docs	2020-11-30 12:07:52 -05:00
Sylvain Gugger	08e707633c	Comment the skip job on doc line	2020-11-30 10:51:25 -05:00
Stas Bekman	c239dcda83	[CI] implement job skipping for doc-only PRs (#8826 ) * implement job skipping for doc-only PRs * silent grep is crucial * wip * wip * wip * wip * wip * wip * wip * wip * let's add doc * let's add code * revert test commits * restore * Better name * Better name * Better name * some more testing * some more testing * some more testing * finish testing	2020-11-29 11:31:30 -05:00
Julien Chaumond	0cc5ab1333	Improve bert-japanese tokenizer handling (#8659 ) * Make ci fail * Try to make tests actually run? * CI finally failing? * Fix CI * Revert "Fix CI" This reverts commit `ca7923be73`. * Ooops wrong one * one more try * Ok ok let's move this elsewhere * Alternative to globals() (#8667) * Alternative to globals() * Error is raised later so return None * Sentencepiece not installed make some tokenizers None * Apply Lysandre wisdom * Slightly clearer comment? cc @sgugger Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-11-23 11:15:02 -05:00
Sylvain Gugger	6494910f27	Add sentencepiece to the CI and fix tests (#8672 ) * Fix the CI and tests * Fix quality * Remove that m form nowhere	2020-11-19 16:44:20 -05:00
Sylvain Gugger	bb03a14edd	Update doc for v3.5.1	2020-11-13 10:29:58 -05:00
Funtowicz Morgan	121c24efa4	Update deploy-docs dependencies on CI to enable Flax (#8475 ) * Update deploy-docs dependencies on CI to enable Flax Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Added pair of "" Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-11-11 18:31:41 -05:00
Funtowicz Morgan	a5b682329c	Flax/Jax documentation (#8331 ) * First addition of Flax/Jax documentation Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * make style * Ensure input order match between Bert & Roberta Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Install dependencies "all" when building doc Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * wraps build_doc deps with "" Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing @sgugger comments. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Use list to highlight JAX features. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Make style. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Let's not look to much into the future for now. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Style Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-11-11 14:53:36 -05:00
Lysandre	aec51e5696	v3.5.0 documentation	2020-11-10 08:58:47 -05:00
Sylvain Gugger	854b44aa38	Revert size change as it doesn't change anything	2020-11-04 11:13:24 -05:00
Sylvain Gugger	414985c427	Upgrade resource for doc building	2020-11-04 10:44:19 -05:00
Stas Bekman	1bb4bba53c	[CIs] Better reports everywhere (#8275 ) * make it possible to invoke testconf.py in both test suites without crashing on having the same option added * perl -pi -e 's\|--make_reports\|--make-reports\|' to be consistent with other opts * add `pytest --make-reports` to all CIs (and artifacts) * fix	2020-11-03 16:57:12 -05:00
Sylvain Gugger	4c19f3baab	Clean Trainer tests and datasets dep (#8268 )	2020-11-03 15:50:55 -05:00
Sylvain Gugger	691176283d	Add a template for examples and apply it for mlm and plm examples (#8153 ) * Add a template for example scripts and apply it to mlm * Formatting * Fix test * Add plm script * Add a template for example scripts and apply it to mlm * Formatting * Fix test * Add plm script * Add a template for example scripts and apply it to mlm * Formatting * Fix test * Add plm script * Styling	2020-10-29 13:38:11 -04:00
Lysandre Debut	1b6c8d4811	Update CI cache (#8126 )	2020-10-28 13:59:43 -04:00
Lysandre Debut	a0906068cf	Fully remove codecov (#8093 )	2020-10-27 14:14:13 -04:00
Stas Bekman	bfd5e370a7	[CI] generate separate report files as artifacts (#7995 ) * better reports * a whole bunch of reports in their own files * clean up * improvements * github artifacts experiment * style * complete the report generator with multiple improvements/fixes * fix * save all reports under one dir to easy upload * can remove temp failing tests * doc fix * some cleanup	2020-10-27 09:25:07 -04:00
Sylvain Gugger	08f534d2da	Doc styling (#8067 ) * Important files * Styling them all * Revert "Styling them all" This reverts commit `7d029395fd`. * Syling them for realsies * Fix syntax error * Fix benchmark_utils * More fixes * Fix modeling auto and script * Remove new line * Fixes * More fixes * Fix more files * Style * Add FSMT * More fixes * More fixes * More fixes * More fixes * Fixes * More fixes * More fixes * Last fixes * Make sphinx happy	2020-10-26 18:26:02 -04:00
Thomas Wolf	3a40cdf58d	[tests\|tokenizers] Refactoring pipelines test backbone - Small tokenizers improvements - General tests speedups (#7970 ) * WIP refactoring pipeline tests - switching to fast tokenizers * fix dialog pipeline and fill-mask * refactoring pipeline tests backbone * make large tests slow * fix tests (tf Bart inactive for now) * fix doc... * clean up for merge * fixing tests - remove bart from summarization until there is TF * fix quality and RAG * Add new translation pipeline tests - fix JAX tests * only slow for dialog * Fixing the missing TF-BART imports in modeling_tf_auto * spin out pipeline tests in separate CI job * adding pipeline test to CI YAML * add slow pipeline tests * speed up tf and pt join test to avoid redoing all the standalone pt and tf tests * Update src/transformers/tokenization_utils_base.py Co-authored-by: Sam Shleifer <sshleifer@gmail.com> * Update src/transformers/pipelines.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/testing_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add require_torch and require_tf in is_pt_tf_cross_test Co-authored-by: Sam Shleifer <sshleifer@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-10-23 15:58:19 +02:00
Lysandre	467573ddde	Fix documentation redirect	2020-10-22 15:37:51 -04:00
Lysandre	ef0ac063c9	Docs for v3.4.0	2020-10-20 16:29:00 +02:00
Stas Bekman	ca37db0559	[flax] fix repo_check (#7914 ) * [flax] fix repo_check Unless, this is actually a problem, this adds `modeling_flax_utils` to ignore list. otherwise currently it expects to have a 'tests/test_modeling_flax_utils.py' for it. for context please see: https://github.com/huggingface/transformers/pull/3722#issuecomment-712360415 * fix 2 more issues * merge https://github.com/huggingface/transformers/pull/7919/	2020-10-20 07:55:40 -04:00
Funtowicz Morgan	8f8f8d99fc	Integrate Bert-like model on Flax runtime. (#3722 ) * WIP flax bert * Initial commit Bert Jax/Flax implementation. * Embeddings working and equivalent to PyTorch. * Move embeddings in its own module BertEmbeddings * Added jax.jit annotation on forward call * BertEncoder on par with PyTorch ! :D * Add BertPooler on par with PyTorch !! * Working Jax+Flax implementation of BertModel with < 1e-5 differences on the last layer. * Fix pooled output to take only the first token of the sequence. * Refactoring to use BertConfig from transformers. * Renamed FXBertModel to FlaxBertModel * Model is now initialized in FlaxBertModel constructor and reused. * WIP JaxPreTrainedModel * Cleaning up the code of FlaxBertModel * Added ability to load Flax model saved through save_pretrained() * Added ability to convert Pytorch Bert model to FlaxBert * FlaxBert can now load every Pytorch Bert model with on-the-fly conversion * Fix hardcoded shape values in conversion scripts. * Improve the way we handle LayerNorm conversion from PyTorch to Flax. * Added positional embeddings as parameter of BertModel with default to np.arange. * Let's roll FlaxRoberta ! * Fix missing position_ids parameters on predict for Bert * Flax backend now supports batched inputs Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Make it possible to load msgpacked model on convert from pytorch in last resort. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Moved save_pretrained to Jax base class along with more constructor parameters. * Use specialized, model dependent conversion functio. * Expose `is_flax_available` in file_utils. * Added unittest for Flax models. * Added run_tests_flax to the CI. * Introduce FlaxAutoModel * Added more unittests * Flax model reference the _MODEL_ARCHIVE_MAP from PyTorch model. * Addressing review comments. * Expose seed in both Bert and Roberta * Fix typo suggested by @stefan-it Co-Authored-By: Stefan Schweter <stefan@schweter.it> * Attempt to make style * Attempt to make style in tests too * Added jax & jaxlib to the flax optional dependencies. * Attempt to fix flake8 warnings ... * Redo black again and again * When black and flake8 fight each other for a space ... 💥 💥 💥 * Try removing trailing comma to make both black and flake happy! * Fix invalid is_<framework>_available call, thanks @LysandreJik 🎉 * Fix another invalid import in flax_roberta test * Bump and pin flax release to 0.1.0. * Make flake8 happy, remove unused jax import * Change the type of the catch for msgpack. * Remove unused import. * Put seed as optional constructor parameter. * trigger ci again * Fix too much parameters in BertAttention. * Formatting. * Simplify Flax unittests to avoid machine crashes. * Fix invalid number of arguments when raising issue for an unknown model. * Address @bastings comment in PR, moving jax.jit decorated outside of __call__ * Fix incorrect path to require_flax/require_pytorch functions. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Attempt to make style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Correct rebasing of circle-ci dependencies Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Fix import sorting. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Fix unused imports. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Again import sorting... Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Installing missing nlp dependency for flax unittests. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Fix laoding of model for Flax implementations. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * jit the inner function call to make JAX-compatible Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Format ! Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Flake one more time 🎶 Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Rewrites BERT in Flax to the new Linen API (#7211) * Rewrite Flax HuggingFace PR to Linen * Some fixes * Fix tests * Fix CI with change of name of nlp (#7054) * nlp -> datasets * More nlp -> datasets * Woopsie * More nlp -> datasets * One last * Expose `is_flax_available` in file_utils. * Added run_tests_flax to the CI. * Attempt to make style * trigger ci again * Fix import sorting. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Revert "Rewrites BERT in Flax to the new Linen API (#7211)" This reverts commit 23703a5eb3364e26a1cbc3ee34b4710d86a674b0. * Remove jnp.lax references Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Reintroduce Linen changes ... Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Use jax native's gelu function. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Renaming BertModel to BertModule to highlight the fact this is the Flax Module object. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Rewrite FlaxAutoModel test to not rely on pretrained_model_archive_map Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Remove unused variable in BertModule. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Remove unused variable in BertModule again Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Attempt to have is_flax_available working again. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Introduce JAX TensorType Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Improve ImportError message when trying to convert to various TensorType format. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Makes Flax model jittable. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Ensure flax models are jittable in unittests. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Remove unused imports. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Ensure jax imports are guarded behind is_flax_available. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style again Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style again again Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style again again again Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Update src/transformers/file_utils.py Co-authored-by: Marc van Zee <marcvanzee@gmail.com> * Bump flax to it's latest version Co-authored-by: Marc van Zee <marcvanzee@gmail.com> * Bump jax version to at least 0.2.0 Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Update the unittest to use TensorType.JAX Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * isort import in tests. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Match new flax parameters name "params" Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Remove unused imports. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Add flax models to transformers __init__ Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Attempt to address all CI related comments. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Correct circle.yml indent. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Correct circle.yml indent (2) Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Remove coverage from flax tests Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Addressing many naming suggestions from comments Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Simplify for loop logic to interate over layers in FlaxBertLayerCollection Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * use f-string syntax for formatting logs. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Use config property from FlaxPreTrainedModel. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * use "cls_token" instead of "first_token" variable name. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * use "hidden_state" instead of "h" variable name. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Correct class reference in docstring to link to Flax related modules. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Added HF + Google Flax team copyright. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make Roberta independent from Bert Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Move activation functions to flax_utils. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Move activation functions to flax_utils for bert. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Added docstring for BERT Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Update import for Bert and Roberta tokenizers Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * fix-copies Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Correct FlaxRobertaLayer to match PyTorch. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Use the same store_artifact for flax unittest Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Style. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Make sure gradient are disabled only locally for flax unittest using torch equivalence. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Use relative imports Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> Co-authored-by: Stefan Schweter <stefan@schweter.it> Co-authored-by: Marc van Zee <marcvanzee@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-19 09:55:41 -04:00
Stas Bekman	805a202e1a	[CIs] report slow tests add --durations=0 to some pytest jobs (#7884 ) * add --durations=50 to some pytest runs * report all tests	2020-10-19 08:23:14 -04:00
Thomas Wolf	ba8c4d0ac0	[Dependencies\|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659 ) * splitting fast and slow tokenizers [WIP] * [WIP] splitting sentencepiece and tokenizers dependencies * update dummy objects * add name_or_path to models and tokenizers * prefix added to file names * prefix * styling + quality * spliting all the tokenizer files - sorting sentencepiece based ones * update tokenizer version up to 0.9.0 * remove hard dependency on sentencepiece 🎉 * and removed hard dependency on tokenizers 🎉 * update conversion script * update missing models * fixing tests * move test_tokenization_fast to main tokenization tests - fix bugs * bump up tokenizers * fix bert_generation * update ad fix several tokenizers * keep sentencepiece in deps for now * fix funnel and deberta tests * fix fsmt * fix marian tests * fix layoutlm * fix squeezebert and gpt2 * fix T5 tokenization * fix xlnet tests * style * fix mbart * bump up tokenizers to 0.9.2 * fix model tests * fix tf models * fix seq2seq examples * fix tests without sentencepiece * fix slow => fast conversion without sentencepiece * update auto and bert generation tests * fix mbart tests * fix auto and common test without tokenizers * fix tests without tokenizers * clean up tests lighten up when tokenizers + sentencepiece are both off * style quality and tests fixing * add sentencepiece to doc/examples reqs * leave sentencepiece on for now * style quality split hebert and fix pegasus * WIP Herbert fast * add sample_text_no_unicode and fix hebert tokenization * skip FSMT example test for now * fix style * fix fsmt in example tests * update following Lysandre and Sylvain's comments * Update src/transformers/testing_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/testing_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/tokenization_utils_base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/tokenization_utils_base.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-10-18 20:51:24 +02:00
Sylvain Gugger	28d183c90c	Allow soft dependencies in the namespace with ImportErrors at use (#7537 ) * PoC on RAG * Format class name/obj name * Better name in message * PoC on one TF model * Add PyTorch and TF dummy objects + script * Treat scikit-learn * Bad copy pastes * Typo	2020-10-05 09:12:04 -04:00
Lysandre	16c213820e	Update docs to version v3.3.0	2020-09-28 16:32:00 +02:00
Stas Bekman	df53643807	[code quality] fix confused flake8 (#7309 ) * fix confused flake We run `black --target-version py35 ...` but flake8 doesn't know that, so currently with py38 flake8 fails suggesting that black should have reformatted 63 files. Indeed if I run: ``` black --line-length 119 --target-version py38 examples templates tests src utils ``` it indeed reformats 63 files. The only solution I found is to create a black config file as explained at https://github.com/psf/black#configuration-format, which is what this PR adds. Now flake8 knows that py35 is the standard and no longer gets confused regardless of the user's python version. * adjust the other files that will now rely on black's config file	2020-09-22 22:12:36 -04:00
Lysandre	6e21f24220	Documentation version	2020-09-22 18:04:39 +02:00
Sylvain Gugger	e4b94d8e58	Copy code from Bert to Roberta and add safeguard script (#7219 ) * Copy code from Bert to Roberta and add safeguard script * Fix docstring * Comment code * Formatting * Update src/transformers/modeling_roberta.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Add test and fix bugs * Fix style and make new comand Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2020-09-22 05:02:27 -04:00
Stas Bekman	79111b77d2	remove deprecated flag (#7171 ) ``` /home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive! "W0501: The following deprecated CLI flags were used and ignored: " ```	2020-09-17 05:52:12 -04:00
Sylvain Gugger	514486739c	Fix CI with change of name of nlp (#7054 ) * nlp -> datasets * More nlp -> datasets * Woopsie * More nlp -> datasets * One last	2020-09-10 14:51:08 -04:00
Lysandre	3726754a6c	v3.1.0 documentation	2020-09-01 14:39:07 +02:00
Stas Bekman	59a6a32a61	add a final report to all pytest jobs (#6861 ) we had it added for one job, please add it to all pytest jobs - we need the output of what tests were run to debug the codecov issue. thank you!	2020-08-31 22:47:23 -04:00
Sylvain Gugger	abc0202194	More tests to Trainer (#6699 ) * More tests to Trainer * Add warning in the doc	2020-08-25 07:07:36 -04:00
Sylvain Gugger	a573777901	Update repo to isort v5 (#6686 ) * Run new isort * More changes * Update CI, CONTRIBUTING and benchmarks	2020-08-24 11:03:01 -04:00
Masatoshi Suzuki	48c6c6139f	Support additional dictionaries for BERT Japanese tokenizers (#6515 ) * Update BERT Japanese tokenizers * Update CircleCI config to download unidic * Specify to use the latest dictionary packages	2020-08-17 12:00:23 +08:00
zcain117	fd3de2000f	Get GKE logs via kubectl logs instead of gcloud logging read. (#6446 )	2020-08-12 11:46:24 -04:00
Sylvain Gugger	a8db954cda	Activate check on the CI (#6427 ) * Activate check on the CI * Fix repo inconsistencies * Don't document too much	2020-08-12 08:42:14 -04:00
Lysandre	8a3db6b303	Add TPU testing once again	2020-08-11 08:49:37 +02:00
zcain117	f65ac1faf2	Add missing docker arg for TPU CI. (#6393 )	2020-08-11 02:48:49 -04:00
Lysandre	1bbc54a87c	Temporarily de-activate TPU CI	2020-08-10 08:11:40 +02:00
zcain117	1b8a7ffcfd	Add setup for TPU CI to run every hour. (#6219 ) * Add setup for TPU CI to run every hour. * Re-organize config.yml Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>	2020-08-07 11:17:07 -04:00
Lysandre Debut	80a0676a51	CI dependency wheel caching (#6287 ) * Single workflow cache test Remove cache dir, re-trigger cache Only pip archives Not sudo when pip * All workflow cache Remove no-cache-dir instruction Remove last sudo occurrences v0.3	2020-08-07 02:48:59 -04:00
Lysandre Debut	1d5c3a3d96	Test with --no-cache-dir (#6235 )	2020-08-04 03:20:19 -04:00
Lysandre Debut	d740351f7d	Upgrade pip when doing CI (#6234 ) * Upgrade pip when doing CI * Don't forget Github CI	2020-08-04 02:37:12 -04:00
Paul O'Leary McCann	cf3cf304ca	Replace mecab-python3 with fugashi for Japanese tokenization (#6086 ) * Replace mecab-python3 with fugashi This replaces mecab-python3 with fugashi for Japanese tokenization. I am the maintainer of both projects. Both projects are MeCab wrappers, so the underlying C++ code is the same. fugashi is the newer wrapper and doesn't use SWIG, so for basic use of the MeCab API it's easier to use. This code insures the use of a version of ipadic installed via pip, which should make versioning and tracking down issues easier. fugashi has wheels for Windows, OSX, and Linux, which will help with issues with installing old versions of mecab-python3 on Windows. Compared to mecab-python3, because fugashi doesn't use SWIG, it doesn't require a C++ runtime to be installed on Windows. In adding this change I removed some code dealing with `cursor`, `token_start`, and `token_end` variables. These variables didn't seem to be used for anything, it is unclear to me why they were there. I ran the tests and they passed, though I couldn't figure out how to run the slow tests (`--runslow` gave an error) and didn't try testing with Tensorflow. * Style fix * Remove unused variable Forgot to delete this... * Adapt doc with install instructions * Fix typo Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-07-31 04:41:14 -04:00
Stas Bekman	daa5dd1202	add a summary report flag for run_examples on CI (#6035 ) Currently, it's hard to derive which example tests were run on CI, and which weren't. Adding `-rA` flag to `pytest`, will now include a summary like: ``` ==================================================================== short test summary info ===================================================================== PASSED examples/test_examples.py::ExamplesTests::test_generation PASSED examples/test_examples.py::ExamplesTests::test_run_glue PASSED examples/test_examples.py::ExamplesTests::test_run_language_modeling PASSED examples/test_examples.py::ExamplesTests::test_run_squad FAILED examples/test_examples.py::ExamplesTests::test_run_pl_glue - AttributeError: 'Namespace' object has no attribute 'gpus' ============================================================ 1 failed, 4 passed, 8 warnings in 42.96s ============================================================ ``` which makes it easier to validate whether some example is being covered by CI or not.	2020-07-26 14:09:14 -04:00
sgugger	a540405213	Fix commit hash for stable doc	2020-07-24 09:07:40 -04:00
Lysandre	b9d8af07e6	Update stable doc	2020-07-09 11:06:23 -04:00
Lysandre	5c82bf6831	Update stable doc	2020-07-09 10:16:13 -04:00
Lysandre	1d2332861f	Post v3.0.2 release commit	2020-07-06 18:56:47 -04:00
Sam Shleifer	ac61114592	[CI] gh runner doesn't use -v, cats new result (#5409 )	2020-06-30 16:12:14 -04:00
Lysandre Debut	b9ee87f5c7	Doc for v3.0.0 (#5366 ) * Doc for v3.0.0 * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-06-29 11:08:54 -04:00
Sam Shleifer	bf0d12c220	CircleCI stores cleaner output at test_outputs.txt (#5291 )	2020-06-26 13:59:31 -04:00
Sylvain Gugger	0148c262e7	Fix first test (#5255 )	2020-06-24 15:16:04 -04:00
Sylvain Gugger	70c1e1d2d5	Use master _static (#5253 ) * Use _static from master everywhere * Copy to existing too	2020-06-24 15:06:14 -04:00
Sylvain Gugger	f216b60671	Fix deploy doc (#5246 ) * Try with the same command * Try like this	2020-06-24 10:59:06 -04:00
Sylvain Gugger	49f6e7a3c6	Add some prints to debug (#5244 )	2020-06-24 10:37:01 -04:00
Sylvain Gugger	64c393ee74	Don't recreate old docs (#5243 )	2020-06-24 09:59:07 -04:00
Sylvain Gugger	c439752482	Switch master/stable doc and add older releases (#5193 )	2020-06-22 16:38:53 -04:00
Lysandre Debut	4e741efa92	Have documentation fail on warning (#5189 ) * Have documentation fail on warning * Force ci failure * Revert "Force ci failure" This reverts commit `f0a4666ec2`.	2020-06-22 15:49:50 -04:00
Harutaka Kawamura	b0c9fbb293	Add workflow to build docs (#3763 )	2020-04-17 11:23:18 -04:00
Julien Chaumond	d0c36a7b72	[ci] Partial revert of `18eec3a984` due to `fbc5bf10cf`	2020-03-24 12:10:43 -04:00
Julien Chaumond	18eec3a984	[ci] simpler way to load correct version of isort hat/tip @bramvanroy	2020-03-23 10:03:22 -04:00
Thomas Wolf	2187c49f5c	CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186 ) * memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality	2020-03-17 10:17:11 -04:00
Julien Chaumond	d6ef587a10	[ci] Fixup `e36bd94345`	2020-02-28 23:19:17 -05:00
Julien Chaumond	e36bd94345	[ci] Run all tests on (self-hosted) GPU (#3020 ) * Create self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * do not run slow tests, for now * [ci] For comparison with circleci, let's also run CPU-tests * [ci] reorganize * clearer filenames * [ci] Final tweaks before merging * rm slow tests on circle ci * Trigger CI * On GPU this concurrency was way too high	2020-02-28 21:11:08 -05:00
Julien Chaumond	e693cd1e87	[ci] Run slow tests every day	2020-02-24 19:54:47 -05:00
Julien Chaumond	4fc63151af	[ci] Attempt to fix #2844	2020-02-24 19:51:34 -05:00
Lysandre	22b2b5790e	Documentation v2.5.0	2020-02-19 11:53:30 -05:00
Sam Shleifer	0ed630f139	Attempt to increase timeout for circleci slow tests (#2844 )	2020-02-13 09:11:03 -05:00
Morgan Funtowicz	6aa7973aec	Fix circleci cuInit error on Tensorflow >= 2.1.0. Tensorflow 2.1.0 introduce a new dependency model where pip install tensorflow would install tf with GPU support. Before it would just install with CPU support, thus CircleCI is looking for NVidia driver version at initialization of the tensorflow related tests but fails as their is no NVidia Driver running. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-10 13:24:37 +01:00
Lysandre	0aa40e9569	v2.4.0 documentation	2020-01-31 09:55:34 -05:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Aymeric Augustin	10724a8123	Run the slow tests every Monday morning.	2019-12-24 09:09:43 +01:00
Aymeric Augustin	8a6881822a	Run some tests on Python 3.7. This will improve version coverage.	2019-12-23 21:06:23 +01:00
Aymeric Augustin	76a1417f2a	Include all optional dependencies in extras. Take advantage of this to simplify the Circle CI configuration. Don't bother with tensorboardX: it's a fallback for PyTorch < 1.1.0.	2019-12-23 19:14:31 +01:00
Aymeric Augustin	23dad8447c	Install deps from setup.py for building docs. requirements.txt isn't up to date.	2019-12-23 17:06:32 +01:00
Thomas Wolf	ba2378ced5	Merge pull request #2264 from upura/fix-doclink Fix doc link in README	2019-12-23 12:31:00 +01:00
Aymeric Augustin	0dddc1494d	Remove py3 marker.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	6be7cdda66	Move source code inside a src subdirectory. This prevents transformers from being importable simply because the CWD is the root of the git repository, while not being importable from other directories. That led to inconsistent behavior, especially in examples. Once you fetch this commit, in your dev environment, you must run: $ pip uninstall transformers $ pip install -e .	2019-12-22 14:15:13 +01:00
Aymeric Augustin	ced0a94204	Switch test files to the standard test_*.py scheme.	2019-12-22 14:15:13 +01:00
Aymeric Augustin	067395d5c5	Move tests outside of library.	2019-12-22 13:47:17 +01:00
Aymeric Augustin	c11b3e2926	Sort imports for optional third-party libraries. These libraries aren't always installed in the virtual environment where isort is running. Declaring them properly avoids mixing these third-party imports with local imports.	2019-12-22 11:19:13 +01:00
Aymeric Augustin	577a03664d	Enforce flake8 in CI.	2019-12-22 11:00:04 +01:00
Aymeric Augustin	9e80fc7b2f	Enforce isort in CI. We need https://github.com/timothycrosley/isort/pull/1000 but there's no release with this fix yet, so we'll install from GitHub.	2019-12-22 10:59:00 +01:00
upura	9d00f78f16	fix doc link	2019-12-22 16:07:05 +09:00
Aymeric Augustin	6e5291a915	Enforce black in CI.	2019-12-21 17:53:18 +01:00
Aymeric Augustin	343c094f21	Run examples separately from tests. This optimizes the total run time of the Circle CI test suite.	2019-12-21 08:43:19 +01:00
Aymeric Augustin	80caf79d07	Prevent excessive parallelism in PyTorch. We're already using as many processes in parallel as we have CPU cores. Furthermore, the number of core may be incorrectly calculated as 36 (we've seen this in pytest-xdist) which make compound the problem. PyTorch performance craters without this.	2019-12-21 08:43:19 +01:00
Aymeric Augustin	bb3bfa2d29	Distribute tests from the same file to the same worker. This should prevent two issues: - hitting API rate limits for tests that hit the HF API - multiplying the cost of expensive test setups	2019-12-21 08:43:19 +01:00
Aymeric Augustin	29cbab98f0	Parallelize tests on Circle CI. Set the number of CPUs manually based on the Circle CI resource class, or else we're getting 36 CPUs, which is far too much (perhaps that's the underlying hardware and not what Circle CI allocates to us). Don't parallelize the custom tokenizers tests because they take less than one second to run and parallelization actually makes them slower.	2019-12-21 08:43:19 +01:00
thomwolf	15dda5ea32	remove python 2 tests for circle-ci cc @aaugustin @julien-c @LysandreJik	2019-12-20 13:20:41 +01:00
thomwolf	e4baa68ddb	tick-tock cc @julien-c	2019-12-19 20:37:26 +01:00
Thomas Wolf	d5712f7cac	Merge branch 'master' into check-link-validity	2019-12-12 08:00:51 +01:00
Julien Chaumond	371c5ddfad	Py2 tests for Lysandre	2019-12-11 18:32:27 -05:00
Julien Chaumond	5505cf7014	Run tests on Py2 too, for Lysandre	2019-12-11 18:32:27 -05:00
Julien Chaumond	9cb97c0c0f	Actually run the tests	2019-12-11 18:32:27 -05:00
Julien Chaumond	95854c4a2f	Actually run the tests	2019-12-11 18:32:27 -05:00
Julien Chaumond	d2100428d3	Update to new test infra and only run conditionally	2019-12-11 18:32:27 -05:00
Masatoshi Suzuki	597ba7feb3	Support testing Japanese BERT tokenizers	2019-12-11 18:32:27 -05:00
Rémi Louf	f230d91b43	check the validity of links We add a script and a CI workflow to check that all download links present in the source code are valid.	2019-12-06 09:41:28 +01:00
Lysandre	45d767297a	Updated v2.2.0 doc	2019-11-27 10:12:20 -05:00
Lysandre	cc7968227e	Updated v2.2.0 doc	2019-11-26 15:52:25 -05:00
Lysandre	44b82c777f	Updated v2.2.0 doc	2019-11-26 15:15:11 -05:00
Lysandre	7c6000e412	Updated v2.2.0 doc	2019-11-26 14:55:29 -05:00
Lysandre	ae98d45991	Release: v2.2.0	2019-11-26 14:12:44 -05:00
Lysandre	be7f2aacce	[CI][DOC] Don't rebuild if folder exists - Correct directory.	2019-11-14 14:54:44 -05:00
Lysandre	8f8d69716a	[CI][DOC] Don't rebuild if folder exists.	2019-11-14 14:48:21 -05:00
Lysandre Debut	6e85bccafc	Fixed typo	2019-10-22 18:07:01 -04:00
Lysandre	fbcc5ff9fb	Change branch to master	2019-10-22 18:01:10 -04:00
Lysandre	69eba0ab19	Edit script path	2019-10-22 17:53:52 -04:00
Lysandre	bc3e57d551	Multi version doc deployment	2019-10-22 17:51:30 -04:00
thomwolf	2a4fef837a	move Circle-CI from TF2-rc0 to official TF2	2019-10-10 15:57:35 +02:00
Julien Chaumond	22d2fded2c	[docs] Fix doc auto-deploy Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2019-09-26 18:22:45 -04:00
thomwolf	31c23bd5ee	[BIG] pytorch-transformers => transformers	2019-09-26 10:15:53 +02:00
thomwolf	0f091062d4	Merge branch 'glue-example' into tf2	2019-09-25 10:21:52 +02:00
thomwolf	2167e366ba	update circleCi	2019-09-24 13:27:45 +02:00
thomwolf	e9a103c17a	bidirectional conversion TF <=> PT - extended tests	2019-09-24 13:25:50 +02:00
LysandreJik	11ac4b9555	[CI] Symbolic link for documentation	2019-09-11 10:13:44 +02:00
thomwolf	e30579f764	no pytest version checking	2019-09-08 15:02:06 +03:00
thomwolf	518307dfcd	test suite independent of framework	2019-09-08 15:02:06 +03:00
thomwolf	9d0a11a68c	update dependencies and circle-ci	2019-09-08 15:02:06 +03:00
thomwolf	ad0ab9afe9	fix test when tf is not here	2019-09-08 15:02:06 +03:00
LysandreJik	3fbf301bba	[CI] Updated resource size for python 3 tests	2019-09-02 12:35:14 -04:00
Julien Chaumond	0fd0b674e6	[ci] legible output [skip ci]	2019-08-30 20:36:26 -04:00
Julien Chaumond	b65a994f59	[ci] decrease parallelism to increase success prob	2019-08-30 20:33:16 -04:00
LysandreJik	e7fba4bef5	Documentation auto-deploy	2019-08-29 12:14:29 -04:00
thomwolf	7322c314a6	remove python2 testing for examples	2019-07-12 14:24:08 +02:00
thomwolf	273617b86d	update config - fix gpt/gpt-2 from pretrained	2019-07-11 22:45:03 +02:00
thomwolf	6b13f4cb3a	update circle-ci	2019-07-11 22:36:35 +02:00
thomwolf	2b644785f0	add tests on examples and large circle ci config	2019-07-11 22:31:50 +02:00
LysandreJik	3f56ad5aff	Updated CircleCI's config.yml to use a large resource class.	2019-07-09 18:50:59 -04:00
thomwolf	a4f980547f	remove circle ci parallelism	2019-07-05 12:31:34 +02:00
thomwolf	0bab55d5d5	[BIG] name change	2019-07-05 11:55:36 +02:00
thomwolf	cf86d23eff	parallelism in circlci	2019-07-04 17:02:21 +02:00
thomwolf	99ae5ab883	update config tests and circle-ci	2019-07-02 12:40:39 +02:00
thomwolf	411981a080	remove slow circle-ci	2019-06-20 08:54:18 +02:00
thomwolf	cd110835a0	coverage in circle-ci	2019-04-30 11:35:40 +02:00
thomwolf	1f5fc95b68	add code coverage	2019-04-30 11:05:26 +02:00
thomwolf	31d387604c	adding s3 model tests with --runslow	2019-04-17 11:58:27 +02:00
thomwolf	8197eb9f10	update Circle CI config	2019-02-11 10:22:10 +01:00
thomwolf	525eba68ab	update Circle CI	2019-02-11 10:19:25 +01:00
thomwolf	34bdb7f9cb	update circle-ci for python 2.7 and 3.5	2019-02-06 00:25:12 +01:00
Julien Chaumond	8da280ebbe	Setup CI	2018-12-20 16:33:39 -05:00

... 4 5 6 7 8 ...

435 Commits