transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
Paul O'Leary McCann	cf3cf304ca	Replace mecab-python3 with fugashi for Japanese tokenization (#6086 ) * Replace mecab-python3 with fugashi This replaces mecab-python3 with fugashi for Japanese tokenization. I am the maintainer of both projects. Both projects are MeCab wrappers, so the underlying C++ code is the same. fugashi is the newer wrapper and doesn't use SWIG, so for basic use of the MeCab API it's easier to use. This code insures the use of a version of ipadic installed via pip, which should make versioning and tracking down issues easier. fugashi has wheels for Windows, OSX, and Linux, which will help with issues with installing old versions of mecab-python3 on Windows. Compared to mecab-python3, because fugashi doesn't use SWIG, it doesn't require a C++ runtime to be installed on Windows. In adding this change I removed some code dealing with `cursor`, `token_start`, and `token_end` variables. These variables didn't seem to be used for anything, it is unclear to me why they were there. I ran the tests and they passed, though I couldn't figure out how to run the slow tests (`--runslow` gave an error) and didn't try testing with Tensorflow. * Style fix * Remove unused variable Forgot to delete this... * Adapt doc with install instructions * Fix typo Co-authored-by: sgugger <sylvain.gugger@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-07-31 04:41:14 -04:00
Stas Bekman	daa5dd1202	add a summary report flag for run_examples on CI (#6035 ) Currently, it's hard to derive which example tests were run on CI, and which weren't. Adding `-rA` flag to `pytest`, will now include a summary like: ``` ==================================================================== short test summary info ===================================================================== PASSED examples/test_examples.py::ExamplesTests::test_generation PASSED examples/test_examples.py::ExamplesTests::test_run_glue PASSED examples/test_examples.py::ExamplesTests::test_run_language_modeling PASSED examples/test_examples.py::ExamplesTests::test_run_squad FAILED examples/test_examples.py::ExamplesTests::test_run_pl_glue - AttributeError: 'Namespace' object has no attribute 'gpus' ============================================================ 1 failed, 4 passed, 8 warnings in 42.96s ============================================================ ``` which makes it easier to validate whether some example is being covered by CI or not.	2020-07-26 14:09:14 -04:00
sgugger	a540405213	Fix commit hash for stable doc	2020-07-24 09:07:40 -04:00
Lysandre	b9d8af07e6	Update stable doc	2020-07-09 11:06:23 -04:00
Lysandre	5c82bf6831	Update stable doc	2020-07-09 10:16:13 -04:00
Lysandre	1d2332861f	Post v3.0.2 release commit	2020-07-06 18:56:47 -04:00
Sam Shleifer	ac61114592	[CI] gh runner doesn't use -v, cats new result (#5409 )	2020-06-30 16:12:14 -04:00
Lysandre Debut	b9ee87f5c7	Doc for v3.0.0 (#5366 ) * Doc for v3.0.0 * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/_static/js/custom.js Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2020-06-29 11:08:54 -04:00
Sam Shleifer	bf0d12c220	CircleCI stores cleaner output at test_outputs.txt (#5291 )	2020-06-26 13:59:31 -04:00
Sylvain Gugger	0148c262e7	Fix first test (#5255 )	2020-06-24 15:16:04 -04:00
Sylvain Gugger	70c1e1d2d5	Use master _static (#5253 ) * Use _static from master everywhere * Copy to existing too	2020-06-24 15:06:14 -04:00
Sylvain Gugger	f216b60671	Fix deploy doc (#5246 ) * Try with the same command * Try like this	2020-06-24 10:59:06 -04:00
Sylvain Gugger	49f6e7a3c6	Add some prints to debug (#5244 )	2020-06-24 10:37:01 -04:00
Sylvain Gugger	64c393ee74	Don't recreate old docs (#5243 )	2020-06-24 09:59:07 -04:00
Sylvain Gugger	c439752482	Switch master/stable doc and add older releases (#5193 )	2020-06-22 16:38:53 -04:00
Lysandre Debut	4e741efa92	Have documentation fail on warning (#5189 ) * Have documentation fail on warning * Force ci failure * Revert "Force ci failure" This reverts commit `f0a4666ec2`.	2020-06-22 15:49:50 -04:00
Harutaka Kawamura	b0c9fbb293	Add workflow to build docs (#3763 )	2020-04-17 11:23:18 -04:00
Julien Chaumond	d0c36a7b72	[ci] Partial revert of `18eec3a984` due to `fbc5bf10cf`	2020-03-24 12:10:43 -04:00
Julien Chaumond	18eec3a984	[ci] simpler way to load correct version of isort hat/tip @bramvanroy	2020-03-23 10:03:22 -04:00
Thomas Wolf	2187c49f5c	CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186 ) * memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality	2020-03-17 10:17:11 -04:00
Julien Chaumond	d6ef587a10	[ci] Fixup `e36bd94345`	2020-02-28 23:19:17 -05:00
Julien Chaumond	e36bd94345	[ci] Run all tests on (self-hosted) GPU (#3020 ) * Create self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * Update self-hosted.yml * do not run slow tests, for now * [ci] For comparison with circleci, let's also run CPU-tests * [ci] reorganize * clearer filenames * [ci] Final tweaks before merging * rm slow tests on circle ci * Trigger CI * On GPU this concurrency was way too high	2020-02-28 21:11:08 -05:00
Julien Chaumond	e693cd1e87	[ci] Run slow tests every day	2020-02-24 19:54:47 -05:00
Julien Chaumond	4fc63151af	[ci] Attempt to fix #2844	2020-02-24 19:51:34 -05:00
Lysandre	22b2b5790e	Documentation v2.5.0	2020-02-19 11:53:30 -05:00
Sam Shleifer	0ed630f139	Attempt to increase timeout for circleci slow tests (#2844 )	2020-02-13 09:11:03 -05:00
Morgan Funtowicz	6aa7973aec	Fix circleci cuInit error on Tensorflow >= 2.1.0. Tensorflow 2.1.0 introduce a new dependency model where pip install tensorflow would install tf with GPU support. Before it would just install with CPU support, thus CircleCI is looking for NVidia driver version at initialization of the tensorflow related tests but fails as their is no NVidia Driver running. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>	2020-02-10 13:24:37 +01:00
Lysandre	0aa40e9569	v2.4.0 documentation	2020-01-31 09:55:34 -05:00
alberduris	81d6841b4b	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
alberduris	dd4df80f0b	Moved the encoded_prompts to correct device	2020-01-06 15:11:12 +01:00
Aymeric Augustin	0ffc8eaf53	Enforce target version for black. This should stabilize formatting.	2020-01-05 12:52:14 -05:00
Aymeric Augustin	10724a8123	Run the slow tests every Monday morning.	2019-12-24 09:09:43 +01:00
Aymeric Augustin	8a6881822a	Run some tests on Python 3.7. This will improve version coverage.	2019-12-23 21:06:23 +01:00
Aymeric Augustin	76a1417f2a	Include all optional dependencies in extras. Take advantage of this to simplify the Circle CI configuration. Don't bother with tensorboardX: it's a fallback for PyTorch < 1.1.0.	2019-12-23 19:14:31 +01:00
Aymeric Augustin	23dad8447c	Install deps from setup.py for building docs. requirements.txt isn't up to date.	2019-12-23 17:06:32 +01:00
Thomas Wolf	ba2378ced5	Merge pull request #2264 from upura/fix-doclink Fix doc link in README	2019-12-23 12:31:00 +01:00
Aymeric Augustin	0dddc1494d	Remove py3 marker.	2019-12-22 18:38:56 +01:00
Aymeric Augustin	6be7cdda66	Move source code inside a src subdirectory. This prevents transformers from being importable simply because the CWD is the root of the git repository, while not being importable from other directories. That led to inconsistent behavior, especially in examples. Once you fetch this commit, in your dev environment, you must run: $ pip uninstall transformers $ pip install -e .	2019-12-22 14:15:13 +01:00
Aymeric Augustin	ced0a94204	Switch test files to the standard test_*.py scheme.	2019-12-22 14:15:13 +01:00
Aymeric Augustin	067395d5c5	Move tests outside of library.	2019-12-22 13:47:17 +01:00
Aymeric Augustin	c11b3e2926	Sort imports for optional third-party libraries. These libraries aren't always installed in the virtual environment where isort is running. Declaring them properly avoids mixing these third-party imports with local imports.	2019-12-22 11:19:13 +01:00
Aymeric Augustin	577a03664d	Enforce flake8 in CI.	2019-12-22 11:00:04 +01:00
Aymeric Augustin	9e80fc7b2f	Enforce isort in CI. We need https://github.com/timothycrosley/isort/pull/1000 but there's no release with this fix yet, so we'll install from GitHub.	2019-12-22 10:59:00 +01:00
upura	9d00f78f16	fix doc link	2019-12-22 16:07:05 +09:00
Aymeric Augustin	6e5291a915	Enforce black in CI.	2019-12-21 17:53:18 +01:00
Aymeric Augustin	343c094f21	Run examples separately from tests. This optimizes the total run time of the Circle CI test suite.	2019-12-21 08:43:19 +01:00
Aymeric Augustin	80caf79d07	Prevent excessive parallelism in PyTorch. We're already using as many processes in parallel as we have CPU cores. Furthermore, the number of core may be incorrectly calculated as 36 (we've seen this in pytest-xdist) which make compound the problem. PyTorch performance craters without this.	2019-12-21 08:43:19 +01:00
Aymeric Augustin	bb3bfa2d29	Distribute tests from the same file to the same worker. This should prevent two issues: - hitting API rate limits for tests that hit the HF API - multiplying the cost of expensive test setups	2019-12-21 08:43:19 +01:00
Aymeric Augustin	29cbab98f0	Parallelize tests on Circle CI. Set the number of CPUs manually based on the Circle CI resource class, or else we're getting 36 CPUs, which is far too much (perhaps that's the underlying hardware and not what Circle CI allocates to us). Don't parallelize the custom tokenizers tests because they take less than one second to run and parallelization actually makes them slower.	2019-12-21 08:43:19 +01:00
thomwolf	15dda5ea32	remove python 2 tests for circle-ci cc @aaugustin @julien-c @LysandreJik	2019-12-20 13:20:41 +01:00

1 2 3

102 Commits