transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-22 22:09:23 +06:00

History

Arthur 9cccb3a838 [`Persimmon`] Add support for persimmon (#26042 ) * intiial commit * updates * nits * update conversion script * update conversion script * use path to load * add tips etc * some modeling logic * modeling update * more nits * nits * normal layer norm * update config and doc * nits * update doc remove unused * update * fix inits and stuff * fixup * revert wrong changes * updates * more nits * add default config values to the configuration file * fixup happy * update * 2 tests left * update readmes * more nits * slow test and more documentation * update readme * fix licences * styling * use fast if possible when saving tokenizer * remove todo * remove tokenization tests * small last nits * Apply suggestions from code review Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * nits to skip the timout doctest * fix integration test * fix test * update eos token * update to allow fast tokenization * styling * fix codeLlama as well for the update post processor * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add more copied from statements * update * doc passes doctest * remove `# final layer norm?` * change docstring prompot * update * Update README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * don't doctest the conversion script as it requires more packages * don't init a model in the config * oups * fix doctest --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>		2023-09-12 11:33:27 +02:00
..
internal	Generate: add missing logits processors docs (#25653 )	2023-08-25 11:56:17 +01:00
main_classes	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
model_doc	[`Persimmon`] Add support for persimmon (#26042 )	2023-09-12 11:33:27 +02:00
tasks	[`Persimmon`] Add support for persimmon (#26042 )	2023-09-12 11:33:27 +02:00
_config.py
_toctree.yml	[`Persimmon`] Add support for persimmon (#26042 )	2023-09-12 11:33:27 +02:00
accelerate.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
add_new_model.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
add_new_pipeline.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
add_tensorflow_model.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
attention.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
autoclass_tutorial.md	Update autoclass_tutorial.md (#25929 )	2023-09-04 11:16:49 +01:00
benchmarks.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
bertology.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
big_models.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
community.md	Update community.md (#25928 )	2023-09-04 11:16:34 +01:00
contributing.md
create_a_model.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
custom_models.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
custom_tools.md	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
debugging.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
fast_tokenizers.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
generation_strategies.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
glossary.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
hpo_train.md	Update RayTune doc link for Hyperparameter tuning (#24422 )	2023-06-22 10:38:01 -04:00
index.md	[`Persimmon`] Add support for persimmon (#26042 )	2023-09-12 11:33:27 +02:00
installation.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
llm_tutorial.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
model_memory_anatomy.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
model_sharing.md	Fix typos (#25936 )	2023-09-04 11:15:12 +01:00
model_summary.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
multilingual.md	Fix typo in example code (#25583 )	2023-08-18 07:58:59 +02:00
notebooks.md
pad_truncation.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
peft.md	[`PEFT`] Peft integration alternative design (#25077 )	2023-08-18 19:08:03 +02:00
perf_hardware.md	🌐 [i18n-KO] Translated `perf_hardware.md` to Korean (#24966 )	2023-07-25 07:44:24 -04:00
perf_infer_cpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_infer_gpu_many.md	[`Docs` / `BetterTransformer` ] Added more details about flash attention + SDPA (#25265 )	2023-08-18 10:32:28 +02:00
perf_infer_gpu_one.md	[`Docs`] More clarifications on BT + FA (#25823 )	2023-08-29 13:52:25 +02:00
perf_infer_special.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_torch_compile.md	Fix rendering for `torch.compile()` docs (#25432 )	2023-08-10 13:25:00 +02:00
perf_train_cpu_many.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_cpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_gpu_many.md	deprecate `sharded_ddp` training argument (#24825 )	2023-07-17 06:57:42 -04:00
perf_train_gpu_one.md	Modify efficient GPU training doc with now-available adamw_bnb_8bit optimizer (#25807 )	2023-08-31 10:55:10 +01:00
perf_train_special.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_tpu_tf.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
perf_train_tpu.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
performance.md	[docs] Performance docs tidy up, part 1 (#23963 )	2023-07-24 08:57:24 -04:00
perplexity.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
philosophy.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
pipeline_tutorial.md	Support loading base64 images in pipelines (#25633 )	2023-08-29 19:24:24 +01:00
pipeline_webserver.md	Suggestions on Pipeline_webserver (#25570 )	2023-08-18 10:17:44 +02:00
pr_checks.md	Document check copies (#25291 )	2023-08-04 14:56:29 +02:00
preprocessing.md	Removal of deprecated vision methods and specify deprecation versions (#24570 )	2023-06-29 15:09:51 +01:00
quicktour.md	[TYPO] fix typo/format in quicktour.md (#25519 )	2023-08-16 08:03:23 +02:00
run_scripts.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
sagemaker.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
serialization.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
task_summary.md	Fix doctest (#25031 )	2023-07-25 22:10:06 +02:00
tasks_explained.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
testing.md	fix wrong path in some doc (#25658 )	2023-08-23 08:34:30 +02:00
tf_xla.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
tflite.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
tokenizer_summary.md	Fix typo: Roberta -> RoBERTa (#25302 )	2023-08-03 14:17:30 -07:00
torchscript.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
training.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
transformers_agents.md	[doc] Always call it Agents for consistency (#25958 )	2023-09-05 12:27:20 +01:00
troubleshooting.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00