transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Jinan Zhou a91020aed0 Add TimesFM Time Series Forecasting Model (#34082) Co-authored-by: Kashif Rasul, Rajat Sen, Steven Liu, Cyril Vallez		2025-04-16 15:00:53 +02:00
test_module	AutoImageProcessor (#20111)	2022-11-08 19:54:41 +00:00
tf_ops	Check TF ops for ONNX compliance (#10025)	2021-02-15 07:55:10 -05:00
add_pipeline_model_mapping_to_test.py	update ruff version (#30932)	2024-05-22 06:40:15 +02:00
check_bad_commit.py	Fix `utils/check_bad_commit.py` (#37272)	2025-04-04 12:18:20 +02:00
check_build.py	Use `deformable_detr` kernel from the Hub (#36853)	2025-03-21 13:08:47 +01:00
check_config_attributes.py	Multiple llama4 fixe (#37353)	2025-04-08 11:14:49 +02:00
check_config_docstrings.py	Add Granite Speech Support (#36801)	2025-04-11 18:52:00 +02:00
check_copies.py	chore: fix typos in utils module (#36668)	2025-03-13 15:12:44 +00:00
check_doc_toc.py	update ruff version (#30932)	2024-05-22 06:40:15 +02:00
check_docstrings.py	Add MLCD model (#36182)	2025-04-15 11:33:09 +01:00
check_doctest_list.py	update ruff version (#30932)	2024-05-22 06:40:15 +02:00
check_dummies.py	Add llama4 (#37307)	2025-04-05 22:02:22 +02:00
check_inits.py	Simplify soft dependencies and update the dummy-creation process (#36827)	2025-04-11 11:08:36 +02:00
check_model_tester.py	Add a new script to check model testers' config (#22063)	2023-03-13 19:11:19 +01:00
check_modular_conversion.py	Fix wrong argparse type in modular checker script (#37472)	2025-04-14 16:11:29 +01:00
check_repo.py	Add TimesFM Time Series Forecasting Model (#34082)	2025-04-16 15:00:53 +02:00
check_self_hosted_runner.py	Tiny fix for `check_self_hosted_runner.py` (#24052)	2023-06-06 18:17:41 +02:00
check_tf_ops.py	Check TF ops for ONNX compliance (#10025)	2021-02-15 07:55:10 -05:00
create_dependency_mapping.py	Modular Conversion --fix_and_overwrite on Windows (#36583)	2025-03-06 13:12:30 +00:00
create_dummy_models.py	CI: fix `efficientnet` pipeline timeout and prevent future similar issues due to large image size (#33123)	2024-08-27 11:58:27 +01:00
custom_init_isort.py	chore: fix typos in utils module (#36668)	2025-03-13 15:12:44 +00:00
deprecate_models.py	chore: fix typos in utils module (#36668)	2025-03-13 15:12:44 +00:00
download_glue_data.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
extract_warnings.py	update github actions packages' version to suppress warnings (#30249)	2024-04-15 15:08:09 +02:00
fetch_hub_objects_for_ci.py	Try to avoid/reduce some remaining CI job failures (#37202)	2025-04-02 14:39:57 +02:00
get_ci_error_statistics.py	Add artifact name in job step to maintain job / artifact correspondence (#28682)	2024-01-31 15:58:17 +01:00
get_github_job_time.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
get_modified_files.py	exclude deleted files in the fixup script (#21436)	2023-02-03 12:57:02 -05:00
get_previous_daily_ci.py	Ping team members for new failed tests in daily CI (#34171)	2024-10-17 16:11:52 +02:00
get_test_info.py	CI: fix `efficientnet` pipeline timeout and prevent future similar issues due to large image size (#33123)	2024-08-27 11:58:27 +01:00
important_models.txt	ENH: [`CI`] Add new workflow to run slow tests of important models on push main if they are modified (#29235)	2024-04-12 10:01:28 +02:00
models_to_deprecate.py	update ruff version (#30932)	2024-05-22 06:40:15 +02:00
modular_model_converter.py	Introduce modular files for speech models (#35902)	2025-04-04 11:46:27 +02:00
not_doctested.txt	[agents] remove agents 🧹 (#37368)	2025-04-11 18:42:37 +01:00
notification_service_doc_tests.py	Refactor doctest (#30210)	2024-04-15 13:20:36 +02:00
notification_service_quantization.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
notification_service.py	Fix new failure reports not including anything other than `tests/models/` (#37415)	2025-04-10 14:47:23 +02:00
past_ci_versions.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
patch_helper.py	[`Patch helper`] update to not have to checkout main (#34006)	2024-10-09 09:21:46 +02:00
pr_slow_ci_models.py	notify new model merged to `main` (#36375)	2025-02-24 17:53:18 +01:00
print_env.py	Print more library versions in CI (#17384)	2022-06-02 10:24:16 +02:00
process_bad_commit_report.py	Tiny update after #34383 (#34404)	2024-10-28 12:01:05 +01:00
process_circleci_workflow_test_reports.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
process_test_artifacts.py	fix the parallel number of CI nodes when it is smaller than number of tests (#33276)	2024-09-03 16:53:21 +02:00
release.py	Remove research projects (#36645)	2025-03-11 13:47:38 +00:00
set_cuda_devices_for_ci.py	Fix Cohere CI (#31263)	2024-06-10 15:16:58 +02:00
slow_documentation_tests.txt	Update CodeLlama references (#30218)	2024-05-09 22:57:52 +02:00
sort_auto_mappings.py	update ruff version (#30932)	2024-05-22 06:40:15 +02:00
split_doctest_jobs.py	chore: fix typos in utils module (#36668)	2025-03-13 15:12:44 +00:00
split_model_tests.py	consistent job / pytest report / artifact name correspondence (#30392)	2024-04-24 22:32:42 +02:00
tests_fetcher.py	Fix the test fetcher (#37452)	2025-04-11 12:19:27 +02:00
update_metadata.py	Update ruff to `0.11.2` (#36962)	2025-03-25 16:00:11 +01:00
update_tiny_models.py	Mention model_info.id instead of model_info.modelId (#32106)	2024-07-22 14:14:47 +01:00