mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-29 17:22:25 +06:00
![]() * up * up * test * logits ok * up * up * few fixes * conversion script * up * nits * nits * update * nuke * more updates * nites * fix many issues * nit * scatter * nit * nuke megablocks * nits * fix conversion script * nit * remove * nits * nit * update * oupsssss * change * nits device * nits * fixup * update * merge * add copied from * fix the copy mentions * update tests * more fixes * nits * conversion script * add parts of the readme * Update tests/models/mixtral/test_modeling_mixtral.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * new test + conversion script * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Apply suggestions from code review * fix * fix copies * fix copies * ooops * fix config * Apply suggestions from code review * fix nits * nit * add copies * add batched tests * docs * fix flash attention * let's add more verbose * add correct outputs * support router ouptus * ignore copies where needed * fix * cat list if list is given for now * nits * Update docs/source/en/model_doc/mixtral.md * finish router refactoring * fix forward * fix expected values * nits * fixup * fix * fix bug * fix * fix dtype mismatch * fix * grrr grrr I support item assignment * fix CI * docs * fixup * remove some copied form * fix weird diff * skip doctest fast on the config and modeling * mark that is supports flash attention in the doc * update * Update src/transformers/models/mixtral/modeling_mixtral.py Co-authored-by: Lysandre Debut <hi@lysand.re> * Update docs/source/en/model_doc/mixtral.md Co-authored-by: Lysandre Debut <hi@lysand.re> * revert router logits config issue * update doc accordingly * Update src/transformers/models/mixtral/convert_mixtral_weights_to_hf.py * nits * use torch testing asssert close * fixup * doc nits --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Lysandre Debut <hi@lysand.re> |
||
---|---|---|
.. | ||
test_module | ||
tf_ops | ||
add_pipeline_model_mapping_to_test.py | ||
check_build.py | ||
check_config_attributes.py | ||
check_config_docstrings.py | ||
check_copies.py | ||
check_doc_toc.py | ||
check_docstrings.py | ||
check_doctest_list.py | ||
check_dummies.py | ||
check_inits.py | ||
check_model_tester.py | ||
check_repo.py | ||
check_self_hosted_runner.py | ||
check_table.py | ||
check_task_guides.py | ||
check_tf_ops.py | ||
create_dummy_models.py | ||
custom_init_isort.py | ||
download_glue_data.py | ||
extract_warnings.py | ||
get_ci_error_statistics.py | ||
get_github_job_time.py | ||
get_modified_files.py | ||
get_previous_daily_ci.py | ||
get_test_info.py | ||
not_doctested.txt | ||
notification_service_doc_tests.py | ||
notification_service.py | ||
past_ci_versions.py | ||
print_env.py | ||
release.py | ||
slow_documentation_tests.txt | ||
sort_auto_mappings.py | ||
tests_fetcher.py | ||
update_metadata.py | ||
update_tiny_models.py |