[model_cards] Add language metadata to existing model cards

This will enable filtering on language (amongst other tags) on the website

cc @loretoparisi, @stefan-it, @HenrykBorzymowski, @marma
This commit is contained in:
Julien Chaumond 2020-02-10 17:42:42 -05:00
parent ba498eac38
commit 95bac8dabb
13 changed files with 49 additions and 1 deletions
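As a rough illustration of what this metadata enables, here is a minimal sketch of parsing the YAML front matter added in these diffs and filtering cards by language. This is not the website's implementation; the `model_cards/**/README.md` layout and the `read_front_matter` helper are assumptions made for the example.

```python
import glob
import yaml  # requires PyYAML

def read_front_matter(path):
    """Return the YAML block between the leading '---' fences, or {} if absent."""
    with open(path, encoding="utf-8") as f:
        text = f.read()
    if not text.startswith("---"):
        return {}
    parts = text.split("---", 2)
    if len(parts) < 3:
        return {}
    return yaml.safe_load(parts[1]) or {}

# Hypothetical layout: one README.md per model under model_cards/.
cards = {path: read_front_matter(path)
         for path in glob.glob("model_cards/**/README.md", recursive=True)}

# Filter on the newly added `language` key, e.g. all Italian cards.
italian_cards = [path for path, meta in cards.items()
                 if meta.get("language") == "italian"]
print(italian_cards)
```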


@ -1,3 +1,7 @@
---
language: swedish
---
# Swedish BERT Models
The National Library of Sweden / KBLab releases three pretrained language models based on BERT and ALBERT. The models are trained on approximately 15-20 GB of text (200M sentences, 3000M tokens) from various sources (books, news, government publications, Swedish Wikipedia and internet forums), with the aim of providing a representative BERT model for Swedish text. A more complete description will be published later.
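Not part of the card diff, but as a usage sketch: loading one of these Swedish models with `transformers`. The hub identifier `KB/bert-base-swedish-cased` is an assumption (the excerpt above does not list the exact names); substitute the identifiers from the published cards.

```python
from transformers import AutoModel, AutoTokenizer

# Assumed identifier; check the card / KBLab's hub page for the exact names.
model_id = "KB/bert-base-swedish-cased"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

inputs = tokenizer("Kungliga biblioteket ligger i Stockholm.", return_tensors="pt")
outputs = model(**inputs)
# Recent transformers versions return an output object with named fields.
print(outputs.last_hidden_state.shape)
```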


@ -1,3 +1,7 @@
---
language: swedish
---
# Swedish BERT Models
The National Library of Sweden / KBLab releases three pretrained language models based on BERT and ALBERT. The models are trained on approximately 15-20 GB of text (200M sentences, 3000M tokens) from various sources (books, news, government publications, Swedish Wikipedia and internet forums), with the aim of providing a representative BERT model for Swedish text. A more complete description will be published later.


@ -1,3 +1,7 @@
---
language: swedish
---
# Swedish BERT Models
The National Library of Sweden / KBLab releases three pretrained language models based on BERT and ALBERT. The models are trained on approximately 15-20 GB of text (200M sentences, 3000M tokens) from various sources (books, news, government publications, Swedish Wikipedia and internet forums), with the aim of providing a representative BERT model for Swedish text. A more complete description will be published later.


@ -1,3 +1,7 @@
---
language: italian
---
# UmBERTo Commoncrawl Cased
[UmBERTo](https://github.com/musixmatchresearch/umberto) is a RoBERTa-based language model trained on large Italian corpora, using two innovative approaches: SentencePiece and Whole Word Masking. Now available at [huggingface.co/Musixmatch/umberto-commoncrawl-cased-v1](https://huggingface.co/Musixmatch/umberto-commoncrawl-cased-v1).
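A short usage sketch for this card: masked-word prediction with the `fill-mask` pipeline. The model identifier comes from the link above; the example sentence and the rest are illustrative only.

```python
from transformers import pipeline

# Identifier taken from the card's link; UmBERTo is RoBERTa-based, so the
# generic fill-mask pipeline applies.
fill_mask = pipeline("fill-mask", model="Musixmatch/umberto-commoncrawl-cased-v1")

sentence = f"Umberto Eco è stato un grande {fill_mask.tokenizer.mask_token}."
for prediction in fill_mask(sentence):
    print(prediction["token_str"], prediction["score"])
```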


@ -1,3 +1,7 @@
---
language: italian
---
# UmBERTo Wikipedia Uncased
[UmBERTo](https://github.com/musixmatchresearch/umberto) is a RoBERTa-based language model trained on large Italian corpora, using two innovative approaches: SentencePiece and Whole Word Masking. Now available at [huggingface.co/Musixmatch/umberto-commoncrawl-cased-v1](https://huggingface.co/Musixmatch/umberto-commoncrawl-cased-v1).


@ -1,5 +1,5 @@
---
thumbnail: https://github.com/JetRunner/BERT-of-Theseus/blob/master/bert-of-theseus.png?raw=true
thumbnail: https://raw.githubusercontent.com/JetRunner/BERT-of-Theseus/master/bert-of-theseus.png
---
# BERT-of-Theseus


@ -1,3 +1,7 @@
---
language: german
---
# 🤗 + 📚 dbmdz German BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: german
---
# 🤗 + 📚 dbmdz German BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: italian
---
# 🤗 + 📚 dbmdz BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: italian
---
# 🤗 + 📚 dbmdz BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: italian
---
# 🤗 + 📚 dbmdz BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: italian
---
# 🤗 + 📚 dbmdz BERT models
In this repository the MDZ Digital Library team (dbmdz) at the Bavarian State


@ -1,3 +1,7 @@
---
language: dutch
---
# Multilingual + Dutch SQuAD2.0
This model is the multilingual model provided by the Google research team, fine-tuned on a Dutch Q&A downstream task.
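A usage sketch for this card: extractive question answering with the `question-answering` pipeline. The hub identifier below is an assumption (the excerpt does not name it); use the identifier given on the actual model card.

```python
from transformers import pipeline

# Assumed identifier for the Dutch SQuAD2.0 fine-tune; substitute the id
# listed on the model card if it differs.
qa = pipeline(
    "question-answering",
    model="henryk/bert-base-multilingual-cased-finetuned-dutch-squad2",
)

result = qa(
    question="Waar woon ik?",
    context="Mijn naam is Wolfgang en ik woon in Berlijn.",
)
print(result["answer"], result["score"])
```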