Sentence Similarity
sentence-transformers
Safetensors
xlm-roberta
mteb
feature-extraction
Generated from Trainer
dataset_size:2560000
loss:MultipleNegativesRankingLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use PaDaS-Lab/xlm-roberta-base-msmarco-webfaq with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use PaDaS-Lab/xlm-roberta-base-msmarco-webfaq with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("PaDaS-Lab/xlm-roberta-base-msmarco-webfaq") sentences = [ "ما هي أفضل الفنادق في ايبوهبالقرب من Ipoh Parade Shopping Centre؟", "Bei ORION gibt es eine Sale-Rubrik, in der alle reduzierten Artikel zu finden sind. Wenn du also auf der Suche nach einem Schnäppchen bist, weißt du, an welcher Stelle auf der Webseite du fündig wirst. Der Sale umfasst viele verschiedene Produke. Von Toys bis hin zu Dessous und Drogerieartikel - es spielt keine Rolle, wonach du suchst. Aufgrund der Produktvielfalt ist die Chance, dass du im Sale den passenden Gegenstand findest, groß.", "عادة ما يكون لأصحاب النفوذ الجزئي ما بين 10000 و 100000 متابع.", "المسافرون الموثّقون إلى مدينة ايبوه الذين أقاموا قرب Ipoh Parade Shopping Centre أعطوا أعلى التقييمات لـفندق فايل ، Zone Hotel (Ipoh) وGolden Roof Hotel Ampang Ipoh." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -2383,11 +2383,11 @@ This is a [sentence-transformers](https://www.SBERT.net) model trained. It maps
|
|
| 2383 |
|
| 2384 |
### Model Description
|
| 2385 |
- **Model Type:** Sentence Transformer
|
| 2386 |
-
- **Base model:** [XLM-RoBERTa-base-MSMARCO](https://huggingface.co/
|
| 2387 |
- **Maximum Sequence Length:** 512 tokens
|
| 2388 |
- **Output Dimensionality:** 768 dimensions
|
| 2389 |
- **Similarity Function:** Cosine Similarity
|
| 2390 |
-
- **Training Dataset:** [WebFAQ Retrieval Dataset](https://huggingface.co/datasets/
|
| 2391 |
<!-- - **Language:** Unknown -->
|
| 2392 |
- **License:** MIT
|
| 2393 |
|
|
@@ -2421,7 +2421,7 @@ Then you can load this model and run inference.
|
|
| 2421 |
from sentence_transformers import SentenceTransformer
|
| 2422 |
|
| 2423 |
# Download from the 🤗 Hub
|
| 2424 |
-
model = SentenceTransformer("
|
| 2425 |
# Run inference
|
| 2426 |
sentences = [
|
| 2427 |
'Muss der Deckel der TipBox beim Autoklavieren geöffnet werden?',
|
|
|
|
| 2383 |
|
| 2384 |
### Model Description
|
| 2385 |
- **Model Type:** Sentence Transformer
|
| 2386 |
+
- **Base model:** [XLM-RoBERTa-base-MSMARCO](https://huggingface.co/PaDaS-Lab/xlm-roberta-base-msmarco)
|
| 2387 |
- **Maximum Sequence Length:** 512 tokens
|
| 2388 |
- **Output Dimensionality:** 768 dimensions
|
| 2389 |
- **Similarity Function:** Cosine Similarity
|
| 2390 |
+
- **Training Dataset:** [WebFAQ Retrieval Dataset](https://huggingface.co/datasets/PaDaS-Lab/webfaq-retrieval)
|
| 2391 |
<!-- - **Language:** Unknown -->
|
| 2392 |
- **License:** MIT
|
| 2393 |
|
|
|
|
| 2421 |
from sentence_transformers import SentenceTransformer
|
| 2422 |
|
| 2423 |
# Download from the 🤗 Hub
|
| 2424 |
+
model = SentenceTransformer("PaDaS-Lab/xlm-roberta-base-msmarco-webfaq")
|
| 2425 |
# Run inference
|
| 2426 |
sentences = [
|
| 2427 |
'Muss der Deckel der TipBox beim Autoklavieren geöffnet werden?',
|