impresso-project/wiki_comparable_corpus_en_de_hi_it_ko_zh Viewer • Updated 15 days ago • 69.2k • 29
impresso-project/ner-stacked-bert-multilingual-v1.1.0 Token Classification • 42.1M • Updated 16 days ago • 2.3k • 2
Multilingual Named Entity Recognition 👻 Space • Running • Multilingual Named Entity Recognition in Historical Data
impresso-project/halloween_workshop_ocr_robust_preview Sentence Similarity • 0.3B • Updated 18 days ago • 65
impresso-project/halloween_workshop_ocr_robust_with_lux_preview Sentence Similarity • 0.3B • Updated 18 days ago • 156
impresso-project/OCR-robust-gte-multilingual-base Sentence Similarity • 0.3B • Updated Oct 23, 2025 • 23
impresso-project/histlux-paraphrase-multilingual-mpnet-base-v2 Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 2
impresso-project/histlux-gte-multilingual-base Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 13
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 45
PARAPHRASUS: A Comprehensive Benchmark for Evaluating Paraphrase Detection Models Paper • 2409.12060 • Published Sep 18, 2024