A collection of resources for evaluation of LLM capabilities in the Estonian language.
AI & ML interests
Natural Language Processing
Recent Activity
View all activity
Papers
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian
Organization Card
We are the research group of natural language processing at the Institute of Computer Science, University of Tartu. Our areas of focus include machine translation, speech synthesis, NLP for Estonian and others.
Llama-2-based LLMs fine-tuned for grammatical error correction. This collection also includes AEG (Artificial Error Generation) models.
-
To Err Is Human, but Llamas Can Learn It Too
Paper • 2403.05493 • Published • 6 -
tartuNLP/Llamma-2-7b-ukr-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 2 -
tartuNLP/Llammas-base-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 2 • 3 -
tartuNLP/leo-hessianai-7b-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 5
A collection of resources for evaluation of LLM capabilities in the Estonian language.
Llama-2-based LLMs fine-tuned for grammatical error correction. This collection also includes AEG (Artificial Error Generation) models.
-
To Err Is Human, but Llamas Can Learn It Too
Paper • 2403.05493 • Published • 6 -
tartuNLP/Llamma-2-7b-ukr-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 2 -
tartuNLP/Llammas-base-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 2 • 3 -
tartuNLP/leo-hessianai-7b-p1-llama-errors-p2-GEC
Text Generation • 7B • Updated • 5
models
72
tartuNLP/Llamma-2-7b-ukr-AEG
Text Generation
•
7B
•
Updated
•
1
tartuNLP/leo-hessianai-7b-p1-llama-errors-p2-GEC
Text Generation
•
7B
•
Updated
•
5
tartuNLP/Llama-2-7b-Ukrainian
Text Generation
•
7B
•
Updated
•
6
•
2
tartuNLP/Llamma-2-7b-ukr-p1-llama-errors-p2-GEC
Text Generation
•
7B
•
Updated
•
2
tartuNLP/Llammas-base-AEG
Text Generation
•
7B
•
Updated
tartuNLP/Llammas-base-p1-GPT-4o-human-error-explain-from-pseudo-m2
Text Generation
•
7B
•
Updated
•
1
tartuNLP/Llammas-base-p1-GPT-4o-human-error-pseudo-m2
Text Generation
•
7B
•
Updated
•
1
tartuNLP/Llammas-base-p1-GPT-4o-human-error-mix-paragraph-GEC
Text Generation
•
7B
•
Updated
•
13k
tartuNLP/Llammas-base-p1-llama-errors-p2-GEC
Text Generation
•
7B
•
Updated
•
2
•
3
tartuNLP/Llammas-translate
Text Generation
•
7B
•
Updated
•
4
•
1
datasets
35
tartuNLP/Estonian_Subjectivity
Viewer
•
Updated
•
1k
•
847
tartuNLP/finepdfs-et
Viewer
•
Updated
•
554k
•
86
tartuNLP/finetranslations-et
Viewer
•
Updated
•
10M
•
360
tartuNLP/fineweb-2-et
Viewer
•
Updated
•
9.65M
•
47
tartuNLP/SynEst_Parallel
Updated
•
1
tartuNLP/ifeval_et
Viewer
•
Updated
•
541
•
17
tartuNLP/truthfulqa_multiple_choice
Viewer
•
Updated
•
817
•
6
tartuNLP/lumiopen-truthfulqa_et_multiple_choice
Viewer
•
Updated
•
817
•
2
tartuNLP/ifeval_en
Viewer
•
Updated
•
541
•
4
tartuNLP/smugri4-data
Updated
•
36
•
1