| Title: | Grammatical error correction of Slovenian school essays using large language models |
|---|
| Authors: | ID Klemen, Matej (Author) ID Božič, Martin (Author) ID Arhar Holdt, Špela (Author) ID Robnik Šikonja, Marko (Author) |
| Files: | URL - Source URL, visit https://www.sodobna-pedagogika.net/clanki/03-2025_popravljanje-slovnicnih-napak-v-slovenskih-esejih-z-velikimi-jezikovnimi-modeli/
PDF - Presentation file, download (284,22 KB) MD5: 8FCBC4808B27CF0F4E3E8D990C1369CB
|
|---|
| Language: | English |
|---|
| Typology: | 1.02 - Review Article |
|---|
| Organization: | ZDPDS - Association of Societies of Educational Workers of Slovenia
|
|---|
| Abstract: | Grammatical error correction (GEC) is the task of automatically detecting and correcting grammatical errors in text. Large language models have enabled the development of accurate automated methods for detecting and correcting certain types of errors. In the educational domain, the aim of GEC is to aid teachers in correcting student errors. Excessive paraphrasing is a property of Generative Pre-trained Transformer-based models and is undesirable in the language education context. To avoid this, we develop multiple Slovenian models for correcting errors in spelling, word case (capitalization), word form, and word order. We describe the training data construction, training process, and model evaluation approach using the Šolar-Eval 1.0 corpus of school essays authored by primary and secondary school students. Our quantitative evaluation shows that the developed models have reasonably high accuracy levels, and our qualitative evaluation highlights the strengths and weaknesses of the models and the evaluation process. The analysis reveals multiple challenges and promising future directions for improving both model development and the evaluation process. |
|---|
| Keywords: | large language models, grammatical error correction, educational domain, synthetic data construction |
|---|
| Publication status: | Published |
|---|
| Publication version: | Version of Record |
|---|
| Publication date: | 01.10.2025 |
|---|
| Year of publishing: | 2025 |
|---|
| Number of pages: | str. 162-176 |
|---|
| Numbering: | Letn. 76 = 142, št. 3 |
|---|
| PID: | 20.500.12556/DiRROS-24472  |
|---|
| UDC: | 371.68 |
|---|
| ISSN on article: | 0038-0474 |
|---|
| DOI: | 10.63384/sptB53z793a  |
|---|
| COBISS.SI-ID: | 259208195  |
|---|
| Publication date in DiRROS: | 01.12.2025 |
|---|
| Views: | 61 |
|---|
| Downloads: | 35 |
|---|
| Metadata: |  |
|---|
|
:
|
Copy citation |
|---|
| | | | Share: |  |
|---|
Hover the mouse pointer over a document title to show the abstract or click
on the title to get all document metadata. |