<?xml version="1.0"?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/"><dc:title>Grammatical error correction of Slovenian school essays using large language models</dc:title><dc:creator>Klemen,	Matej	(Avtor)
	</dc:creator><dc:creator>Božič,	Martin	(Avtor)
	</dc:creator><dc:creator>Arhar Holdt,	Špela	(Avtor)
	</dc:creator><dc:creator>Robnik Šikonja,	Marko	(Avtor)
	</dc:creator><dc:subject>large language models</dc:subject><dc:subject>grammatical error correction</dc:subject><dc:subject>educational domain</dc:subject><dc:subject>synthetic data construction</dc:subject><dc:description>Grammatical error correction (GEC) is the task of automatically detecting and correcting grammatical errors in text. Large language models have enabled the development of accurate automated methods for detecting and correcting certain types of errors. In the educational domain, the aim of GEC is to aid teachers in correcting student errors. Excessive paraphrasing is a property of Generative Pre-trained Transformer-based models and is undesirable in the language education context. To avoid this, we develop multiple Slovenian models for correcting errors in spelling, word case (capitalization), word form, and word order. We describe the training data construction, training process, and model evaluation approach using the Šolar-Eval 1.0 corpus of school essays authored by primary and secondary school students. Our quantitative evaluation shows that the developed models have reasonably high accuracy levels, and our qualitative evaluation highlights the strengths and weaknesses of the models and the evaluation process. The analysis reveals multiple challenges and promising future directions for improving both model development and the evaluation process.</dc:description><dc:date>2025</dc:date><dc:date>2025-12-01 09:40:22</dc:date><dc:type>Neznano</dc:type><dc:identifier>24472</dc:identifier><dc:identifier>UDK: 371.68</dc:identifier><dc:identifier>ISSN pri članku: 0038-0474</dc:identifier><dc:identifier>DOI: 10.63384/sptB53z793a</dc:identifier><dc:identifier>COBISS_ID: 259208195</dc:identifier><dc:language>sl</dc:language></metadata>
