Digital repository of Slovenian research organisations

Show document
A+ | A- | Help | SLO | ENG

Title:High entropy alloys database generated with large language model
Authors:ID Chizhevskiy, Vladimir, Institut "Jožef Stefan" (Author)
ID Cvelbar, Uroš, Institut "Jožef Stefan" (Author)
ID Zavašnik, Janez, Institut "Jožef Stefan" (Author)
ID Nominé, Alexandre, Institut "Jožef Stefan" (Author), et al.
Files:URL URL - Source URL, visit https://www.nature.com/articles/s41597-026-06930-z
 
.pdf PDF - Presentation file, download (2,01 MB)
MD5: 5A05EBDAFB4B35107FCA04A3D962E91B
 
Language:English
Typology:1.01 - Original Scientific Article
Organization:Logo IJS - Jožef Stefan Institute
Abstract:High entropy alloys (HEAs) represent a promising area in materials science, but systematic analysis of the extensive literature remains a challenge. In this study, we used Natural Language Processing (NLP) techniques to analyze 4,625 scientific articles from a restricted corpus representing publisher-accessible literature, successfully identifying and characterizing 12,427 of different high entropy alloys. Through prompt engineering and experiments with Large Language Models (LLMs), including mamba-transformer hybrid architectures, we developed a structured database that captures important parameters such as alloy compositions, phase numbers and crystallographic structures. In our analysis, we distinguish between theoretical and experimental studies, considering specific methodological details for each category. For theoretical work, we have systematically documented modeling approaches and key computational parameters, while experimental studies are cataloged with their synthesis methods and critical processing conditions. This database represents a large-scale, automated extraction of HEA research data. The accuracy of the data ranges from 78.7% for HEA phase identification to 94.3% for HEA composition.
Keywords:high-entropy alloys, natural language processing, materials informatics
Publication status:Published
Publication version:Version of Record
Submitted for review:28.08.2025
Article acceptance date:17.02.2026
Publication date:16.04.2026
Publisher:Nature Publishing Group
Year of publishing:2026
Number of pages:str. [1-8]
Numbering:Vol. 13, [article no.] 612
Source:Združeno kraljestvo
PID:20.500.12556/DiRROS-29238 New window
UDC:620.1/.2
ISSN on article:2052-4463
DOI:10.1038/s41597-026-06930-z New window
COBISS.SI-ID:271580163 New window
Copyright:© The Author(s) 2026
Note:Nasl. z nasl. zaslona; Soavtorji iz Slovenije: Uroš Cvelbar, Janez Zavašnik, Alexandre Nominé; Opis vira z dne 13. 3. 2026;
Publication date in DiRROS:30.04.2026
Views:39
Downloads:24
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Scientific data
Publisher:Nature Publishing Group
ISSN:2052-4463
COBISS.SI-ID:523393305 New window

Document is financed by a project

Funder:EC - European Commission
Project number:101046835
Name:A paradigm shift for the future's thermal management devices through radical innovation in new materials and additive manufacturing
Acronym:ThermoDust

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P1-0417-2022
Name:Plazma in kvantne strukture

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J2-4440-2022
Name:Načrtovanje in razvoj DT-procesiranih Fe-Al zlitin s samotvornimi preprekami za prepustnost vodika za najzahtevnejša okolja

Funder:Ministry of Science, Technological Development, and Innovation of the Republic of Serbia
Project number:451-03-136/2025-03/200023

Licences

License:CC BY-NC-ND 4.0, Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
Link:http://creativecommons.org/licenses/by-nc-nd/4.0/
Description:The most restrictive Creative Commons license. This only allows people to download and share the work for no commercial gain and for no other purposes.
Licensing start date:16.04.2026
Applies to:VoR

Secondary language

Language:Slovenian
Keywords:visokokonfuzijske zlitine, obdelava naravnega jezika, informatika materialov


Back