Digital repository of Slovenian research organisations

Show document
A+ | A- | Help | SLO | ENG

Title:Knots and $\theta$-curves identification in polymeric chains and native proteins using neural networks
Authors:ID da Silva, Fernando Bruno (Author)
ID Gabrovšek, Boštjan (Author)
ID Korpacz, Marta (Author)
ID Luczkiewicz, Kamil (Author)
ID Niewieczerzal, Szymon (Author)
ID Sikora, Maciej (Author)
ID Sulkowska, Joanna I. (Author)
Files:.pdf PDF - Presentation file, download (3,61 MB)
MD5: 79038CF745FF655DA51C1A5503B257E3
 
URL URL - Source URL, visit https://pubs.acs.org/doi/10.1021/acs.macromol.3c02479
 
Language:English
Typology:1.01 - Original Scientific Article
Organization:Logo IMFM - Institute of Mathematics, Physics, and Mechanics
Abstract:Entanglement in proteins is a fascinating structural motif that is neither easy to detect via traditional methods nor fully understood. Recent advancements in AI-driven models have predicted that millions of proteins could potentially have a nontrivial topology. Herein, we have shown that long short-term memory (LSTM)-based neural networks (NN) architecture can be applied to detect, classify, and predict entanglement not only in closed polymeric chains but also in polymers and protein-like structures with open knots, actual protein configurations, and also $\theta$-curves motifs. The analysis revealed that the LSTM model can predict classes (up to the $6_1$ knot) accurately for closed knots and open polymeric chains, resembling real proteins. In the case of open knots formed by protein-like structures, the model displays robust prediction capabilities with an accuracy of 99%. Moreover, the LSTM model with proper features, tested on hundreds of thousands of knotted and unknotted protein structures with different architectures predicted by AlphaFold 2, can distinguish between the trivial and nontrivial topology of the native state of the protein with an accuracy of 93%.
Keywords:machine learning, topology, protein databases, entanglements, open knots, closed knots
Publication status:Published
Publication version:Version of Record
Publication date:01.05.2024
Year of publishing:2024
Number of pages:str. 4599-4608
Numbering:Vol. 57, iss. 9
PID:20.500.12556/DiRROS-20578 New window
UDC:004.85:004.725.4
ISSN on article:0024-9297
DOI:10.1021/acs.macromol.3c02479 New window
COBISS.SI-ID:194735875 New window
Publication date in DiRROS:23.10.2024
Views:126
Downloads:396
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:Macromolecules
Shortened title:Macromolecules
Publisher:American Chemical Society
ISSN:0024-9297
COBISS.SI-ID:25886464 New window

Document is financed by a project

Funder:Other - Other funder or multiple funders
Funding programme:NCN - National Science Centre, Poland
Project number:2022/47/B/NZ1/03480

Funder:Other - Other funder or multiple funders
Funding programme:NCN - National Science Centre, Poland
Project number:2021/43/I/NZ1/03341

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:N1-0278
Name:Biološka koda vozlov - identifikacija vzorcev vozlanja v biomolekulah z uporabo umetne inteligence

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Keywords:strojno učenje, topologija, proteinska baza podatkov, zavozlanost, odprti vozli, sklenjeni vozli


Back