1. NutriGreen image dataset : a collection of annotated nutrition, organic, and vegan food productsJan Drole, Igor Pravst, Tome Eftimov, Barbara Koroušić-Seljak, 2024, original scientific article Abstract: In this research, we introduce the NutriGreen dataset, which is a collection of images representing branded food products aimed for training segmentation models for detecting various labels on food packaging. Each image in the dataset comes with three distinct labels: one indicating its nutritional quality using the Nutri-Score, another denoting whether it is vegan or vegetarian origin with the V-label, and a third displaying the EU organic certification (BIO) logo. Published in DiRROS: 23.04.2024; Views: 63; Downloads: 27 Full text (2,84 MB) |
2. |
3. PS-AAS : portfolio selection for automated algorithm selection in black-box optimizationAna Kostovska, Gjorgjina Cenikj, Diederick Vermetten, Anja Janković, Ana Nikolikj, Urban Škvorc, Peter Korošec, Carola Doerr, Tome Eftimov, 2023, published scientific conference contribution Keywords: automated algorithm selection, portfolio selection, black box optimization Published in DiRROS: 11.12.2023; Views: 329; Downloads: 132 Full text (1,90 MB) This document has many files! More... |
4. Sensitivity analysis of RF+clust for leave-one-problem-out performance predictionAna Nikolikj, Michal Pluhacek, Carola Doerr, Peter Korošec, Tome Eftimov, 2023, published scientific conference contribution Keywords: automated performance prediction, autoML, single-objective black-box optimization, zero-shot learning Published in DiRROS: 13.11.2023; Views: 322; Downloads: 197 Full text (4,94 MB) This document has many files! More... |
5. MsGEN : measuring generalization of nutrient value prediction across different recipe datasetsGordana Ispirova, Tome Eftimov, Sašo Džeroski, Barbara Koroušić-Seljak, 2023, original scientific article Abstract: In this study, we estimate the generalization of the performance of previously proposed predictive models for nutrient value prediction across different recipe datasets. For this purpose, we introduce a quantitative indicator that determines the level of generalization of using the developed predictive model for new unseen data not presented in the training process. On a predefined corpus of recipe embeddings from six publicly available recipe datasets (i.e., projecting them in the same meta-feature vector space), we train predictive models on one of the six recipe datasets and test the models on the rest of the datasets. In parallel, we define and calculate generalizability indexes which are numbers that indicate how generalizable a predictive model is i.e., how well will a predictive model learned on one dataset perform on another one not involved in the training. The evaluation results prove the validity of these indexes – their relation with the accuracy of the predictions. Further, we define three sampling techniques for selecting representative data instances that will cover all parts from the feature space uniformly (involving data from all datasets) and further will improve the generalization of a predictive model. We train predictive models with these generalized datasets and test them on instances from the six recipe datasets that are not selected and included in the generalized datasets. The results from the evaluation of these predictive models show improvement compared to the results from the predictive models trained on one recipe dataset and tested on the others separately. Keywords: ML pipeline, predictive modeling, nutrient prediction, recipe datasets Published in DiRROS: 25.09.2023; Views: 385; Downloads: 185 Full text (3,27 MB) This document has many files! More... |
6. Algorithm instance footprint : separating easily solvable and challenging problem instancesAna Nikolikj, Sašo Džeroski, Mario Andrés Muñoz, Carola Doerr, Peter Korošec, Tome Eftimov, 2023, published scientific conference contribution Keywords: black-box optimization, algorithms, problem instances, machine learning Published in DiRROS: 15.09.2023; Views: 283; Downloads: 191 Full text (2,03 MB) This document has many files! More... |
7. Assessing the generalizability of a performance predictive modelAna Nikolikj, Gjorgjina Cenikj, Gordana Ispirova, Diederick Vermetten, Ryan Dieter Lang, Andries Petrus Engelbrecht, Carola Doerr, Peter Korošec, Tome Eftimov, 2023, published scientific conference contribution Keywords: algorithms, predictive models, machine learning Published in DiRROS: 15.09.2023; Views: 301; Downloads: 202 Full text (935,67 KB) This document has many files! More... |
8. |
9. DynamoRep : trajectory-based population dynamics for classification of black-box optimization problemsGjorgjina Cenikj, Gašper Petelin, Carola Doerr, Peter Korošec, Tome Eftimov, 2023, published scientific conference contribution Keywords: black-box single-objective optimization, optimization problem classification, problem representation, meta-learning Published in DiRROS: 30.08.2023; Views: 332; Downloads: 226 Full text (650,13 KB) This document has many files! More... |
10. FooDis : a food-disease relation mining pipelineGjorgjina Cenikj, Tome Eftimov, Barbara Koroušić-Seljak, 2023, original scientific article Abstract: Nowadays, it is really important and crucial to follow the new biomedical knowledge that is presented in scientific literature. To this end, Information Extraction pipelines can help to automatically extract meaningful relations from textual data that further require additional checks by domain experts. In the last two decades, a lot of work has been performed for extracting relations between phenotype and health concepts, however, the relations with food entities which are one of the most important environmental concepts have never been explored. In this study, we propose FooDis, a novel Information Extraction pipeline that employs state-of-the-art approaches in Natural Language Processing to mine abstracts of biomedical scientific papers and automatically suggests potential cause or treat relations between food and disease entities in different existing semantic resources. A comparison with already known relations indicates that the relations predicted by our pipeline match for 90% of the food-disease pairs that are common in our results and the NutriChem database, and 93% of the common pairs in the DietRx platform. The comparison also shows that the FooDis pipeline can suggest relations with high precision. The FooDis pipeline can be further used to dynamically discover new relations between food and diseases that should be checked by domain experts and further used to populate some of the existing resources used by NutriChem and DietRx. Keywords: text mining, relation extraction, named entity recognition, named entity linking, food-disease relations Published in DiRROS: 25.05.2023; Views: 361; Downloads: 169 Full text (1,11 MB) This document has many files! More... |