Title: | SegMine workflows for semantic microarray data analysis in Orange4WS |
---|
Authors: | ID Podpečan, Vid (Author) ID Lavrač, Nada (Author) ID Mozetič, Igor (Author) ID Kralj Novak, Petra (Author) ID Trajkovski, Igor (Author) ID Langohr, Laura (Author) ID Kulovesi, Kimmo (Author) ID Toivonen, Hannu (Author) ID Petek, Marko (Author) ID Motaln, Helena (Author) ID Gruden, Kristina (Author) |
Files: | PDF - Presentation file, download (3,09 MB) MD5: 1B38989383A996814B87F82BE365B05C
URL - Source URL, visit https://doi.org/10.1186/1471-2105-12-416
|
---|
Language: | English |
---|
Typology: | 1.01 - Original Scientific Article |
---|
Organization: | NIB - National Institute of Biology
|
---|
Abstract: | Background
In experimental data analysis, bioinformatics researchers increasingly rely on tools that enable the composition and reuse of scientific workflows. The utility of current bioinformatics workflow environments can be significantly increased by offering advanced data mining services as workflow components. Such services can support, for instance, knowledge discovery from diverse distributed data and knowledge sources (such as GO, KEGG, PubMed, and experimental databases). Specifically, cutting-edge data analysis approaches, such as semantic data mining, link discovery, and visualization, have not yet been made available to researchers investigating complex biological datasets.
Results
We present a new methodology, SegMine, for semantic analysis of microarray data by exploiting general biological knowledge, and a new workflow environment, Orange4WS, with integrated support for web services in which the SegMine methodology is implemented. The SegMine methodology consists of two main steps. First, the semantic subgroup discovery algorithm is used to construct elaborate rules that identify enriched gene sets. Then, a link discovery service is used for the creation and visualization of new biological hypotheses. The utility of SegMine, implemented as a set of workflows in Orange4WS, is demonstrated in two microarray data analysis applications. In the analysis of senescence in human stem cells, the use of SegMine resulted in three novel research hypotheses that could improve understanding of the underlying mechanisms of senescence and identification of candidate marker genes.
Conclusions
Compared to the available data analysis systems, SegMine offers improved hypothesis generation and data interpretation for bioinformatics in an easy-to-use integrated workflow environment. |
---|
Publication status: | Published |
---|
Publication version: | Version of Record |
---|
Publication date: | 26.10.2011 |
---|
Year of publishing: | 2011 |
---|
Number of pages: | str. 416-1-416-16 |
---|
Numbering: | Vol. 12, no. 416 |
---|
PID: | 20.500.12556/DiRROS-20051 |
---|
UDC: | 004.8 |
---|
ISSN on article: | 1471-2105 |
---|
DOI: | 10.1186/1471-2105-12-416 |
---|
COBISS.SI-ID: | 25208871 |
---|
Publication date in DiRROS: | 05.08.2024 |
---|
Views: | 315 |
---|
Downloads: | 202 |
---|
Metadata: | |
---|
:
|
Copy citation |
---|
| | | Share: | |
---|
Hover the mouse pointer over a document title to show the abstract or click
on the title to get all document metadata. |