Spark-SPELL: Low-latency query-based search for gene expression compendia on cluster computers
Permanent lenke
https://hdl.handle.net/10037/6555Åpne
(PDF)
Source code (Ukjent)
Resultater (Ukjent)
Dato
2014-06-01Type
Master thesisMastergradsoppgave
Forfatter
Raknes, Inge AlexanderSammendrag
Exploratory analyses are vital to fully realize the potential for scientific discoveries in large-scale biomedical data compendia. Specifically, most biomedical data analyses require a human expert to interactively explore the data to find novel hypotheses or conclusions. However, recent developments in biotechnology instruments are generating Tera-scale datasets. No interactive biomedical data analysis systems scale to such large datasets. We present the design, implementation and optimization of the SPELL biomedical search algorithm on the Spark framework. We demonstrate the scalability and interactive performance of our Spark-SPELL system. In addition, we demonstrate the performance improvements of our optimizations to the SPELL algorithm and the Spark framework.
Forlag
UiT Norges arktiske universitetUiT The Arctic University of Norway
Metadata
Vis full innførselSamlinger
Copyright 2014 The Author(s)
Følgende lisensfil er knyttet til denne innførselen: