dc.contributor.author | Krutto, Annika | |
dc.contributor.author | Haugdahl Nost, Therese | |
dc.contributor.author | Thoresen, Magne | |
dc.date.accessioned | 2024-10-14T11:10:53Z | |
dc.date.available | 2024-10-14T11:10:53Z | |
dc.date.issued | 2024-05-29 | |
dc.description.abstract | This article addresses the limitations of existing statistical models in analyzing and interpreting highly skewed miRNA-seq raw read count data that can range from zero to millions. A heavy-tailed model using discrete stable distributions is proposed as a novel approach to better capture the heterogeneity and extreme values commonly observed in miRNA-seq data. Additionally, the parameters of the discrete stable distribution are proposed as an alternative target for differential expression analysis. An R package for computing and estimating the discrete stable distribution is provided. The proposed model is applied to miRNA-seq raw counts from the Norwegian Women and Cancer Study (NOWAC) and the Cancer Genome Atlas (TCGA) databases. The goodness-of-fit is compared with the popular Poisson and negative binomial distributions, and the discrete stable distributions are found to give a better fit for both datasets. In conclusion, the use of discrete stable distributions is shown to potentially lead to more accurate modeling of the underlying biological processes. | en_US |
dc.identifier.citation | Krutto A, Haugdahl Nost, Thoresen. A heavy-tailed model for analyzing miRNA-seq raw read counts. Statistical Applications in Genetics and Molecular Biology. 2024;23(1) | en_US |
dc.identifier.cristinID | FRIDAID 2276855 | |
dc.identifier.doi | 10.1515/sagmb-2023-0016 | |
dc.identifier.issn | 1544-6115 | |
dc.identifier.uri | https://hdl.handle.net/10037/35225 | |
dc.language.iso | eng | en_US |
dc.publisher | De Gruyter | en_US |
dc.relation.journal | Statistical Applications in Genetics and Molecular Biology | |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/801133/Norway/SCIENTIA-FELLOWS II: International Postdoctoral Fellowship Programme/SCIENTIA-FELLOWS II/ | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2024 The Author(s) | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0 | en_US |
dc.rights | Attribution 4.0 International (CC BY 4.0) | en_US |
dc.title | A heavy-tailed model for analyzing miRNA-seq raw read counts | en_US |
dc.type.version | publishedVersion | en_US |
dc.type | Journal article | en_US |
dc.type | Tidsskriftartikkel | en_US |
dc.type | Peer reviewed | en_US |