Publications

You can also find my articles on my Google Scholar profile.

Characterizing Bugs in Python and R Data Analytics Programs

Published on arXiv, 2023

This paper presents a comprehensive study of bugs in Python and R data analytics programs, based on Stack Overflow posts, GitHub commits, and GitHub issues.

Recommended citation: Shibbir Ahmed, Mohammad Wardat, Hamid Bagheri, Breno Dantas Cruz, Hridesh Rajan, Characterizing Bugs in Python and R Data Analytics Programs, arXiv:2306.08632 [cs.SE], https://doi.org/10.48550/arXiv.2306.08632 https://arxiv.org/abs/2306.08632

Detecting and correcting misclassified sequences in the large-scale public databases

Published in Bioinformatics, 2020

This paper presents a heuristic method to detect and correct potentially misclassified taxonomic assignments in public databases.

Recommended citation: Hamid Bagheri, Andrew J Severin, Hridesh Rajan, Detecting and correcting misclassified sequences in the large-scale public databases, Bioinformatics, Volume 36, Issue 18, September 2020, Pages 4699–4705, https://doi.org/10.1093/bioinformatics/btaa586 https://academic.oup.com/bioinformatics/article/36/18/4699/5862012

Shared data science infrastructure for genomics data

Published in BMC Bioinformatics, 2019

This paper discusses Boag, a shared data science infrastructure for efficiently processing and parsing large genomics data repositories.

Recommended citation: Bagheri, H., Muppirala, U., Masonbrink, R.E. et al. Shared data science infrastructure for genomics data. BMC Bioinformatics 20, 436 (2019). https://doi.org/10.1186/s12859-019-2967-2 https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-2967-2