CV
Education
- Ph.D. in Computer Science, Minor in Bioinformatics and Computational Biology, Iowa State University (ISU)
- M.Sc. in Computer Science, Iowa State University (ISU)
- B.Sc. and M.Sc. in Software Engineering, Shahid Beheshti University
Experience
- Data Scientist
John Deere Financial, Johnston, IA
Jan 2023 - Present
- Developed secure, domain-specific AI solutions for the finance sector, leveraging GenAI and LLMs to maintain data privacy and drive innovative outcomes.
- Developed a time-series analytics platform, enhancing competitive analysis and pricing.
- Full-stack development of Channel Analytics, synthesizing multifaceted financial metrics to streamline decision-making processes.
- Enhanced data quality with an LLM, revealing $1B in transactions within an $8-10B volume, a 10% gain.
- Built infrastructure for the ‘Datafication’ project, establishing robust databases and APIs for Emerging Tech.
- Data Scientist
John Deere, Pricing and Incentive, Olathe, KS
Jan 2022 - Jan 2023
- Developed a customer analysis system using NLP techniques for product and customer segmentation.
- Created a Sales & Marketing intelligence tool, synthesizing diverse data for informed decisions.
- Engineered and deployed LLMs for feature extraction from complex, messy text datasets, improving data quality and producing actionable insights for downstream tasks.
- Data Scientist
John Deere, Intelligent Solution Group, Urbandale, IA
Jan 2021 - Jan 2022
- Created a convolutional neural network to estimate crop types and growth stages from satellite imagery.
- Used historical and forecasted weather data to estimate a Field Readiness Index.
- Built an automatic report generation pipeline for the 2021 planting season.
- Ph.D. Student and Research Assistant
Dept. of Computer Science, Iowa State University, Ames, IA
Aug 2015 - Dec 2020
- Built a domain-specific language for genomics, leveraging distributed systems to enhance parallelization and storage optimization.
- Implemented an automated data cleaning technique for mislabeled sequences in large-scale public databases.
- Conducted in-depth analysis of data science bugs within R and Python, uncovering distinctive bug patterns and revealing insights for enhanced software practices.
- Deputy Manager of IT Center
University of Kurdistan, Sanandaj, Iran
Aug 2014 - Aug 2015
- Trained and managed 20 junior IT employees and oversaw 30+ IT projects.
- Helped establish, budget, and manage outsourced projects.
- Optimized IT infrastructure and analytics for university-wide departments, streamlining operations.
- Senior Network Engineer - Analytics
University of Kurdistan, Sanandaj, Iran
Aug 2008 - Aug 2015
- Analyzed network data that led to a refund of 9 months of internet bandwidth from the internet provider.
- Boosted data security with IP, encryption, and wireless protocols, cutting annual costs significantly.
- Enhanced incident response by analyzing network logs with Security Incident and Event Management systems.
- Software Engineer and Co-Founder of Saral System
Saral System Kurdistan, Sanandaj, Iran
Sep 2005 - Dec 2009
- Led design and full-stack development of 10+ projects, including custom hotel and accounting systems.
- Developed tailored accounting, sales, and inventory systems for 100+ businesses, boosting efficiency.
- Innovated with automation across various projects, boosting user experience and workflow optimization.
- Software Engineering Part-time Lecturer
Yazdan Panah University, Sanandaj, Iran
Sep 2005 - Jan 2007
- Taught Software Engineering and Database Systems to classes of 40-60 students.
- Supervised 15 undergraduate students for their final projects.
Skills
- Programming Languages: Python, R, Java, JavaScript, SQL, PySpark, Solidity, Rust
- Analytics and Machine Learning Tools: TensorFlow, PyTorch, Keras, Scikit-Learn, Spark, Databricks, Large Language Models
- Statistical and Data Analysis: Proficient with Python, R, SQL, and Tableau
Publications
- Ahmed, S., Wardat, M., Bagheri, H., Cruz, B. D., & Rajan, H. (2023). Characterizing Bugs in Python and R Data Analytics Programs. arXiv preprint arXiv:2306.08632.
- Bagheri, H., Severin, A. J., & Rajan, H. (2020). Detecting and correcting misclassified sequences in large-scale public databases. Bioinformatics, 36(18), 4699-4705.
- Bagheri, H., Muppirala, U., Masonbrink, R. E., Severin, A. J., & Rajan, H. (2019). Shared data science infrastructure for genomics data. BMC Bioinformatics, 20(1), 1-13.
Honors/Grants
- XSEDE NSF Awards (2019-2020): Secured a combined $14,200 for computational infrastructure initiatives.
- International Graduate Award, 2018: Recognized for academic excellence at Iowa State University.
- Best Employee Award, 2014: Earned for outstanding professional contributions at University of Kurdistan.