Skip to content
View svermai's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report svermai

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
svermai/README.md

MasterHead

Hi there, I'm Shubham Verma

πŸ”¬ Bioinformatics & Data Science enthusiast | 🧬 Transforming sequencing data with ML


πŸ“ Focused on AMR surveillance in Dubai πŸ‡¦πŸ‡ͺ



LinkedIn Profile

With over four years of experience in bioinformatics, data science, machine learning, and natural language processing, I specialize in transforming large-scale biological sequencing data into actionable insights. My expertise spans extracting features from diverse tool outputs to train predictive models and discover novel biomarkers. Skilled in handling both short- and long-read sequencing data, I have a strong track record in managing and interpreting complex datasets.

  • πŸ”¬ Currently, I am focused developing solutions to enhance antimicrobial resistance surveillance in the Dubai region
  • πŸ’¬ Happy to chat about Bioinformatics, WGS, Metagenomics, ML,Nextflow, Docker & AWS
  • πŸ“§ Reach out to me at subhamverma844@gmail.com

Professional Highlights:

  • Bioinformaticis Data Scientist - Dubai Health, UAE (2025- Present)
  • Bioinformaticis Scientist - Basepair, USA (2022- 2024)
  • Bioinformaticis Engineer - Navipoint Health, India (2021- 2022)
Languages and Tools:

aws bash docker git linux mongodb mysql python nextflow R

Pinned Loading

  1. bactopia/bactopia bactopia/bactopia Public

    A flexible pipeline for complete analysis of bacterial genomes

    Nextflow 501 80

  2. EDA-pandas-profiling EDA-pandas-profiling Public

    A web-based EDA tool powered by pandas-profiling

    HTML

  3. PneumoKITy PneumoKITy Public

    Forked from CarmenSheppard/PneumoKITy

    PneumoKITy (Nanopore-enhanced) – Fast, sensitive pneumococcal capsular serotype screening from WGS data, now extended to support Nanopore sequencing of pure isolates.

    Python

  4. nextflow-line-counter nextflow-line-counter Public

    This repository contains a Nextflow script that counts the number of lines in a given file and stores the result in output.txt

    Nextflow

  5. collaborativebioinformatics/MetaTango collaborativebioinformatics/MetaTango Public

    MetaTango is a benchmarking pipeline for evaluating structural variant (SV) calling in microbiome datasets. It provides a direct comparison between reference-based methods (e.g., Sniffles2) and gra…

    Python 1 1

  6. Bioinfo-training-2025-26 Bioinfo-training-2025-26 Public

    Comprehensive Bioinformatics Training Program

    Shell 3