I've got some human amplicon sequencing data which I know contains variants, but I'm trying to work out the best way to call the variants, as most variant callers seem set up for whole genome data. I've got a mix of short read and long read (ONT and PacBio) data for the same amplicon.
The amplicon is at chr15:30904775-30905567, and you can see the indexed and sorted bam in the attached IGV screenshot. It contains an indel in the centre and a heterozygous SNP to the right. The bam was generated using minimap2 as follows:
minimap2 -ax map-ont -R "$rg" "$ref" "$fastq" | samtools sort -o "$out/${sample_name}.bam" Can anyone suggest the best way to call variants, including indels and SNPs, in my amplicon. I want to integrate it into a flexible pipeline.
