Questions tagged [gatk]
Genome Analysis Toolkit. Popular open-source bioinformatics suite to call variants from second-generation sequencing data.
67 questions
3 votes
1 answer
104 views
I need to index a GTF (gene transfer format) annotation file
This question was also asked on Biostars I created a GTF file for HLA alleles to be used as a resource for GATK Funcotator. Running Funcotator without indexing the GTF gives this error: ...
5 votes
1 answer
109 views
Replicating VCF Filtering & Trait-Based SNP Extraction Workflow
I'm new to handling SNP genotyping data in VCF format. My goal is to identify significant SNPs linked to a specific trait (like "height") for specific samples in a multi-sample VCF file. My ...
1 vote
1 answer
59 views
Problem while mapping reads to mtDNA (SortSam)
I am trying to map MiSeq reads to a reference genome and extract mutations using MToolBox, which implements gsnap, GATK, Picard, and other tools. When running the tool with example data, there were no ...
0 votes
0 answers
251 views
Using GATK4.2 to create .idx using IndexFeatureFile for multiple vcf files
I have used GATK script : gatk IndexFeatureFile --input sampleid_hc.g.vcf to create .idx files for individual vcf files. However I am trying to create a loop and ...
2 votes
1 answer
154 views
How do I quantify a specific somatic variant?
I'm working with targeted Illumina sequencing data generated with DNA from diseased and healthy tissue (this is an age-related disease and is not cancer/neoplastic). My hypothesis is that the diseased ...
1 vote
1 answer
251 views
How to adjust cutoffs in Mutect2
I'm currently performing some initial analyses on targeted Illumina sequencing data generated with DNA from diseased tissue. My hypothesis is that one particular gene might contain somatic mutations ...
1 vote
0 answers
72 views
java.lang.ArrayIndexOutOfBoundsException: Index 86 out of bounds for length 86
I use the next versions: gatk 4.4.0.0 minimap2 2.26-r1175 samtools 1.19 I have a fastq file. And try to implement gatk MergeBamAlignment to generate a new ...
0 votes
0 answers
116 views
ERROR IN FUNCOTATOR: [SQLITE_IOERR_LOCK] I/O error in the advisory file locking logic (disk I/O error)
I ran the "funcotator" step from the GATK somatic short variant discovery (snvs + indels) pipeline and used the output of filtermutectcalls as input, but ...
2 votes
2 answers
236 views
How to run a GATK Docker Image with local files?
I'm trying to use the HaplotypeCaller from the GATK toolkit but I keep getting an error. I pulled GATK through Docker and am using this command: ...
2 votes
1 answer
92 views
I am trying to create a subset of 10k variants from 25-30 unmapped contigs of a g.vcf file including the header
My objective is to take a g.vcf.gz file and from 25-30 unmapped contigs with titles like "NW_020192317.1", I want to make a subset of ~10k variants from each of the unmapped contigs and make ...
5 votes
1 answer
158 views
MergeBamAlignment error
I doing the alignment of samples following the GATK pipeline, and doing the MergeBamAlignment,like this: ...
0 votes
1 answer
233 views
A bash script for running on a bunch of bam files
I have some bam files in this directory /data/Continuum/WES/results/ I want to run GATK mutation calling over bam files I ...
2 votes
1 answer
99 views
Importing local files to a GATK docker image
I have pulled GATK docker on my computer but when running a GATK command, can not locate my input file My OS version ...
1 vote
0 answers
40 views
GenomicsDB and recalibration recommended for cohort of mixed exome (GATK)
We have a cohort of around 3k exomes, most of them from different capture kits, such as Agilent v6, v8, Nextera, Twist, Clinical Research exome and many others. Right now we are creating genome db on ...
1 vote
1 answer
564 views
Getting VCF file that contain common SNPs from 6 VCF file using isec
I have 6 VCF files, where I would like to obtain the SNPs that are common (by position) in all the 6 files. I have tried this command ...