Skip to main content

Questions tagged [pandas]

Python module widely used in data science designed around database commands and calculations

1 vote
0 answers
63 views

I'm having some major chaos with the output table generated and then passing that to my plot function. New to this (1st time), could somebody review this piece of code and suggest some corrections. I ...
aleksk31's user avatar
1 vote
2 answers
1k views

I'm trying to use a Python script to plot with a rolling window using pandas and seaborn. This code worked for the longest time but now it's giving me an error that I don't know how to fix. Here is ...
rimo's user avatar
  • 1,063
1 vote
2 answers
62 views

I have a tsv file that lists the reads and read lengths from a FASTA file but some reads are duplicated - that's just from the analysis I did previously - but I want to only take one instance of the ...
rimo's user avatar
  • 1,063
0 votes
1 answer
574 views

This question was also asked on Stack Overflow Bioinformatics rationale eggNOG files can be very big and sump all available RAM for regular to medium sized desktops. I am looking for advice on using ...
M__'s user avatar
  • 13.4k
1 vote
1 answer
71 views

I am analysing some WGS data in MEGAN and would like to do some additional analysis in Python/R I am having trouble Tidying the Taxonomic data in a format which would be conducive to this. Originally ...
dunc4n's user avatar
  • 35
1 vote
1 answer
50 views

I was doing pathway enrichment analysis using the below code ...
Megha's user avatar
  • 395
1 vote
1 answer
53 views

I was trying to calculate the frequency of disease_present (yes) when smoking status is y (yes) for each group (A, B, C, D) <...
Tom Morgan's user avatar
3 votes
3 answers
2k views

Hi I am trying to get the cancer ontologies (obo_id and label) from EBI-OLS. Earlier I have used the below code to get the obo_id terms and ...
Megha's user avatar
  • 395
4 votes
1 answer
279 views

I have a dataset for mutation data and I want to calculate mutation frequencies across all genes df (This is only the small subset of data) ...
Priya's user avatar
  • 351
2 votes
1 answer
249 views

I have a pandas dataframe that reads in a PAF file from minimap2. What I would like to do is take the first 5 columns of the data from to create a BED file. I used this to extract the first 5 columns: ...
rimo's user avatar
  • 1,063
2 votes
1 answer
69 views

...
Riya's user avatar
  • 317
2 votes
1 answer
104 views

My data = data ...
Megha's user avatar
  • 395
3 votes
2 answers
103 views

The input file looks like this, and the complete file can be found here: ...
user3523406's user avatar
2 votes
1 answer
830 views

I have a data frame (df) which has correlations calculated for different genes with respect to different ID combinations. I want to get separate histogram plot based on the gene name (separate plot ...
Riya's user avatar
  • 317

15 30 50 per page