Questions tagged [pca]
Principal Components Analysis. A statistical method used to reduce the dimensionality of a dataset while keeping as much variance in the first principal components as possible. It can be used to visualise samples with many variables in 2-D or 3-D, thus allowing for a visual non-supervised grouping of points.
56 questions
2 votes
1 answer
31 views
Linearization versus Normalization when it comes to omics data
I am taking my first course in bioinformatics, and as such I am quite the beginner. This week we've discussed relative log expression, centered log ratio, and using those methods to normalize the data ...
0 votes
0 answers
41 views
SVD on gene expression data
This question was also asked on Reddit I am trying to perform SVD on gene expression data (Genes in the rows and samples in the column). I begin with row centering of the data. Then I do column ...
1 vote
1 answer
134 views
Optimal PCA Dimensionality Reduction Before UMAP for 2D Visualization of High-Dimensional Data
I am working with scRNA data that I reduced to the 100 High Variance Genes and using UMAP to visualize it in 2D. Before applying UMAP, I perform PCA for dimensionality reduction. I am trying to decide ...
2 votes
2 answers
112 views
Should I apply PCA on the combined scRNA dataset or on each individual file?
I am working with Single-Cell RNA-seq count matrices (gene expression x cells) from multiple files, which represent different samples and patients. I want to perform dimensionality reduction using PCA ...
2 votes
1 answer
67 views
How can i resolve "too many entries" error in EIGENSTRAT?
I have genomic data for already converted in eigensoft format. Specifically I have SNP data for 5 populations and 1 outgroup. I ran smartpca with the following ...
2 votes
0 answers
54 views
Plotting PCA from cell counting data, viable way for analysis?
I wanted to help out a colleague and I wanted to know if the things I'm doing are making sense-ish. The data consists of tissue sections which had been stained for markers. The tissue segments are ...
0 votes
0 answers
125 views
pca plot from multiple sequence alignment
I would like to get a pca plot from the multiple sequence alignment (fasta file). I extracted snps in vcf format using snp-sites program. ...
1 vote
1 answer
164 views
How can I quantitatively evaluate which UMAP is best in terms of clustering & embedding?
This question was also asked on Reddit I am new to sc-rna analysis, I have the dataset that I am trying to find out the best UMAP, experimented on trying out different values of the parameters as in (<...
1 vote
2 answers
245 views
PCA of bulk RNA-seq doesn't show clustering and show large variation in general
I am very new to analyzing RNAseq and I am in a group with very little experience in this regard and I am looking for some advice. My PCA after performing DESEQ2 analysis on my dataset doesn't show ...
0 votes
0 answers
33 views
Done miRNA sequencing from animal tissue & have 1 control sample in duplicates 1 treatment group as test. In PCA plot clustering is not there
I have done miRNA sequencing from animal tissue and have control sample in duplicates . Like wise one treatment group as test. In PCA plot clustering is not there. There is very randomness. Is it ...
2 votes
2 answers
100 views
Predict population based on PC coordinates
Utilizing a reference panel, I want to assign most likely population label to each individual in the study. Following are the files I have: Reference panel population labels: ...
2 votes
0 answers
583 views
How to perform PCA on proteomic data set
I'm trying to perform Principal Component Analysis using R on a proteomics dataset. As the dataset contains a lot missing values I tried different approaches. I ran PCA using ...
4 votes
2 answers
614 views
Question about umap using different numbers of pca components as initialization
I am new to the scRNA-seq field and I have been doing some experiments of visualization of UMAP using different numbers of PCA components for initialization. The process involves projecting scRNA-seq ...
0 votes
0 answers
149 views
scRNA: What are good dimensionality reduction/clustering parameters to get biologically plausible groupings?
I've got a moderately large set of PBMCs, over 1M cells. That means I can't easily do a grid search of dimensionality reduction/clustering parameters/methods. Some examples results I'm getting with ...
1 vote
1 answer
102 views
Test for differences between groups of samples
Sorry if the answer to this should be obvious. I have RNA-expression results from 24 samples which can be divided into 6 groups, (wildtype and two different mutants at two different ages) with a total ...