Timeline for How to get all columns based on two columns using awk?

Current License: CC BY-SA 3.0

25 events

when toggle format	what		by	license	comment
Apr 24, 2017 at 9:08	comment	added	terdon♦		Use Kusalananda's answer, but change the character range from `[ACTG]` to all allowed characters: `[ACTG-]` or, depending on what you are doing, `[ACTGUN-]`. So, something like `awk '$5 ~ /^[ACGT-]$/ && $6 ~ /^[ACGT-]$/ \|\| NR == 1' data.in`
Apr 24, 2017 at 9:06	comment	added	stack_learner		but @terdon Could you also tell me how to get the single nucleotide deletions.
Apr 24, 2017 at 9:05	comment	added	stack_learner		Actually for this question I need only Insertions
Apr 24, 2017 at 8:54	comment	added	Thor		@user3351523: No you didn't. Please answer terdon's last question
Apr 24, 2017 at 8:48	answer	added	user218374		timeline score: 1
Apr 24, 2017 at 8:41	comment	added	stack_learner		yes..I made changes for that....do you think this is right? awk 'NR==1{print; next} $5 ~ /^[A-Z]$/ \| awk -F'\t' '$6=="-"' filename
Apr 24, 2017 at 8:38	comment	added	terdon♦		And do you also want single nucleotide deletions? What if the ref is `T` and the alt is `-`, for example? Both `T` and `-` are single characters, should that line be printed?
Apr 24, 2017 at 8:37	history	edited	terdon♦	CC BY-SA 3.0	deleted 4 characters in body
Apr 24, 2017 at 8:35	vote	accept	stack_learner
Apr 24, 2017 at 8:30	vote	accept	stack_learner
Apr 24, 2017 at 8:35
Apr 24, 2017 at 8:23	answer	added	Kusalananda♦		timeline score: 7
Apr 24, 2017 at 8:22	vote	accept	stack_learner
Apr 24, 2017 at 8:30
Apr 24, 2017 at 8:17	answer	added	Valentin B.		timeline score: 7
Apr 24, 2017 at 8:14	comment	added	stack_learner		Very sorry. I edited now and I need all the columns based on "ref" and "alt" columns with single character.
Apr 24, 2017 at 8:13	history	edited	stack_learner	CC BY-SA 3.0	added 95 characters in body
Apr 24, 2017 at 8:11	comment	added	Kusalananda♦		It is clear that you want to pick out a unique line for each sample, but I can't see what criteria you use to pick the one you do.
Apr 24, 2017 at 8:11	comment	added	terdon♦		We can't give you the right command if you don't tell us what you are trying to do. What is the relationship between your input and output? How do you get from the input to that output? You seem to just be printing 3 random lines. Please edit your question and explain your objective.
Apr 24, 2017 at 8:11	comment	added	stack_learner		@ZumodeVidrio No that doesn't work.
Apr 24, 2017 at 8:09	history	edited	terdon♦	CC BY-SA 3.0	That's a file, not a data frame. ; edited tags
Apr 24, 2017 at 8:09	comment	added	Zumo de Vidrio		I think you may want to take the values of 5th and 6th columns with this `awk` syntax: `awk 'OFS="\t" {print $5,$6}' dataframe_file`, but you haven't provided enough information about how you want to filter based on these fields.
Apr 24, 2017 at 8:09	comment	added	stack_learner		Ya I know but I'm not aware about what I need to give. Could you please give me the right command.
S Apr 24, 2017 at 8:08	history	suggested	Zumo de Vidrio	CC BY-SA 3.0	improved formatting
Apr 24, 2017 at 8:05	comment	added	Kusalananda♦		You will have to explain a bit more about how the expected output is derived from the given input.
Apr 24, 2017 at 8:04	review	Suggested edits
S Apr 24, 2017 at 8:08
Apr 24, 2017 at 8:01	history	asked	stack_learner	CC BY-SA 3.0

toggle format