I have SNPS in the .fst file containing the estimated fst value in the 6th column like:
2R 4459 1 1.000 96.0 1:2=0.01762811
2R 9728 1 1.000 99.0 1:2=0.01340363
2R 9828 1 1.000 100.0 1:2=0.01554609
2R 9928 1 1.000 99.0 1:2=0.01454173
2R 10028 1 1.000 100.0 1:2=0.01317223
2R 10128 1 1.000 100.0 1:2=0.01554917
2R 10228 1 1.000 100.0 1:2=0.01202964
2R 10328 1 1.000 100.0 1:2=0.01316962
2R 10428 1 1.000 100.0 1:2=0.01317223
2R 10528 1 1.000 100.0 1:2=0.01316962
2R 10628 1 1.000 100.0 1:2=0.01778599
2R 10728 1 1.000 100.0 1:2=0.01554609
2R 10828 1 1.000 100.0 1:2=0.01554917
I want to filer those SNPs that have a value greater than 0.9 so I am trying this command in Linux:
awk -F"\t" '$6>0.02' file.fst
But it's not fetching an exact 0.9 from the 6th column due to the presence of 1:2=0 in every row of the 5th column.
Which changes do I need to make in the awk command?
After finding SNPs, I need to annotate them by using snp eff so is it possible to apply SnpEff to the .fst file?
- DYH17 Structure Prediction
- Pediococcus acidilactici sequence analysis
- Gene labels problem in enhanced volcano
- Combine GEO ids
- Is Bioinformatics a Professional Degree?
- RSEM implementation
- Variants that are different between the clown and control
- Error while running GATK4 Mutect2 in Galaxy
- Reference genome of python molurus bivittatus
- Installing pybedtools in Windows 10
- Pig Reference Genome
- Use bismark files to run methylKit
- Analyze 2 or 3 different datasets from GEO2R
- Split trio vcf file
- Obtaining bcf and vcf files and use bcf tool annotate
- Design and Contrast Matrix in Limma Analysis
- expression of the GAT1 transport protein
- Use tximport to merge and transform the transcript counts to gene counts
- Disease information from gene names
- How to create bigwig file in galaxy after MACS procedure(Chip Seq)
- Transform Data into DNA picture
- Blast Variations
- How to generate pathway analysis report for a list of proteins
- Merging different fastq files into one folder
- Python script to process NGS data
- How can we classify our protein as an α, a β, an α/ β or an α + β protein?
- Need help to read samples from PyVCF
- Read multiple VCF files through PyVCF
- MSA and distance matrix in R
- t-SNE analysis on multiple fasta files in R
- Using ENSEMBL API to fetch the snps for the list of genes
- Best Database/Wesbite to show inhibiting and promoting substances of gene
- Making python script for downloading genomes for OMA Analysis
- Difference between between eukaryotic and prokaryotic primer designs
- Unable to install pysam on windows 7
- Spike proteins sequences of all variants of Covid
- Identification of Gene through GEN SCAN
- Dataset cleaning through python
- Needleman and Wunsch Algorithm
- Need help to remove NA values from data frame
- Differential Gene Expression
- Error in DSSP Module of Biopython
- Use bioinformatics tool in iphone 7
- Protein Structure drawing on graph paper
- Remove spaces from fasta file in python
- Is Biopython a Package or Library?
- Network X used in Biopython
- Double Digest through Regular Expression in python
- Translation from fasta file which contains 6 sequences
- RNA Seq Analysis in R
- biopython script to alingn mutiple sequences
- Python script for text matching
- PPI Analysis in String Database
- DEG Analysis in R
- Problem in calculating p values
- how to extract mutation from MAF file in python
- mummer plot interpretation
- Can we consider Bioinformatics as a engineering discipline?
- I want to predict the gene through Knn algorit