Convert gene list to fasta file
1
1
Entering edit mode
7.1 years ago

Hello,

I have a gene list with 504 genes and I want to convert them to fasta file, because I want to do alignment some NGS data with only these genes. How could I do that?

Thanks

next-gen alignment • 3.1k views
ADD COMMENT
1
Entering edit mode

You can always extract the reads aligning to a position on the genome/genes (e.g. with samtools, bedtools, featurecounts, deeptools ... many) once all the reads are aligned to whole genome. If you remove the possibility for a read to align to a best position on the genome, it will be forced to align (or discarded) to a position where it doesn't belong. This introduces a lot of noise in your data.

I'm really curious to know what would you do in downstream analysis after aligning to only these 504 genes. Would you go for differential analysis with this set?

ADD REPLY
0
Entering edit mode

504 genes from a single organism? Post a few examples if you want further help.

ADD REPLY
0
Entering edit mode

Yes, these genes are from human.

ADD REPLY
0
Entering edit mode

This doesn't sound like a good idea. Why would you want to align to just these genes?

ADD REPLY
0
Entering edit mode

Because these genes are connecting to a specific phenotype

ADD REPLY
0
Entering edit mode

I don't think that reason justifies aligning just to those. If you align to a subset of the genome you will get false positive alignments. Always align NGS to the full genome.

ADD REPLY
0
Entering edit mode
7.1 years ago
theobroma22 ★ 1.2k

Assuming you have the sequences and you can use R, in R you can use the write.fasta() command which is in the seqinr package.

ADD COMMENT

Login before adding your answer.

Traffic: 1373 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6