How to deal with the Ns in gene sequences?
0
0
Entering edit mode
7.5 years ago

I have around 30 k sequences and need to find the codon frequencies how to deal with the presence of Ns in the sequence

Should I need to mask the regions or to replace with specific nucleotides or try any other methods to overcome this issue? Will this manipulate my results?

gene sequence • 1.7k views
ADD COMMENT
0
Entering edit mode

I'd recommend masking these regions.

ADD REPLY
0
Entering edit mode

How to mask using Python program

ADD REPLY
0
Entering edit mode

Why use a Python program? Is this an assignment question?

ADD REPLY
0
Entering edit mode

These are NGS reads?

ADD REPLY
0
Entering edit mode

These are gene sequences retrieved from Ensembl database

ADD REPLY
0
Entering edit mode

Is it fasta format then? If this is an assignment, can you use biopython?

ADD REPLY
0
Entering edit mode

Yes the sequence is in fasta format and how to mask with biopython also I need to draw a CGR how can it be done with biopython

ADD REPLY

Login before adding your answer.

Traffic: 2957 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6