Entering edit mode
22 months ago
Tsigabu
•
0
I have a reference genome explained by accession number instead of chromosome number. Any clue how to convert it into a chromosome number in the Linux terminal? Any suggestion is kindly appreciated.
Could you provide an example for your question, including the input and desired output? This will make it easier to assist you.
Thanks for the reply. As displayed below, the fasta reference file is starting with
>NC_
, but what I'm looking for is I need to substitute the accession number by Chromosome number. For-example, the accession numberNC_056070.1
stands for chromosome 17. instead of starting the first column withNC_056070.1
I need Chromosome 17. The fast file has 26 chromosomes, X chromosome and mitochondrial sequence as well.kindly.
If the above is a consistent pattern, edit the header so it has just the
chromosome [^,]+
but aschromosome_[^,]+
like so:Test before running on the whole file and double check headers once done.