Entering edit mode
6.9 years ago
h_asma
•
0
Hello
I want a perl script to extract name of cis regulatory module ("Sry-alpha_CANBG ") chromosome number ("2R") and coordinate (2377048_2377297) in a tab separated format from this header
>Sry-alpha_CANBG:Dmel:3R:25866821-25867071_2R_2377048_2377297
^-------------^ ^^ ^-------------^
Output should look like this:
Sry-alpha_CANBG 2R 2377048 2377297
I have tried this
grep "^>" filename.fasta
but this returns all the headers. And as theses are variables I can't use specific pattern to find and replace it with nothing. Any suggestions would be appreciated
Thanks
As you are a new user, you should know that first you need to try something yourself, in awk/perl or whatever language, and if you still cant figure out, people help in modifying your script/command.
People here doesn't provide ready-made scripts as per your requirements.
There are many many posts on this forum handling fasta headers. So check them.
what have you tried ?
Hello hasiba.asma!
I'm closing your post until you add information on what you've tried. Once you add that information, I will open the post so people can answer your question.