2.7 years ago
diversitree
I have a FASTA file of assembled contigs that contain Ns as well as IUPAC ambiguity codes. I want to calculate the average contig length across the file, excluding Ns.
Here is a snippet of the FASTA file:
>uce-4216_species1 |uce-4216
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGGTTCTGAGATGCCTGGCATTCAGGATGATTTGTAATGTAAATTATATAATTGTACTTTCACATATTTTAACATCAAATAGAATGATTGACTACAGAATTTGAGCTGTCTACAGGTGGGGGTCAATTATCATCTGAATAATCACACTGCCACACAAGAATAGCATGGCCATGGAGTGTGACATATTTTTATCTCTATGCATTTCAATGAAGTCAGCCTGGTACATAAAAGGTTATCACCTAGGAAACATATTTTCCTAAGCACAAGTTAAACATGCAAGCAAGATCAGCATAGATATTCAATTTAGCCAGTCAACCCTAACCTATTAATATTTTAACAAAATCCAGTGAGGATAATTTTTTTCTTTGATCCCATCTCATTTGAGCAGCCTGGAAAGGGAAGAAAAATTAAAAACAAAATAGTCAAGCATACAGAATGAGGTTATGTATTAAGTGGGCTATTTAATGTTTTTGGCATATTATAGCCCTAGGGAAAGTGTGGATGGATTTAACAATCAAGATCTGTGTTCCCTGGGCCCACAAAGTTCGAGAAACATAAAATAATCTATACTTCCGAGCTGACAAATCTTACCTGACACACTGCTTCATCTCACTGGGACTCTCTAGCTCAGCATTAATCATATGTTACAGGGAGTAAAAAGAAAAGTAAATCACACTAAGCTGGAATG
>uce-4175_species1 |uce-4175
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTGAAATTAAGCTTGCTATTTTTCTGTCACACATATAGATACAGTCAAAGCTTGTTTGATGATAAGAACTCAGTTCAGGATCTCATCTTTTGCCCTTGGCTTTAACTTATGTATGCCTTTGTCTGTATTGTCTGTACTTGTCTGTACTGCAAACACTTATGCATGTTTCTGCTATTATATATAGACTAAATATGTCATAACACATGAATGCAAAAGGATCAAAAATGCCTTCCTACTTTATAATCTGCTCAGCCAGAAACAGACTCTGTTTCTACCCTGCCTTTTCCTACATGTCATATTATCATCAGCTGCTCTTATATCCCAAAAGAATACTAACTACTGATCGATTGCCYGGACATGTCTGGCCGTGGCCTACATGTGCCCCGGGTAGTTCATTTATTTGCCACGGTGGATTTGCTAGAGTGGAATTTAATCAATAGCTAATTCATTAATTCTGGTCCTCGAGTATATAGGGATTGTGCAGTATAAAAATGACTGGCTGGCTTCAGCTTTGATTGAAATATGACAAACACTGCAGCTGACAGCCTTGGCAGTTGCCAGGCTGAATATGAATTTTGCTTATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGTTGCTGCAGAGATTAGAAAAGTTTAAAGAAACCTTTGGGTTGTTTTGCCA
>uce-1234_species1 |uce-1234
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTCNCAGAGGGGAATGCAGAATTGAGAAGTATCATTCCTAAACCTTGACAACCTTTCATCAGGCAAGTGATAAGGAATTGCATGTGTGGAGTACAGGCAGGTTCTCTTCTTGCTGTAGAGGATTTCCACCAGTGCCTGTGTTCTTGTTGAGTAATAATGAGATACCATAATGAAACAGTAGAAGATGGTTCTCCAATAACATTAGGAAAAAAGCAGCTGATGTATGGGATATTGAAGGCAGATTATGTTGTTAATGTATATTAGTATATTCTTAATTTCCTTTTAATTGAAAAAGACATATTGACTTTAATTAAAATCATTTCACAGGAACTGTCAATTAGCACATGTCAAACTAGTTAATTCAGAACAGAATTCTTTTAATTAGGGTCTGCTTTCCTTTAACTGTGGGGCCAATGAAATCAGCCTTTCCTTATCAAGACTTTAAATGTCTCTAAGAAATACAATACAAATCTCTAAAAACTCTTATCTATTATTAGAATCCCATATGGATAACATTAAAATRRTSKTKCTKSMAWKSYWYMWKYYWCMKKWWTYWKWKYYWMMTWKMYRKMWAKKWRRMWMWRAWWMMKGWR
>uce-2732_species1 |uce-2732
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGAATTTAGTTTTCTGCTACTGATCTGTTTGTACATGATCTCTCCCTTTCCCCCCCTCCCTTTCTCTTTCCCCCTCTCTCTCGCTCTCTTTCATATTTAATGTCTGCGTATNNNNNNNNNNNGGATGTACTTTGTCATTTTAAGGTAATTGCGATTTCTCTCAGAATAMMRRSMWMRSMTYANTNMTAANSYKAGSCTGGTATGTGGCTAACTGAAATGCAAAAGGAAGAAGAGGCTTTTTTTTTTTTTTAAGGGGTGGGGGAGAGTTAATTTCCACATTGACATTTTGGAGATACAAATGCAGAGCAAAATCCTTGGGGGGGGGTGTNNNNNNNNNNNNNNNNNNNNNNNNNGATTGGTAATTTTCTTTTTGGTGGACTGCGCAATAGGTATGGTAATTTTAAAAGAGGGTGATTTATATGAGCTTCAGTAAATGCTGCATATTGTATTTCAAAGAGTTTCCTGTCGTGACCTCATAAAAAGAGGAGGAGGCTTGTATGTGTTGCAGTGCCTAGTATATGTCGATTTTGTTGCATCGTTGGGCAGCAGCGCTGTAAGAAGGAATGTCAGCTTTTACATAACGCTCTTTTTGCTTTTGACTCTGTGAGGGGCTGTAAGGGTCCATCTTTGTGATCACAGATGGAGTGGAATGGCTTGAAAATGGTAAGTGAACGGGGAGAGCCTGCTCGTGGGGTTTTGTCTCG
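One possible way to compute this is sketched below in plain Python. It rests on a few assumptions that are not stated in the question: only `N` (upper or lower case) is excluded, the other IUPAC ambiguity codes (`Y`, `R`, `M`, and so on) still count toward the length, and sequences may be wrapped over several lines. The function names `non_n_lengths` and `average_non_n_length` are made up for illustration.

```python
from statistics import mean


def non_n_lengths(lines):
    """Yield (header, length) per FASTA record, counting every character except N/n.

    Assumption: other IUPAC ambiguity codes are counted as sequence.
    """
    header, count = None, 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, count  # emit the previous record
            header, count = line[1:], 0
        elif header is not None:
            # Sequence line (possibly one of several for this record)
            count += len(line) - line.upper().count("N")
    if header is not None:
        yield header, count  # emit the last record


def average_non_n_length(lines):
    """Return the mean non-N length over all records, or 0.0 if there are none."""
    lengths = [n for _, n in non_n_lengths(lines)]
    return mean(lengths) if lengths else 0.0


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python avg_len.py contigs.fasta
    if len(sys.argv) > 1:
        with open(sys.argv[1]) as handle:
            print(average_non_n_length(handle))
```

If the ambiguity codes should be excluded as well, the counting line could instead count only `A`, `C`, `G`, and `T`.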