Hi
By running this command:
makeblastdb -in Total.assembly.fasta -parse_seqids -dbtype nucl -out my_db
blastn -db my_db -query X.fasta -out results.out
The following results were obtained:
Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb
Miller (2000), "A greedy algorithm for aligning DNA sequences", J
Comput Biol 2000; 7(1-2):203-14.
Database: Total.assembly.fasta
87,103 sequences; 164,122,436 total letters
Query= c41837_g1_i1
Length=1353
Score E
Sequences producing significant alignments: (Bits) Value
c41837_g1_i1 len=1353 path=[1:0-297 299:298-304 @306@!:305-511 5... 2499 0.0
>c41837_g1_i1 len=1353 path=[1:0-297 299:298-304 @306@!:305-511 513:512-532
534:533-730 732:731-733 @735@!:734-1164 1166:1165-1223 1225:1224-1352]
Length=1353
Score = 2499 bits (1353), Expect = 0.0
Identities = 1353/1353 (100%), Gaps = 0/1353 (0%)
Strand=Plus/Plus
Query 1 CaaaaacaaaaacaaagaaaacttaagaaaaaaTGCGCGCAATCCTCGCTCTTGCATTCA 60
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 1 CAAAAACAAAAACAAAGAAAACTTAAGAAAAAATGCGCGCAATCCTCGCTCTTGCATTCA 60
Query 61 TAGGCGCTGTCTTTGCTCAAACCACCGTCACTGACGTCCTTCAATCATACCGTGTCACCT 120
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct 61 TAGGCGCTGTCTTTGCTCAAACCACCGTCACTGACGTCCTTCAATCATACCGTGTCACCT 120
How can filter these data based on the positive and negative numbers or low or high numbers of Scores and Values?
And also, how can extract IDs from these data?
can you elaborate what exactly you mean with "based on the positive and negative numbers or low or high numbers of Scores and Values" ?