I would like to quantify similarities between sequences using blastn from package BLAST+, but I'm not sure if it is a right tool. The output I get from blastn contains only statistically significant alignments and their p-values. Is there any way to get alignments for every sequence in database?
I have approximately 100 000 sequences for which I'm trying to quantify the similarity between each pair of sequences and my database for blastn contains only these sequences.
Thank you for the response but how to create distance matrix for all sequences using blastn, which gives me e-value or bit-score for only significant alignments.
Blast is not the right tool because there is a limit on how many sequences it will report. In theory you could make one database for each sequences or divide the sequences into many small groups and then run blast for each sequence against that database but that would be inefficient.
A more efficient way is to use a program such as MOTHUR once you have the sequences aligned. Then you will get your distance matrix and will be able to play with that data. MOTHUR is used for microbiome analysis where you compare libraries of the same gene and define groups based on similarity.