While going through the code of string graph assembler (SGA) and especially Heng Li's implementation of BCR algorithm for fast indexing of sequence libraries (https://raw.github.com/lh3/ropebwt/master/bcr.c), I came to realize that constructing Burrows Wheeler transform in SGA has many similarities with k-mer counting in de Bruijn graph-based genome assembler. In my naive understanding of BWT (but not BCR code), constructing BWT index is likely to take more time in regions with highly repetitive k-mers. Does anyone has further insight into the process? Are these kind of code and algorithm-related questions appropriate for Biostar?
Yes, bioinformatics code and algorithm questions are entirely appropriate. It helps if questions are specific, "further insight into the process" is rather vague.