I know a node can refer to the orignal kmer and its forward-reverse kmer, but how to deal with the problem that these short reads can also be in different orientation to the reference. For example, given genome reference AAACCT, should ACCT(TGGA)(forward) and TCCA(AGGT)(backward) considered also a same node in the de-Bruijn graph? Or just divided them into two seperated node?
So that's to say when traversing the graph, a node actually represents four conditions(forward(ACCT), backward(TCCA), forward-reverse(TGGA), backward-reverse(AGGT))? Would it cause more branches?
I assumed you had 8-mers. A node never represents its reverse, it's either the sequence or its reverse complement.