I have 2 files, one is about 7 MB and other is 1 GB. The first file seems:
rs58108140 A/G chr1 10582
rs10218492 A/G chr1 10827
rs10218493 A/G chr1 10903
and the second one is:
rs58108140
rs10218493
rs11240777
I need to search second files snps in the first file and return the matching rows. It should be like that:
rs58108140 A/G chr1 10582
rs10218493 A/G chr1 10903
I tried to put all them in a database and take their inner join based on snpids but it is still working. Do you know a fast solution?
I strongly recommend alessia's solution! But just for the record: have you build an index over the columns that you join on (i.e., the columns containing the IDs)? Otherwise, it is not surprising that the query takes forever ;-)