Question

Solid State Drive For Bioinformatics Data Crunching

4

Entering edit mode

14.9 years ago

Khader Shameer 18k

Any one here using a laptop/server/cluster with a Solid State Drive for bioinformatics data processing ? A friend recommended to use them for the data crunching as the speed and efficiency of I/O calls will be faster on SSD comparing to HDD. Any experiences or thoughts?

hardware • 4.4k views

ADD COMMENT • link updated 6.6 years ago by Ram 45k • written 14.9 years ago by Khader Shameer 18k

Ram · Answer 1 · 2010-06-30

7

Entering edit mode

14.8 years ago

Istvan Albert 102k

I have tried a few times optimizing data processing via a virtual file systems stored in RAM. These are supposedly providing the fastest I/O. I have not observed a substantial performance increase in tasks relating to mapping reads to the genome (somewhat disappointingly).

Other applications may show different performance characteristics. I would test the performance change with tmpfs first because it is easy to set up.

ADD COMMENT • link updated 6.6 years ago by Ram 45k • written 14.8 years ago by Istvan Albert 102k

0

Entering edit mode

Thanks for sharing your thoughts.

ADD REPLY • link 14.8 years ago by Khader Shameer 18k

0

Entering edit mode

+1 for the simple test. On modern Linux no setups is necessary. Out of the box /dev/shm/ (tmpfs) will be mounted - it addresses half of the RAM.

ADD REPLY • link 14.2 years ago by Aleksandr Levchuk 3.2k

score 6 · Answer 2 · 2010-06-30

6

Entering edit mode

14.8 years ago

Jussi ▴ 180

SSD may help only if the software is consuming significant time waiting for disk. Especially if you have lots of small random access IO requests Random access time of SSD is much better compared to spinning disks.

If you access the data serially, then the throughput of HDD is quite good and you do not get big improvements changing into SSD. For example, Oracle gets over 10 GB/s throughput from HDD in their Exadata database machine.

ADD COMMENT • link 14.8 years ago by Jussi ▴ 180

1

Entering edit mode

Thanks for your points Jussi. I was looking at a process where I have a lot of small random access IO requests. I was curious to know if someone is already running bioinformatics applications on a SSD, because machines with SSDs are very costly comparing to same configuration with HDD.

ADD REPLY • link 14.8 years ago by Khader Shameer 18k