Ladies and Gentlemen, I'm now planning to buy a new high-performance computing server for bioinfomatics. I prefer the tower type because the server is supposed to be laid at the corner of our office as we don't have a single computer room.
There are several options for me as follows:
1) Dell T640, with two Intel Xeon 8280 CPU (each 28 cores, 56 thrds, 2.70~4.00GHz)(best tower type of server in market)
2) H3C UniServer R4950 G5, with two AMD EPYC Milan 7763 CPU (each 64 cores, 128 thrds, 2.45~3.50GHz)(though rack type, but a silent cabinet only costs less than 1000 CNY = 155 USD at 2021/07/03)
3) Wait for a tower server to come to market with 3rd Gen Intel Xeon CPUs (especially Intel® Xeon® Platinum 8380) or AMD EPYC Milan CPUs (especially AMD EPYC™ 7763)
I wonder if there are any compatibility issues between AMD cpu and common-used softwares? The AMD seems much cheaper and could leave me more budget for RAM, but compatibility is more essential, right?
Also, have anyone tested computing performance of these CPUs(INTEL 8280 vs. AMD 7742; INTEL 8380 vs. AMD 7763)? Some people said the calculating capability of 2 AMD threads equals that of a single Intel thread.
Besides, is the rack type of server really very noisy?
Thank you in advance for sharing your opinion and your experience.
Relevant reviews from Hexus: Milan 7763, Ice Lake 8380. My impression is that the AMD offering is better.
Why would this be the case?
Somebody should send this to Intel's marketing department?
You don't want to have a rack server in your office.
Thx for your reply.
As most discussions about this issue on the Interent are contaminated by AMD fans, Intel fans and hardware merchants, i cannot get enough valuable info. And that's why I raise this question. There are only two useful points I got as below: 1) GATK team said they have worked with Intel team to optimize code. 2) BUG reported on Github page.
OK. I don't want to apply hydrocooling system on reck server either, so it seems that tower type is the only feasible option.
Thank you for sharing those links. Didn't know about either of these things. I wonder why there isn't a
GATK
-AMD partnership--perhaps it's in the works.That bug report is pretty concerning also.
This comment from the bug report stands out to me. I've had something similar happen to me on a couple of AMD Rome machines (2x 64 cores, 128/256 threads). I'd submit a job to the machine (via
Slurm
), come back and take a look, and there'd be no job-related CPU activity, but some 50-60 GB of RAM would have been occupied. But I presume the issues I had faced probably stem fromSlurm
mis-configurations, network file storage problems, and maybe Linux kernel issues than the processors themselves. (These machines were recent acquisitions.) The problem never occurred on the much older Intel Haswell hardware that co-exists with these machines.If systems integrators don't offer tower-builds, you could try throwing a machine together yourself. You'll be on your own when it comes to troubleshooting over the lifetime of the device though.
Depending on the bandwidth available to get the data to and from your server, a remote/cloud server could be a competitive option.
Also if you have the resources, you could test a few different hardware set ups with some cloud providers before buying your server.
Since you mention cloud compute: do you actually know of any good cost-efficient providers in Europe that don't happen to be Google, Amazon, or one of the other big companies?
I can't speak about cost because this really depends on the specifics of the tasks and the provider. However, besides the commercial entities, there are also a number of academic clouds such as the EMBL-EBI Embassy cloud or the German de.NBI cloud. You can find moe providers by browsing the European Open Science Cloud marketplace.