Top Posts Tagged with #high performance computing

what does stop being babies about thz computing entail?

[this is what the refrance]

Thomas Sterling has two main bugbears, one is his own baby, the Beowulf Cluster, and the other is the Von Neumann Architecture.

On the easier side, there's the Von Neumann Architecture, which is a way of thinking about how a computer accesses programs and memory, and it basically hasn't been true for a decade now. In theory, a computer has memory which is used for both instructions and data. In real life, modern computers are magic plinko machines of data where things are computed out of order and backwards wherever it might provide a tiny speedup, but before the programmer sees any of this tiny hypervisors grab all this data and reshuffle it so that the external interface to how we program computers hasn't really changed since the days of the PDP-11.

Sterling thinks that if we were willing to totally redesign computers to allow simultaneous manipulation of data and code, moving main memory into the processor, as well as integrating the specialty features modern processors use internally for speedup, plus your own dedicated specialty instructions for high performance single-chip compute, we could dramatically improve performance and chip speed by reducing memory wait time and improving memory bandwidth. Redesigning the entire chip architecture to take advantage of modern compute techniques would allow chips to run much, much faster, but requires you to stop being a baby about it.

Far more wacky though, is his idea to redesign the way HPC systems work. I'm going to reference this older talk a bit because I can't find a free access copy of his newer shit. Way back before the 90's supercomputers existed as hyper-specialized custom built deals by Cray or IBM that cost millions of dollars. Then, Thomas Sterling comes along, and invents the Beowulf cluster. In this, you just buy a bunch of cheap off-the-shelf PC's, network them together with high end consumer network gear, and write some very clever job allocation and parallelization code to break up jobs across the cluster. Bam, you now have something that is as fast as the supercomputers aerospace engineering and oil and gas companies were using around the same time for less than a tenth of the price.

Everyone went, predictably, apeshit over this. They replaced their million-dollar cray system with a thousand e-machines crammed into a closet somewhere and this kicked off the modern field of high performance computing, which is defined by this style of cluster networking.

Over the years, though, we started to specialize. Now you have high performance networking made specifically for datacenters, high reliability and core density servers, and as we've gotten closer and closer to the limits of conventional computing, it's harder to scale out compute without running into awkward limits. If you have too many nodes it becomes difficult to efficiently scale your system, breaking up and reassembling tasks takes too long. Cramming more power into each node makes everything hot, and that means you need more cooling, as seen in the rise of liquid cooling for the datacenter in the past five years. You need faster networks for these computers to interact without waiting on each other, so we get PCIe Over Fabric. And you run into the above Von Neumann bottleneck! So what can you do.

In Sterling's opinion, you redesign the computer into a terahertz-processor memory-on-chip system linked by fibre optics and cooled by a constant liquid-nitrogen liquid-helium loop. This design is completely unlike anything today that exists outside of his company labs, as far as I'm aware. I'm sure there's a few clones in some secret government labs.

The idea here is that you can reduce power consumption by consolidating everything onto one chip, using far fewer duplicate systems that run extremely fast, and linking everything with asynchronous optical networking. Usually that would come with untenable increases in power density, but helium cooling can deal with kilowatt chips just fine, so while your power per-chip goes up, overall system power goes down, as well as reducing datacenter footprint. But it requires you to commit to building an insane helium cooled custom processor with on-chip optics, which requires you to stop being a baby about it.

#ask #anonymous #high performance computing #hpc #computer stuff

•18+ Adults Only

Watch Anya Live on Cam

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.

✓ Live Streaming✓ Interactive Chat✓ Private Shows✓ HD Quality✓ Free Actions

Free to watch • No registration required • HD streaming

New AMD server processors have 768MB (!!!) of L3 cache, you could load an entire CD into that thing! Bonkers.

Memory bandwidth is such a limiting factor in many tasks that this could have a serious performance improvement, I wish I had any excuse at all to benchmark some fluid dynamics/smooth particle hydrodynamics codes on those.

#high performance computing #computer stuff

The Hikari supercomputer at the University of Texas is joint venture between a Japanese company and UT on a solar powered DC-only supercomputing system but I'll be honest the main reason I think about it is because it has a super pretty logo that contains a cowboy.

#cowboyposting #hpc #supercomputing #tacc #hikari #computer stuff #high performance computing

I just now noticed you have a pinup of an Intel Xeon chip on your wall. Two questions: 1) how are you such a nerd, and 2) why the Xeon in particular?

oh that is one of my pieces of memorabilia from when we did the ISC2019 Student Cluster Competition. Which I suppose answers 1), I actively seek out Nerd Experience Points.

As for 2), While we were setting up our compute cluster, we also set up all our booth decorations which included some mylar balloons spelling out "CHPC", a cheap RGB light string we had way back in selection rounds, and a South African Flag.

The actually real official corporate booths had these little fancy poster board signs from Intel showing off that they were using various grades of Xeon processors. As it so happened, our cluster had a dozen 8180 Xeon Platinum processors in it, and so of course we knew we had to have one.

So, we went off, found the Intel table, and asked them if we could pretty please have one for our cluster. They ended up giving us two, and I brought this one home once we were done.

Those 8180's were ten thousand dollars a piece, 28-core monsters, and we almost didn't get them: we actually had to get our supervisors to ask Intel to lend them to us, since our usual sponsors cut our budget and only gave us 18-core Gold tier processors. Bah, only Gold? A meager 3500 dollars each? Ridiculous. We were also totally right in asking for them, in the end we only won by two percentage points, there's no way we would have won on those Golds.

Other memorabilia I have from that competition includes the Geeko plush that the OpenSUSE reps gave us. He hangs from my desk light and serves as a pincushion these days.

#ask #kwarrtz #isc19 #isc19 scc #high performance computing #computer stuff #kalium lore

Applying to a bunch of sysadmin jobs at biotech research entities in part because I want to be biotech/biochem adjacent and in part because it seems like there's some kind of prion disease that makes bioscientists want to build and run their own clusters and fortunately they also have the money to pay for that.

The head of the CS department at my old university bought four top-of-the-line Compute GPU's out of her own pocket to build her own protein simulation cluster. That's like. A moderately fancy car worth of GPU's. At least USD 40k in hardware.

#high performance computing #job search...2!

•18+ Adults Only

Watch Anya Live on Cam

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.

✓ Live Streaming✓ Interactive Chat✓ Private Shows✓ HD Quality✓ Free Actions

Free to watch • No registration required • HD streaming

for some reason, seeing you on the dash talking about high performance computing is emotionally validating. convinces the monkey brain that gaining knowledge of HPC was justified, because at least one other real monkey somewhere in the world gives a shit about it

Oh huge mood, I followed you after I saw your reply about memory bandwidth and the KNL OpenMP upgrade to VASP when (coincidentally also) argumate asked about something. I'd actually done a class presentation on that paper a few months earlier! Exciting to see it in the wild. It's such a cool field of specialization, you get to actually use the full capacity of a computer's performance.

I just started a new job at a Linux Shop and hopefully I will wangle my way into the HPC world once again.

#ask #firebendinglemur #high performance computing #computer stuff #my three possible paths are HPC Embedded or VLSI #they're all fair options but re. the DAG of life i was talking about #yeah

Speaking of the student cluster competition, I'm kinda annoyed that the pandemic means it's all happening online because usually it's a great opportunity to see some fancy new technology and like, a lot of things got shaken up around 2019 that would have been fascinating to see. Now that it's all online everyone just shares one boring Intel cluster in Singapore and don't have to tune power consumption.

The Spanish team always brings ARM clusters and never does well but that's because they're fighting uphill, Nvidia put out CUDA drivers for ARM in 2019 so they had a chance to be competitive now and that would be really cool. The new AMD Epyc line is way more power efficient than Intel's offering and has competitive performance, and when I was there the teams with Epycs really put on a good show. There's been a lot of development with the Vector Processor market too, and Heidelberg was running those. Maybe we'd even see someone running A64FX chips and I want to see what that does to some of the fluid dynamics codes so bad.

Processors were so fuckin boring from like 2010 until 2017

#high performance computing

Japan’s new supercomputer just absolutely wiped the floor with everyone else a few days ago. Fugaku is more than twice as fast as the next place down in a dead sprint and it’s not even burning an exorbitant amount of energy to do that, like the old Sunway supercomputers did.

The 2nd (and 3rd) fastest machines are IBM Power clusters with a huge number of Nvidia V100 GPU’s hanging off of them, like, 3 GPU’s per CPU and 2 CPU’s per rack, it’s extremely dense compute. The 4th fastest is a Sunway machine, which is a wacky architecture that basically only exists in China and they make low-speed processors with just a ludicrous number of cores, something like 260 cores per socket.

The new Fugaku machine is ARM powered, of all things, and its big party trick is that they used the Fujitsu A64FX. In addition to having just Big Chungus Grade Wide Vector Processors, when Fujitsu was making this processor they looked at all the parts on a computer and went “those are all too far away, what if they weren’t.”

The CPU has on-die HBM2 memory rather than traditional DDR4 DIMMs, so it has truly LUDICROUS memory bandwidth: something like 1 teraBYTE per second from memory to CPU. That’s easily 5× what you can find on most server CPU’s. The only things that really can compete are some GPU’s and those NEC vector processors, which use the same memory technology.

They ALSO put a big chunk of the network controller on the die, which means you no longer have to shunt data out to a PCIe card before it can go to network. This is very similar to how Intel’s first round of OmniPath stuff was going to work until they chickened out on strapping network interfaces directly to their CPU. Which is a shame because I thought it was hilarious.

Yeah just plug it in don’t worry it’ll be fine.

Fugaku hits 0.5 exaflops, which, I mean: Exascale has been a buzzword in supercomputers for a while. There’s a bunch of companies with big contracts out to make The First Exascale Computer, and here’s this machine coming out and just, getting halfway there with almost none of the Fancy Heterogeneous Compute Technologies, just raw CPU power and an unorthodox silicon design.

It’s not even /that/ unorthodox, “Strap the memory to the CPU” is not a new idea, GPU’s do it all the time.

I am interested to see if on-die RAM will become more of a thing, not that memory bandwidth is really a huge deal in consumer usage. One of my friends was telling me about how moving OpenFOAM fluid dynamics simulations to GPU’s doesn’t actually yield a usable performance gain sometimes because, while the compute speed goes up, the overhead transferring to and from VRAM is so high it makes it not worth it for very large simulations. Having Real Fast memory attached directly to the CPU might allow those performance gains to happen without any GPU’s at all.

#supercomputing #high performance computing #computer stuff #fugaku

what does stop being babies about thz computing entail?

[this is what the refrance]

Thomas Sterling has two main bugbears, one is his own baby, the Beowulf Cluster, and the other is the Von Neumann Architecture.

#ask #anonymous #high performance computing #hpc #computer stuff

•18+ Adults Only

Watch Anya Live on Cam

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.

✓ Live Streaming✓ Interactive Chat✓ Private Shows✓ HD Quality✓ Free Actions

Free to watch • No registration required • HD streaming

New AMD server processors have 768MB (!!!) of L3 cache, you could load an entire CD into that thing! Bonkers.

#high performance computing #computer stuff

#cowboyposting #hpc #supercomputing #tacc #hikari #computer stuff #high performance computing

I just now noticed you have a pinup of an Intel Xeon chip on your wall. Two questions: 1) how are you such a nerd, and 2) why the Xeon in particular?

oh that is one of my pieces of memorabilia from when we did the ISC2019 Student Cluster Competition. Which I suppose answers 1), I actively seek out Nerd Experience Points.

So, we went off, found the Intel table, and asked them if we could pretty please have one for our cluster. They ended up giving us two, and I brought this one home once we were done.

Other memorabilia I have from that competition includes the Geeko plush that the OpenSUSE reps gave us. He hangs from my desk light and serves as a pincushion these days.

#ask #kwarrtz #isc19 #isc19 scc #high performance computing #computer stuff #kalium lore

#high performance computing #job search...2!

•18+ Adults Only

Watch Anya Live on Cam

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.

✓ Live Streaming✓ Interactive Chat✓ Private Shows✓ HD Quality✓ Free Actions

Free to watch • No registration required • HD streaming

I just started a new job at a Linux Shop and hopefully I will wangle my way into the HPC world once again.

#ask #firebendinglemur #high performance computing #computer stuff #my three possible paths are HPC Embedded or VLSI #they're all fair options but re. the DAG of life i was talking about #yeah

Processors were so fuckin boring from like 2010 until 2017

#high performance computing

Yeah just plug it in don’t worry it’ll be fine.

It’s not even /that/ unorthodox, “Strap the memory to the CPU” is not a new idea, GPU’s do it all the time.

#supercomputing #high performance computing #computer stuff #fugaku

Top Posts Tagged with #high performance computing | Tumlook

Trending Tags

Last Seen Tags

#high performance computing

Trending Tags

Last Seen Tags

#high performance computing