Was looking through my office window at the data closet and (due to the angle, objects in the way, and field of view) could only see one server's light cluster out of six full racks. And thought it would be nice to scale everything down to 2U. Then daydreamed about a future where a warehouse data center was reduced to a single hypercube sitting alone in the vast darkness.
The racks look silly now. Many data centers aren't scaling up power per rack, so with GPUs there are often only two chassis per rack.
We have that problem ourselves: the facility didn't provision power or cooling for this kind of density, and how do you pipe multiple megawatts into a warehouse in the middle of nowhere?
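Back-of-the-envelope, with assumed numbers (a modern 8-GPU chassis drawing on the order of 10 kW; real figures vary by vendor):

    # Rough rack-power math. CHASSIS_KW is an assumption for
    # illustration, not a measured figure.
    CHASSIS_KW = 10        # assumed draw of one 8-GPU chassis, kW
    CHASSIS_PER_RACK = 2   # per the comment above
    RACKS = 6              # the closet from the top comment

    rack_kw = CHASSIS_KW * CHASSIS_PER_RACK
    print(f"Per rack: {rack_kw} kW")           # well above what many
    print(f"All racks: {rack_kw * RACKS} kW")  # older rooms were built for

Scale that to a warehouse full of racks and you're into megawatts very quickly.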
Only if storage density outpaces storage demand. Eventually, physics will hit a limit.
Physics is already hitting limits. We're already seeing CPUs limited by things like atom size and the speed of light across the width of the chip. Those hard physics limitations are a large part of why quantum computing is being so heavily researched.
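A quick sketch of the speed-of-light point, assuming a 5 GHz clock (on-die signals propagate well below vacuum light speed, so reality is worse):

    # How far can a signal possibly travel in one clock cycle?
    C = 3.0e8         # speed of light in vacuum, m/s
    CLOCK_HZ = 5.0e9  # assumed 5 GHz clock

    cycle_s = 1 / CLOCK_HZ        # 200 ps per cycle
    dist_mm = C * cycle_s * 1000  # upper bound on distance per cycle

    print(f"One cycle: {cycle_s * 1e12:.0f} ps")
    print(f"Light covers at most {dist_mm:.0f} mm per cycle")
    # ~60 mm, already the scale of a large package; real wires are
    # several times slower, so a cross-chip round trip can't
    # complete within a single cycle.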
You think that if we can scale 6 racks down into one cube, someone wouldn't just buy 6 racks of cubes?
They’ll always hunger for more.
I think what will happen is that we'll just start seeing sub-U servers. First will be 0.5U servers, then 0.25U, and eventually 0.1U. By that point, you'll be racking racks of servers, with ten 0.1U servers slotted into a frame that you mount in an open 1U slot.
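The density math, for fun (hypothetical form factors, standard 42U rack):

    # Toy server counts for the hypothetical sub-U form factors above.
    RACK_U = 42  # common full-height rack
    SERVERS_PER_U = {"1U": 1, "0.5U": 2, "0.25U": 4, "0.1U": 10}

    for form_factor, per_u in SERVERS_PER_U.items():
        print(f"{form_factor}: {RACK_U * per_u} servers per rack")
    # 0.1U works out to 420 servers per rack -- a rack of racks.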
Silliness aside, we’re kind of already doing that in some uses, only vertically. Multiple GPUs mounted vertically in an xU harness.
You’ve reinvented blade servers
The future is 12 years ago: HP Moonshot 1500
“The HP Moonshot 1500 System chassis is a proprietary 4.3U chassis that is pretty heavy: 180 lbs or 81.6 Kg. The chassis hosts 45 hot-pluggable Atom S1260 based server nodes”
Yeah, that’s the stuff.
That did not catch on. I had access to one, and the use case and deployment docs were foggy at best.
It made some sense for job separation before virtualization.
Then docker/k8s came along and nuked everything from orbit.
The other use case was for hosting companies. They could sell “5 servers” to one customer and “10 servers” to another and have full CPU/memory isolation. I think that use case still exists, and we see it all over the place at the public cloud hyperscalers.
The Meltdown and Spectre vulnerabilities are a good argument for discrete servers like this: those side channels leak across anything sharing a core or cache, and physical separation sidesteps them entirely. We'll see if a new generation of CPUs makes this more worth it.
128-192 cores on a single EPYC make almost nothing else worth it; the scaling is incredible.
Also, I happen to know they're working on even more hardware isolation mechanisms, similar to SR-IOV but more strictly enforced.
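For reference, today's SR-IOV carve-up is already just a sysfs knob on Linux; a minimal sketch (the NIC name and VF count are placeholders, and it needs root plus an SR-IOV-capable device):

    # Sketch: instantiate SR-IOV virtual functions via the standard
    # Linux sysfs interface. "eth0" and the VF count are placeholders.
    from pathlib import Path

    NIC = "eth0"   # placeholder; must be an SR-IOV-capable NIC
    NUM_VFS = 4    # how many virtual functions to create

    Path(f"/sys/class/net/{NIC}/device/sriov_numvfs").write_text(str(NUM_VFS))
    # Each VF appears as its own PCI function that can be passed
    # through to a guest, with isolation enforced by the device and
    # the IOMMU -- the kind of mechanism being hinted at above.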