• JasonDJ@lemmy.zip · 4 days ago

      That’s the idea. It’s pretty worthless for home use, but for AI workloads it might make sense; the problem is that it’s not quite scalable yet.

      Essentially, if you’ve got 256 Tb/s going over 200 km of fiber, that means there are quite literally 32,000,000,000 bytes (32 GB) “in flight”, living on the fiber at any given moment.
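
      The arithmetic checks out. A quick back-of-the-envelope in Python (assuming the usual ~200,000 km/s figure for light in silica fiber):

```python
# Bits "in flight" on a fiber = bandwidth x propagation delay.
bandwidth_bps = 256e12          # 256 Tb/s
fiber_length_km = 200
v_glass_km_s = 200_000          # ~2/3 c, typical for silica fiber

delay_s = fiber_length_km / v_glass_km_s        # time one bit spends on the fiber
bytes_in_flight = bandwidth_bps * delay_s / 8   # bits -> bytes

print(f"{delay_s * 1e3:.1f} ms of fiber, {bytes_in_flight / 1e9:.0f} GB in flight")
# -> 1.0 ms of fiber, 32 GB in flight
```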

      So it’s essentially a revolving sushi belt of bytes, roughly as large as London (inside the M25), moving at nearly the speed of light.

      Of course, it doesn’t have to be the size of London. You could wind it into something about the size of a softball. Theoretically.

      It’s a cool idea, and Carmack is no doubt a brilliant man. It seems far-fetched, but it’s kind of been done before: https://en.wikipedia.org/wiki/Core_rope_memory

      • Morphit@feddit.uk · 4 days ago

        It’s an optical delay-line memory. Early computer memories were acoustic in some manner.

        I can’t imagine that the latency of “delay line RAM” would be acceptable to anyone today. Maybe there’s some clever multiplexing that could improve that, but it would surely add more complexity than just making more RAM ICs.

        • tal@lemmy.todayOP · 4 days ago

          Neural net computation has predictable access patterns, so instead of treating the thing as random-access memory, with latency incurred while waiting for the bit you want to come around, I expect you can lay the data out so that the appropriate bit is always arriving just as you need it. I’d guess it also needs the ability to buffer a small amount of data to keep multiple fiber coils in sync despite thermal expansion.
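
          That scheduling trick can be sketched as a toy (all names here are made up for illustration): model the loop as a circular buffer that advances one slot per tick, and write the data in exactly the order the computation will consume it, so every read is a zero-wait hit.

```python
from collections import deque

class DelayLineLoop:
    """Toy delay-line memory: only the slot under the read head is
    visible at any instant; the whole loop rotates one slot per tick."""
    def __init__(self, data):
        self.loop = deque(data)

    def tick(self):
        self.loop.rotate(-1)  # circulate: front value re-enters at the back

    @property
    def visible(self):
        return self.loop[0]

# Lay the data out in the exact order the computation will ask for it...
access_order = ["w3", "w0", "w2", "w1"]
line = DelayLineLoop(access_order)

# ...and every read hits the moment its value comes around: no stalls.
served = []
for _ in access_order:
    served.append(line.visible)
    line.tick()

print(served)  # -> ['w3', 'w0', 'w2', 'w1']
```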

          The Hacker’s Jargon File has an anecdote about doing something akin to that with drum memory, “The Story of Mel”.

          http://www.catb.org/~esr/jargon/html/story-of-mel.html

  • ms.lane@lemmy.world · 4 days ago

    It’s an interesting idea, but what’s the floor size for a pair of 200TB/s fibre transceivers vs. 32GB of HBM?

    If it’s not significantly less, this doesn’t seem like it’d be particularly helpful outside of the 200 TB/s streaming use case.

  • brucethemoose@lemmy.world · 4 days ago

    The issue with AI is “now”

    Can they power them with solar? Nuclear? Hell, even a natural gas plant? Nope, the data centers need the power right this second, so they get gas turbines on site. Same with cooling; evaporative is just the quickest and cheapest to set up.

    Same with its architecture. There’s no time to fix temperature/sampling issues, no time to try bitnet or any of a bazillion interesting papers that came out. A shippable product (model) is needed yesterday; just scale up what we have. “Fail” a single experiment? Your team is fired, which is exactly what happened at Meta.

    Everything has to happen right now because of corporate FOMO. So, while this is an interesting musing and maybe Intel or someone will play with it, the actual AI labs could not care less because they can’t get it immediately.

  • tal@lemmy.todayOP · 4 days ago

    Note that this is from last month, though I haven’t seen it submitted.

      • nova_ad_vitum@lemmy.ca · 4 days ago

        The lack of investment in more production capacity for RAM is based on a roughly 3-year horizon for this insane extra AI demand.

        Creating workable consumer-grade alternatives with delay line memory of all things would take longer than that, and the market would collapse the moment AI demand for RAM dried up. This is one of those things that is theoretically possible but due to both technology and market conditions will absolutely not be a thing.

        • tal@lemmy.todayOP · 4 days ago

          Creating workable consumer-grade alternatives

          I think that this is intended not to replace DIMMs in PCs, but to replace HBM for AI use. If you’re doing neural net computation, you have very predictable access patterns, so you can store your edge weights such that the desired data is showing up at just the right time.

  • eleitl@lemmy.zip · 4 days ago

    They never mention the word latency even once. This is a delay-line SAM (sequential-access memory), and the speed of light in glass is some 200,000 km/s, so a 200 km loop takes about a millisecond to come around. That’s hard drive latency.
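
    The numbers back that up. For a random access you have to wait for the wanted bit to come around (a sketch, using the same ~200,000 km/s assumption):

```python
loop_km = 200
v_glass_km_s = 200_000                 # ~speed of light in silica fiber

worst_case_s = loop_km / v_glass_km_s  # wanted bit just went past: wait a full loop
avg_case_s = worst_case_s / 2          # on average, wait half a loop

print(f"worst {worst_case_s * 1e3:.1f} ms, average {avg_case_s * 1e3:.2f} ms")
# -> worst 1.0 ms, average 0.50 ms
# DRAM access is ~100 ns; a 7200 rpm disk averages ~4.2 ms rotational latency,
# so sub-millisecond random access really is spinning-disk territory.
```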

  • geekwithsoul@piefed.social · 4 days ago

    I don’t pretend to understand how this would actually work, but wouldn’t this essentially be like token ring networking but used as memory?

    • tal@lemmy.todayOP · 4 days ago

      A little bit, but normally Token Ring didn’t just keep data running around in a circle indefinitely; Token Ring works more like a roundabout, where data enters the ring at one computer and exits at another device. Without looking, I suspect Token Ring had some mechanism, akin to the TTL (time-to-live) field in Internet Protocol packets, to keep a mis-addressed frame from circling forever.

      Also, I’m assuming that an implementation of Carmack’s idea would have only one…I don’t know the right term, might be “repeater”. You need some device to receive the data and retransmit it, to keep the signal strong and keep it from spreading out. You wouldn’t want a ton of those, because they’d add cost. On Token Ring, you’d have a bunch of transceivers, to have a bunch of “exits”, since the whole point is to move data from one device to another.