Or we could just like…not do the terrible thing that is bad in everyway.
and then the datacenters adopt that tech and hoard it all too
That’s the idea. It’s pretty worthless for home use, but for AI workloads, it might make sense, the problem is that it’s not quite scalable yet.
Essentially, if you’ve got 256Tb/s going over 200km of fiber, that means that there’s quite literally 32,000,000,000 bytes (32GB) “in flight”, living on the fiber at any period of time.
So it’s essentially it’s a revolving sushi belt of bytes, roughly as large as London (inside M25), moving at nearly the speed of light.
Of course, it doesn’t have to be the size of London. You could wind it into something about the size of a softball. Theoretically.
It’s a cool idea and Carmack is no doubt a brilliant man. It seems far fetched but it’s kind of been done before… https://en.wikipedia.org/wiki/Core_rope_memory
It’s an optical delay-line memory. Early computer memories were acoustic in some manner.
I can’t imagine that the latency of ‘delay line RAM’ would be acceptable to anyone today. Maybe there’s some clever multiplexing that could improve that but it would surely add more complexity that just making more RAM ICs.
Neural net computation has predictable access patterns, so instead of using the thing as a random access memory with latency incurred by waiting for the bit you want to get around to you, I expect that you can load the memory appropriately such that you always have the appropriate bit showing up at the time you need it. I’d guess that it probably needs something like the ability to buffer a small amount of data to get and keep multiple fiber coils in synch due to thermal expansion.
The Hacker’s Jargon File has an anecdote about doing something akin to that with drum memory, “The Story of Mel”.
Also optical fiber is used a lot on battlefields now. It just remains there. There’s a lot to be assembled.
Then all the necessary mineral prices will shoot up 3,648%.
Is that a decimal comma or a digit separator comma
Digit separator comma
I’m pretty sure 200km of fibre isn’t going to be cheap either
Fibre is just strands of extruded glass; one of the most common substances on earth.
Sure beats the blood minerals needed for memory, and to scale up, you just extrude longer strands.
there is a bit of surplus of fibre wire in Ukraine, i hear… /s
I’m not sure which job sounds less appealing; collecting it or splicing it
could be cheaper than enterprise grade DIMMs.
Probably cheaper than tens of thousands of satellites.
Delay line memory in gigabytes? Bold indeed.
It’s an interesting idea, but what’s the floor size for a pair of 200TB/s fibre transceivers vs. 32GB of HBM?
It’s it’s not significantly less, this doesn’t seem like it’d be particularly helpful outside the 200TB/s of streaming data.
I’m assuming that the point is the bandwidth.
goes looking for HBM bandwidth
https://en.wikipedia.org/wiki/High_Bandwidth_Memory
It says that HBM 4, which came out one year ago, can do 2 TiB/s.
The issue with AI is “now”
Can they power with solar? Nuclear? Hell, even a natural gas plant? Nope, the data centers need the power right this second, so they get gas turbines on site. Same with cooling; evaporative is just the quickest and cheapest to set up.
Same with its architecture. There’s no time to fix temperature/sampling issues, no time to try bitnet or any of a bazillion interesting papers that came out. A shippable product (model) is needed yesterday; just scale up what we have. “Fail” a single experiment? Your team is fired, which is exactly what happened at Meta.
Everything has to happen right now because of corporate FOMO. So, while this is an interesting musing and maybe Intel or someone will play with it, the actual AI labs could not care less because they can’t get it immediately.
Note that this is from last month, though I haven’t seen it submitted.
This is… incredibly stupid. This man has done so many drugs he no longer realizes how computers or electricity works.
ETA: https://www.reddit.com/r/answers/comments/23nd6a/i_remember_in_the_90s_illegal_or_black_box_cable/
The lack of investment in more production capacity for RAM is based on a roughly 3-year horizon for this insane extra AI demand.
Creating workable consumer-grade alternatives with delay line memory of all things would take longer than that, and the market would collapse the moment AI demand for RAM dried up. This is one of those things that is theoretically possible but due to both technology and market conditions will absolutely not be a thing.
Creating workable consumer-grade alternatives
I think that this is intended not to replace DIMMs in PCs, but to replace HBM for AI use. If you’re doing neural net computation, you have very predictable access patterns, so you can store your edge weights such that the desired data is showing up at just the right time.
It’s not that we don’t know what it is, it, again, is just INCREDIBLY FUCKING STUPID.
John Karmack is on drugs?
deleted by creator
Save us John
Were you talking to John Carmack or John Connor?
They never mention the word latency even once. It’s a delay line SAM and speed of light in glass is some 200000 km/s. This is hard drive latency.
Random Access how?
If you read the article, it’s sequential access but that’s fine for AI use.
RAM more like SAM
Yes, but for a niche use case where SAM is fine, not for consumers
I don’t pretend to understand how this would actually work, but wouldn’t this essentially be like token ring networking but used as memory?
It’s delay line memory. It was common back in the days of vacuum tube computers.
A little bit, but normally Token Ring didn’t just keep data running around in a circle on and on — Token Ring works more like a roundabout, where you enter at a given computer on the ring and then exit at another device. Without looking, I suspect that, like Internet Protocol packets, Token Ring probably had a TTL (time-to-live) field in its frames to keep a mis-addressed packet from forever running around in circles.
Also, I’m assuming that an implementation of Carmack’s idea would have only one…I don’t know the right term, might be “repeater”. You need to have some device to receive the data and then retransmit them to keep the signal strong and from spreading out. You wouldn’t want to have a ton of those, because otherwise it’d add cost. On Token Ring, you’d have a bunch of transceivers, to have a bunch of “exits”, since the whole point is to move data from one device to another.












