HeliumHydride 18 hours ago

https://scholar.harvard.edu/files/mickens/files/theslowwinte...

"Unfortunately for John, the branches made a pact with Satan and quantum mechanics [...] In exchange for their last remaining bits of entropy, the branches cast evil spells on future generations of processors. Those evil spells had names like “scaling-induced voltage leaks” and “increasing levels of waste heat” [...] the branches, those vanquished foes from long ago, would have the last laugh."

  • Hackbraten 18 hours ago

    I love James Mickens!

    https://www.usenix.org/system/files/1401_08-12_mickens.pdf

    > The Mossad is not intimidated by the fact that you employ https://. If the Mossad wants your data, they’re going to use a drone to replace your cellphone with a piece of uranium that’s shaped like a cellphone, and when you die of tumors filled with tumors, […] they’re going to buy all of your stuff at your estate sale so that they can directly look at the photos of your vacation instead of reading your insipid emails about them.

    • wood_spirit 16 hours ago

      So this is where they got the pager and walkie talkie ideas from

      • genewitch 4 hours ago

        You know, I didn't really think of this till your comment: this was a vast conspiracy spanning years. You always hear, when discussions of "conspiracies" happen, things like "it would involve too many people, too many moving parts" like opsec would be impossible. And then you have the pagers.

        Kinda like the old chestnut that rich people are only rich on paper and then, Musk buys twitter. Not tesla, or some DBA, Musk.

        This decade might actually be the season of reveal.

        • angra_mainyu 3 hours ago

          I think this is a human thing, being very myopic given our short lifespans, we forget even relatively recent things.

          The Cold War for example was full of these intricate, complex and stunning feats of spycraft that they'd pull off on each other.

  • btown 10 hours ago

    This is absolute gold!

    > “Making processors faster is increasingly difficult,” John thought, “but maybe people won’t notice if I give them more processors.” This, of course, was a variant of the notorious Zubotov Gambit, named after the Soviet-era car manufacturer who abandoned its attempts to make its cars not explode, and instead offered customers two Zubotovs for the price of one, under the assumption that having two occasionally combustible items will distract you from the fact that both items are still occasionally combustible.

    > Formerly the life of the party, John now resembled the scraggly, one-eyed wizard in a fantasy novel who constantly warns the protagonist about the variety of things that can lead to monocular bescragglement.

    And in 2013 the below would have been correct, but we live in a very different world now:

    > John’s massive parallelism strategy assumed that lay people use their computers to simulate hurricanes, decode monkey genomes, and otherwise multiply vast, unfathomably dimensioned matrices in a desperate attempt to unlock eigenvectors whose desolate grandeur could only be imagined by Edgar Allen Poe. Of course, lay people do not actually spend their time trying to invert massive hash values while rendering nine copies of the Avatar planet in 1080p.

    He wasn't too far off about the monkeys, though...

  • bee_rider 17 hours ago

    The bit about vast matrices shows some silver lining though; it turns out John’s little brother figured out how to teach those matrices to talk like a person.

    • yvdriess 15 hours ago

      Yes but those transistors moved to greener pastures.

progval 19 hours ago
  • ncr100 19 hours ago

    Impact illustration:

    > [...] the contents of the entire memory to be read over time, explains Rüegge. “We can trigger the error repeatedly and achieve a readout speed of over 5000 bytes per second.” In the event of an attack, therefore, it is only a matter of time before the information in the entire CPU memory falls into the wrong hands.

    • formerly_proven 19 hours ago

      Prepare for another dive maneuver in the benchmarks department I guess.

      • tsukikage 13 hours ago

        We need software and hardware to cooperate on this. Specifically, threads from different security contexts shouldn't get assigned to the same core. If we guarantee this, the fences/flushes/other clearing of shared state can be limited to kernel calls and process lifetime events, leaving all the benefits of caching and speculative execution on the table for things actually doing heavy lifting without worrying about side channel leaks.

        • tankenmate 5 hours ago

          I get you, but devs struggle to configure nginx to serve their overflowing cauldrons of 3rd party npm modules of witches incantations. Getting them securely design and develop security labelled cgroup based micro (nano?) compute services for inferencing text of various security levels is beyond even 95% of coders. I'd posit that it would be a herculean effort even for 1% devs.

          Just fix the processors?

          • tsukikage 3 hours ago

            It's not a "just" if the fix cripples performance; it's a tradeoff. It is forced to hurt everything everywhere because the processor alone has no mechanism to determine when the mitigation is actually required and when it is not. It is 2025 and security is part of our world; we need to bake it right into how we think about processor/software interaction instead of attempting to bolt it on after the fact. We learned that lesson for internet facing software decades ago. It's about time we learned it here as well.

            • tankenmate 30 minutes ago

              Is the juice worth the squeeze? Not everything needs Orange Book (DoD 5200.28-STD) Class B1 systems.

        • immibis 4 hours ago

          how will this prevent JavaScript from leaking my password manager database?

      • cenamus 19 hours ago

        And if not, why did they introduce severe bugs for a tiny performance improvement?

        • bloppe 19 hours ago

          It's not tiny. Speculative execution usually makes code run 10-50% faster, depending on how many branches there are

          • bee_rider 18 hours ago

            Yeah… folks who think this is just some easy to avoid thing should go look around and find the processor without branch prediction that they want to use.

            On the bright side, they will get to enjoy a much better music scene, because they’ll be visiting the 90’s.

            • wbl 9 hours ago

              IBM Stretch had branch prediction. Pentium in the early 1990s had it. It's a huge win with any pipelining.

          • titzer 15 hours ago

            That's a vast underestimate. Putting in lfence before every branch is on the order of 10X slowdown.

            • grumbelbart2 4 hours ago

              There is of course a slight chicken-egg-thing here: If there was no (dynamic) branch prediction, we (as in compilers) would emit different code that is faster for non-predicting CPUs (and presumably slower for predicting CPUs). That would mitigate a bit of that 10x.

          • autoexec 13 hours ago

            How much improvement would there still be if we weren't so lazy when it comes to writing software. If we were working to get as much performance out of the machines as possible and avoiding useless bloat instead of just counting on the hardware to be "good enough" to handle the slowness with some grace.

        • umanwizard 14 hours ago

          A modern processor pipeline is dozens of cycles deep. Without branch prediction, we would need to know the next instruction at all times before beginning to fetch it. So we couldn’t begin fetching anything until the current instruction is decoded and we know it’s not a branch or jump. Even more seriously, if it is a branch, we would need to stall the pipeline and not do anything until the instruction finishes executing and we know whether it’s taken or not (possibly dozens of cycles later, or hundreds if it depends on a memory access). Stalling for so many cycles on every branch is totally incompatible with any kind of modern performance. If you want a processor that works this way, buy a microcontroller.

          • superblas 12 hours ago

            > If you want a processor that works this way, buy a microcontroller.

            The ARM Cortex-R5F and Cortex-M7, to name a few, have branch predictors as well, for what it’s worth ;)

          • jeffbee 10 hours ago

            You can still have a static branch predictor. That has surprisingly good coverage. I'm not saying this is a great idea, just pointing it out.

  • trebligdivad 18 hours ago

    Thanks! It would be great if someone could update the title URL to that blog post; the press release is worse than useless.

    • dang 18 hours ago
      • alberto-m 3 hours ago

        I don't know guys. Yes, the direct link saves a click, but the original title was more informative for the casual reader. I'm not a professional karma farmer and in dang's shoes I would have made the same adjustment, but I can't deny that seeing the upvote rate going down by 75% after the change was a little harsh.

eigenform 15 hours ago

Great read! Some boiled-down takeaways:

- Predictor updates may be deferred until sometime after a branch retires. Makes sense, otherwise I guess you'd expect that branches would take longer to retire!

- Dispatch-serializing instructions don't stall the pipeline for pending updates to predictor state. Also makes sense, considering you've already made a distinction between "committing the result of the branch instruction" and "committing the result of the prediction".

- Privilege-changing instructions don't stall the pipeline for pending updates either. Also makes sense, but only if you can guarantee that the privilege level is consistent between making/committing a prediction. Otherwise, you might be creating a situation where predictions generated by code in one privilege level may be committed to state used in a different one?

Maybe this is hard because "current privilege level" is not a single unambiguous thing in the pipeline?

mettamage 19 hours ago

Good to see Kaveh Razavi, he used to teach at my uni in the Vrije Universiteit in Amsterdam :) The course Hardware Security was crazy cool and delved into stuff lijke this.

  • markus_zhang 18 hours ago

    I checked out this course (and another one from Vrije about malware) a couple of years ago, back then there was very little public info about the courses.

    Do you know if there is any official recording or notes online?

    Thanks in advance.

    • thijsr 16 hours ago

      As far as I am aware, the course material is not public. Practical assignments are an integral part of the courses given by the VUSEC group, and unfortunately those are difficult to do remotely without the course infrastructure.

      The Binary and Malware Analysis course that you mentioned builds on top of the book "Practical Binary Analysis" by Dennis Andriesse, so you could grab a copy of that if you are interested.

    • mettamage 15 hours ago

      No, but last time I checked you can be a contracted student for 1200 euro's.

      If I knew what I was getting into at the time, I'd do it. I did pay for extra, but in my case it was the low Dutch rate, so for me it was 400 euro's to follow hardware security, since I already graduated.

      But I can give a rough outline of what they taught. It has been years ago but here you go.

      Hardware security:

      * Flush/Reload

      * Cache eviction

      * Spectre

      * Rowhammer

      * Implement research paper

      * Read all kinds of research papers of our choosing (just use VUSEC as your seed and you'll be good to go)

      Binary & Malware Analysis:

      * Using IDA Pro to find the exact assembly line where the unpacker software we had to analyze unpacked its software fully into memory. Also we had to disable GDB debug protections. Something to do with ptrace and nopping some instructions out, if I recall correctly (look, I only low level programmed in my security courses and it was years ago - I'm a bit flabbergasted I remember the rough course outlines relatively well).

      * Being able to dump the unpacked binary program from memory onto disk. Understanding page alignment was rough. Because even if you got it, there were a few gotcha's. I've looked at so many hexdumps it was insane.

      * Taint analysis: watching user input "taint" other variables

      * Instrumenting a binary with Intel PIN

      * Cracking some program with Triton. I think Triton helped to instrument your binary with the help of Intel PIN by putting certain things (like xor's) into an SMT equation or something and you had this SMT/Z3 solver thingy and then you cracked it. I don't remember got a 6 out of 10 for this assignment, had a hard time cracking the real thing.

      Computer & Network Security:

      * Web securtiy: think XSS, CSRF, SQLi and reflected SQLi

      * Application security: see binary and malware analysis

      * Network security: we had to create our own packet sniffer and we enacted a Kevin Mitnick attack (it's an old school one) where we had to spoof our IP addresses, figure out the algorithm to create TCP packet numbers - all in the blind without feedback. Kevin in '97 I believe attacked the San Diego super computer (might be wrong about the details here). He noticed that the super computer S trusted a specific computer T. So the assignment was to spoof the address of T and pretend we were sending packets from that location. I think... writing this packet sniffer was my first C program. My prof. thought I was crazy that this was my first time writing C. I was, I also had 80 hours of time and motivation per week. So that helped.

      * Finding vulnerabilities in C programs. I remember: stack overflows, heap overflows and format strings bugs.

      -----

      For binary & malware analsys + computer & network security I highly recommend hackthebox.eu

      For hardware security, I haven't seen an alternative. To be fair, I'm not looking. I like to dive deep into security for a few months out of the year and then I can't stand it for a while.

      • markus_zhang 12 hours ago

        Wow, thanks a lot for the detailed answer. I'm going to see if I can register as a contracted student, but they probably do not accept remote students.

        BTW I can see you were very motivated back then. It got to be pretty steep but you managed to break through. Congrats!

        • mettamage 4 minutes ago

          Remote won't work yea. It has to be in-person.

          > BTW I can see you were very motivated back then. It got to be pretty steep but you managed to break through. Congrats!

          Thanks! Yea I was :)

rakingleaves 17 hours ago

Anyone know how this relates to the Training Solo attack that was just disclosed? https://www.vusec.net/projects/training-solo/

  • hashstring 4 hours ago

    Both exploit Spectre V2, but in different ways. My takeaway:

    Training Solo: - Enter the kernel (and switch privilege level) and “self train” to mispredict branches to a disclosure gadget, leak memory.

    Branch predictor race conditions: - Enter the kernel while your trained branch predictor updates are still in flight, causing the updates to be associated with the wrong privilege level. Again, use this to redirect a branch in the kernel to a disclosure gadget, leak memory.

rini17 19 hours ago

If CPU brach predictor had bits of information readily available to check buffer boundaries and privilege level of the code, all this would be much easier to prevent. But apparently that will only happen when we pry out the void* from the cold C programmers' hands and start enriching our pointers with vital information.

  • ajross 18 hours ago

    I don't see how you think that will help? It's not about software abstraction, it's about hardware. Changing the "pointer" does nothing to the transistors.

    Doing what you want would essentially require a hardware architecture where every load/store has to go through some kind of "augmented address" that stores boundary information.

    Which is to say, you're asking for 80286 segmentation. We had that, it didn't do what you wanted. And the reason is that those segment descriptors need to be loaded by software that doesn't mess things up. And it doesn't, it's "just a pointer" to software and amenable to the same mistakes.

    • rini17 16 hours ago

      286 far pointers were used sparingly, to save precious memory. Now we don't have any such problem and there are still unused bits in pointers even on largest 64 bit systems that might be repurposed perhaps. With virtual memory, there are all kinds of hardware supported address mappings and translations and IOMMU already so adding more transistors isn't an issue. The issue is purely cultural as you have just shown, people can't imagine it.

      • ajross 16 hours ago

        That's misunderstanding the hardware. All memory access on a 286 was through a segment descriptor, every access done in protected mode was checked against the segment limit. Every single one.

        A "far pointer" was, again, a *software* concept where you could tell the compiler that this particular pointer needed to use a different descriptor than the one the toolchain assumed (by convention!) was loaded in DS or SS.

    • nine_k 17 hours ago

      Why stop at 80286, consider going back to the ideas of iAPX432, but with modern silicon tech and the ability to spend a few million transistors here and there.

      (CHERI already exists on ARM and RISC-V though.)

      • ajross 12 hours ago

        FWIW, the 286 launched like four months after the 432.

    • nottorp 18 hours ago

      I suppose a CPU that only runs Rust p-code is what the OP is dreaming about...

      • ajross 18 hours ago

        Generated rust "p-code" would presumably be isomorphic to LLVM IR, which doesn't have this behavior either and would be subject to the same exploits.

        Again, it's just not a software problem. In the real world we have hardware that exposes "memory" to running instructions as a linear array of numbers with sequential addresses. As long as that's how it works, you can demand an out of bounds address (because the "bounds" are a semantic thing and not a hardware thing).

        It is possible to change that basic design principle (again, x86 segmentation being a good example), but it's a whole lot more involved than just "Rust Will Fix All The Things".

        • nottorp 18 hours ago

          Holy... I need to stop making fun of Rust (*). I keep getting misinterpreted.

          (*) ... although I don't think I can abstain ...

  • ActorNightly 17 hours ago

    Or people could just understand the scope of the issue better, and realize that just because something has a vulnerability doesn't mean there is a direct line to an attack.

    In the case of speculative execution, you need an insane amount of prep to use that exploit to actually do something. The only real way this could ever be used is if you have direct access to the computer where you can run low level code. Its not like you can write JS code with this that runs on browsers that lets you leak arbitrary secrets.

    And in the case of systems that are valuable enough to exploit with a risk of a dedicated private or state funded group doing the necessary research and targeting, there should be a system that doesn't allow unauthorized arbitrary code to run in the first place.

    I personally disable all the mitigations because performance boost is actually noticeable.

    • vlovich123 16 hours ago

      > Its not like you can write JS code with this that runs on browsers that lets you leak arbitrary secrets

      That's precisely what Spectre and Meltdown were though. It's unclear whether this attack would work in modern browsers but they did reenable SharedArrayBuffer & it's unclear if the existing mitigations for Spectre/Meltdown stimy this attack.

      > I personally disable all the mitigations because performance boost is actually noticeable.

      Congratulations, you are probably susceptible to JS code reading crypto keys on your machine.

      • nine_k 14 hours ago

        Disabling some mitigations makes sense for an internal box that does not run arbitrary code from the internet, like a build server, or a load balancer, or maybe even a stateless API-serving box, as long as it's not a VM on a physical machine shared with other tenants.

        • anyfoo 14 hours ago

          You run "arbitrary code from the internet" as soon as you use a web browser with JS enabled.

          • nine_k 13 hours ago

            This is exactly what you won't do on most of your infrastructure boxes, would you? If you can reasonably trust all the software on the whole box, many mitigations that protect against effects of running adversary code on your machine become superfluous.

            OTOH if an adversary gets a low-privilege RCE on your box, exploiting something like Spectre or RowHammer could help elevate the privilege level, and more easily mount an attack on your other infrastructure.

            • anyfoo 13 hours ago

              Yeah, as stated in a sibling answer, I misread your comment a little bit. It's true, on at least some classes of infrastructure boxes, you more or less "own all that is on the machine" anyway.

              But also note my caveat about database servers, for example. A database server shared between accounts of different trust levels will be affected, if the database supports stored procedures for example. Basically, as soon as there's anything on the box that not all users of it should be able to access anyway, you'll have to be very, very careful.

            • vlovich123 3 hours ago

              What infrastructure box are you running that is running 100% all your code? Unless you ignore supply chain attacks, you’ve always got exposure.

          • dwattttt 13 hours ago

            Or with JS disabled. HTML isn't as expressive, but it's still "arbitrary code from the internet"

            • vlovich123 3 hours ago

              HTML doesn’t have the potential to deliver Spectre like attacks because:

              1. No timers - timers are generally a required gadget & often they need to be hires or building a suitable timing gadget gets harder & your bandwidth of the attack goes down

              2. No loops - you have to do timing stuff in a loop to exploit bugs in the predictor.

            • anyfoo 13 hours ago

              There is a difference. JS is turing complete, pure HTML is far from (as far as I'm aware). So HTML might (!) well be restricted enough to not be able to carry out such an attack.

              But I'd never state to definitively, as I don't know enough about what HTML without JS can do these days. For all I know there's a turing tarpit in there somewhere...

              • nine_k 13 hours ago

                CSS3 is Turing-complete, but creating an exploit using just it would be... quite a feat.

                With JS or WASM, it's much more straightforward.

          • baobun 13 hours ago

            Which you wouldn't do on an internal load balancer or database server, right?

            • anyfoo 13 hours ago

              You are right, I sort of misread the statement I was replying to, but also wanted to reinforce that the large class of personal desktop machines is still very much affected, even if you "think" that you don't run "arbitrary code" on your machine.

              By the way, you have to be careful on your database server to not actually run arbitrary code as well. If your database supports stored procedures (think PL/SQL), that qualifies, if the clients that are able to create the stored procedures are not supposed to be able to access all data on that server anyway.

              • baobun 11 hours ago

                Oh yeah. Supply-chain risk is still a thing too and defense-in-depth is not a bad strategy.

                Physical isolation simplifies a lot of this. This class of attacks isn't (as) relevant for single-tenant single-workload dedicated machines.

                • vlovich123 3 hours ago

                  Based on this thread, I think people badly misjudge what “single-tenant” means in the context of susceptibility to exploits.

                  • baobun 2 hours ago

                    Mind elaborating?

      • gblargg 6 hours ago

        Who these days would trust crypto keys on their machine, given the many hardware wallets available?

        • vlovich123 3 hours ago

          Where do you think the crypto keys for the TLS connection securing your HTTPS browsing are stored? Although from what you said I’m now thinking you’re referring to cryptocurrency & thus aren’t on the same wavelength of the discussion here. Crypto keys —> cryptography keys, not cryptocurrency keys.

    • anyfoo 14 hours ago

      > Or people could just understand the scope of the issue better

      Do you understand the scope of the issue? Do you know that this couldn't personally affect you in a dragnet (so, not targeted, but spread out, think opportunistic ransomware) attack?

      Because this statement of yours:

      > Its not like you can write JS code with this that runs on browsers that lets you leak arbitrary secrets.

      was not true for Spectre. The original spectre paper notoriously mentions JS as an attack vector.

      If you truly disable all mitigations (assuming CPU and OS allow you to do so), you will reopen that hole.

      So:

      > The only real way this could ever be used is if you have direct access to the computer where you can run low level code.

      I'm a low level kernel engineer, and I don't know this to be true in the general case. JITs, i.e. the JavaScript ones, also generate "low level code". How do you know of this not being sufficient?

rtkwe 19 hours ago

I wonder if there's similar gaps in AMD hardware? Seems like speculative execution is simply an extremely hard to patch vulnerability in a share processor space so I wonder how AMD has avoided it.

  • pdpi 19 hours ago

    The short of it is that AMD haven’t “avoided it”. Speculative execution side channels aren’t one vulnerability but rather a whole family of vulnerabilities. This particular one is (apparently) Intel-only, same as Meltdown was, but AMD was also vulnerable to the original Spectre.

  • bee_rider 19 hours ago

    Pedantically, speculative execution isn’t the vulnerability, it is a necessary mechanism for every high-performance CPU nowadays (where “nowadays” started, like, around the turn of the century). However, bugs and vulnerabilities in speculative execution engines are very widespread because they are complicated.

    There are probably similar bugs in AMD and ARM, I mean how long did these bugs sit undiscovered in Intel, right?

    Unfortunately the only real fix is to recognize that you can’t isolate code running on a modern system, which would be devastating to some really rich companies’ business models.

    • fc417fc802 13 hours ago

      > the only real fix is to recognize that you can’t isolate code running on a modern system

      Does pinning VMs to hardware cores (including any SMT'd multiples) fix this particular instance? My understanding was that doing that addressed many of the modern side channel exploits.

      Of course that's not ideal, but it's not too bad in an era where the core count of high end CPUs continues to creep upwards.

      • vlovich123 3 hours ago

        If this allows reading kernel memory, then your VMs could read the host kernel anyway & any security keys contained therein (& that’s assuming pinning cores limits the exploit to memory being accessed by other CPUs on the same core which generally has not been true of side channel attacks as far as I’m aware).

      • everfrustrated 5 hours ago

        Possibly not. Seems like this exploit allows walking memory which would be shared?

  • quotemstr 18 hours ago

    The solution to this particular vulnerability is intuitive to me: snapshot the current privilege level when we enqueue a branch predictor update and carry that snapshot along with the update itself as it flows through the processor's internal buffers. Same problem you might have in software and the same solution, yes?

    • wbl 15 hours ago

      That actually doesn't work. The evaluation of the branch condition may be at some point far away from where the privilege update is recognized and executed. There is no current state to update, it's only recognized in retrospect what the state was. And carrying along data is pricey in a CPU: the instruction pointer isn't even available because of this.

      You could say we only update the predictor at retirement to solve this. But that can get a little dicy also: the retirement queue would have to track this locally and retirement frees up registers, better be sure it's not the one your jump needs to read. Doable but slightly harder than you might think.

smartmic 18 hours ago

> Closing these sorts of gaps requires a special update to the processor’s microcode. This can be done via a BIOS or operating system update and should therefore be installed on our PCs in one of the latest cumulative updates from Windows.

Why mention only Windows, what about Linux users?

  • matja 18 hours ago

    The Linux kernel has had microcode loading support (`CONFIG_MICROCODE` / `CONFIG_MICROCODE_INTEL`) but many years, but it does require that Intel release the microcode files necessary for distribution maintainers to update the packages, then it should be included in a system update.

  • ajross 18 hours ago

    Intel distributes microcode updates for Linux here: https://github.com/intel/Intel-Linux-Processor-Microcode-Dat... , and the distro are all set up to pull from there and distribute automatically.

    Not expert enough to know what to look for to see if these particular mitigations are present yet.

    • brokenmachine 7 hours ago

      On an Ubuntu 24.04.2 machine:

         dpkg -l | grep microcode
      
         ii  amd64-microcode 3.20231019.1ubuntu2.1 amd64 Processor microcode firmware for AMD CPUs
         ii  intel-microcode 3.20250211.0ubuntu0.24.04.1 amd64 Processor microcode firmware for Intel CPUs
         ii  iucode-tool 2.3.1-3build1 amd64 Intel processor microcode tool
yonatan8070 18 hours ago

Just to make sure I got this right, at this point in time there are patches out for all major operating systems that can mitigate this/apply relevant microcode to mitigate it?

  • dboreham 12 hours ago

    Yes. Embargo date May 13 (today).

margorczynski 19 hours ago

I wonder if there's any way to recover for Intel. They don't have anything worthwhile on the market, R&D takes a lot of time and their foundries are a constant source of losses as they're inferior compared to the competition.

On top of that x86 seems to be pushed out more and more by ARM hardware and now increasingly RISC-V from China. But of course there's the US chip angle - will the US, especially after the problems during Covid, let a key manufacturer like Intel bite the dust?

  • chneu 18 hours ago

    Intel really isn't in as much trouble as tech blogs like to act.

    It's not great but lol the sensationalism is hilarious.

    Remember, gamers only make up a few percentage of users for what Intel makes. But that's what you hear about the most. One or two data center orders are larger than all the gaming cpus Intel will sell in a year. And Intel is still doing fine in the data center market.

    Add in that Intel still dominates the business laptop market which is, again, larger than the gamer market by a pretty wide margin.

    • WaxProlix 18 hours ago

      You're right about gamers, but other verticals are looking bad for Intel, too.

      The two areas you mention (data center, integrated OEM/mobile) are the two that are most supply chain and business-lead dependent. They center around reliable deliveries of capable products at scale, hardware certifications, IT department training, and organizational bureaucracy that Intel has had captured for a long time.

      But!

      Data center specifically is getting hit hard from AMD in the x86 world and ARM on the other side. AWS's move to Graviton alone represents a massive dip in Intel market share, and it's not the only game in town.

      Apple is continuing to succeed in the professional workspace, and AMD's share of laptop and OEM contracts just keeps going up. Once an IT department or their chosen vendor has retooled to support non-Intel, that toothpaste is not going back into the tube - not fully, at least.

      For both of these, AMD's improvement in reliability and delivery at scale will be bearing fruit for the next decade (at Intel's expense), and the mindshare, which gamers and tech sensationalism are indicators for, has already shifted the market away from an Intel-dominated world to a much more competitive one. Intel will have to truly compete in that market. Intel has stayed competitive in a price-to-performance sense by undermining their own bottom line, but that lever only has so far it can be pulled.

      So I'm not super bullish on Intel, sensationalism aside. They have a ton of momentum, but will need to make use of it ASAP, and they haven't shown an ability to do that so far.

  • layer8 18 hours ago

    Intel still has well over 70% x86 market share. They have a long runway. Arm had only 15% datacenter market share last year, and still hasn’t made much headway in the Windows market.

    • freeone3000 18 hours ago

      Arm is making huge gains though — five years ago they had less than 5%. The future of x86 is not bright.

      • baq 17 hours ago

        x86 vs arm doesn’t matter. Hardware matters. Intel needs to make the best cpu again. It can be x86, it can be arm, it can be risc-v.

        • adgjlsfhk1 16 hours ago

          Arm vs x86 matters a lot for Intel since they don't make Arm CPUs. x86 used to be a massive moat for Intel/AMD. The rise of ARM market-share means that that moat is draining. 10 years ago, AMD and IBM were the only competition (and they were both in rough shape). Now Intel is competing against AMD, NVidia, Qualcom, Amazon, and Arm. Even if Intel can make the best CPU again, they no longer can charge monopoly prices for it. If you have a 10% faster CPU, that only lets you charge a small premium over everyone else.

          • genewitch 4 hours ago

            There's an Arm cpu in every Intel CPU in ring 0 or -1, it boots a modified MINIX. So Intel knows a bit about ARM.

  • emkoemko 19 hours ago

    didn't i read something about apple,nvidia and other companies looking to use their foundries? why would they do that if its inferior or was that something else?

    • greenavocado 19 hours ago

      Because there's nothing else in America

  • porridgeraisin 18 hours ago

    I guess it depends on your expectations. Will they be fine as a company? I think yes. Will they be as prominent as they were at different points in their history? I think not.

    Product aside, from a shareholder/business point of view (I like to think of this separately these days as financial performance is becoming less and less reflective of the end product) I think they are too big to fail.

arkh 4 hours ago

So is it time for some cryptography coprocessor / cards?

201984 18 hours ago

  mitigations=off
Don't care.
  • matja 18 hours ago

    "Don't mind me running this piece of WASM in a webworker to collect all the useful encryption keys and cookies in your RAM..."

    • 201984 15 hours ago

      Has even a single web exploit ever been found in the wild? Until then, I'm not going to worry and probably not even then.

      • dzaima 14 hours ago

        As long as most people run with mitigations on, you're technically probably indeed safe. But you should still care that things get fixed with mitigations=on otherwise you wouldn't have the shield of "almost everyone has mitigations enabled for this so noone has reason to bother exploiting this"!

    • johnnyjeans 17 hours ago

      Uncaught ReferenceError: WebAssembly is not defined

      • vlovich123 16 hours ago

        You don't need WASM to deploy Spectre/Meltdown. Vanilla JS works just fine which is what was demonstrated in the original paper.

        • brobinson 14 hours ago

          Didn't all the major browsers alter their timing APIs to make this impossible/difficult?

          • vlovich123 3 hours ago

            They temporarily disabled high resolution timing APIs until they rearchitected how JS got executed in the wake of spectre/meltdown. They sandboxed JS into separate processes by domain (site-isolation) & created the concept of cross-origin isolation. The combination of the two lets you gain back sub-millisecond timers and SharedArrayBuffer which are two gadgets that were particularly useful for the Spectre paper.

          • anyfoo 13 hours ago

            I'm not an expert, but I think you can only make this harder by intentionally making timers less precise (even adding some random fuzz). Someone may correct me if I'm wrong, but I think statistically, a less precise timer means you will just need a longer runtime.

            Suppose you want to measure the distribution of the delay between recurring events (which is basically what's at the heart of those vulnerabilities). Suppose the delays are all sub-milliseconds, and that your timer, to pick something ridiculous, only has a 2 second granularity.

            You may at first think that you cannot measure the sub-millisecond distribution with such a corse timer. But consider that event and timers are not synchronized to each other, so with enough patience, you will still catch some events barely on the left or on the right side of your 2 second timer tick. Do this over a long enough time, and you can reconstruct the original distribution. Even adding some randomness to the timer tick just means you need more samples to suss the statistic out.

            Again, I am not an expert, and I don't know if this actually works, but that's what I came up with intuitively, and it matches with what I heard from some trustworthy people on the subject, namely that non-precision timers are not a panacea.

            • vlovich123 3 hours ago

              They are not a panacea (in some cases - the way Cloudflare Workers does them it does more effectively limit attacks vs how browsers have to work) but slowing down an attack is valuable because it can make the attack infeasible because your ability to retrieve anything damaging is bounded by how long you visit that website.

            • teruakohatu 12 hours ago

              > Even adding some randomness to the timer tick just means you need more samples to suss the statistic out.

              If each timer draws from the same random distribution then sure, you could work out the real tick with greater accuracy, but I don’t know if that is practical.

              If the timers draw from different distributions then it is going to be much harder.

              I imagine there is an upper limit of how much processing can be done per tick to before any attack becomes implausible.

              • anyfoo 12 hours ago

                > If the timers draw from different distributions then it is going to be much harder.

                Again, I'm an amateur, but I think you just need to know that distribution, which I guess you usually do (open source vs. closed source barely matters there), law of large numbers and all.

                Anyway, looking through literature, this article presents some actual ways to circumvent timers being made corse-grained: https://attacking.systems/web/files/timers.pdf

                In that article, the "Clock interpolation" sounds vaguely related to what I was describing on a quick read, or maybe it's something else entirely... Later, the article mentions alternative timing sources altogether.

                Either way, the conclusion of the article is that the mitigation approach as a whole is indeed ineffective: "[...] browser vendors decided to reduce the timer resolution. In this article, we showed that this attempt to close these vulnerabilities was merely a quick-fix and did not address the underlying issue. [...]"

    • anthk 5 hours ago

      UBlock Origin with JS turned off, or NoScript. Good luck.

    • bee_rider 17 hours ago

      Yeah, he should really turn mitigations on, so that when running arbitrary code from the internet he can be subject to 9999 vulnerabilities, instead of 10,000.

      • darkmighty 16 hours ago

        There are many kinds of vulnerabilities. Most are pretty mundane afaict. Breaking sandboxes and reading out your entire RAM is basically game over, existential vulnerability (second only to arbitrary code execution, though it can give you SSH keys I guess).

        The mitigating factor is actually that you don't go to malicious websites all the time, hopefully. But it happens, including with injected code on ads and stuff that may enabled by secondary vulnerabilities.

      • anyfoo 14 hours ago

        I challenge you to name another readily available "read arbitrary RAM from userspace"[1] vulnerability.

        [1] Not even including "potentially exploitable from JavaScript", which Spectre was. It's sufficient if you name one where an ordinary userspace program can do it.

        • genewitch 4 hours ago

          Can't you trivially do this with 4 lines of C?

          • Akronymus 19 minutes ago

            Only if you already have the ability to read arbitrary RAM. So running in kernel/hypervisor mode.

            The exploit is being able to do it from usermode through an api (browser/js) that normally forbids that.

            Userspace can only access its own memory, rather than the whole systems.

Alcatros552 14 hours ago

As it seems a lot of people are not aware that this one is a newer generation of branch predictor issue. You can see that Intels eIBRS doesn't mitigate the problems and make them susceptible to attacks. To prevent bigger issues the issue was released after Intel has been informed of the Issue and most systems are patched in the meantime.

tannhaeuser 18 hours ago

> All intel processors since the 9th generation (Coffee Lake Refresh) are affected by Branch Privilege Injection. However, we have observed predictions bypassing the Indirect Branch Prediction Barrier (IBPB) on processors as far back as 7th generation (Kaby Lake).

From that piece of text on the blog, I don‘t quite unterstand if Kaby Lake CPUs are affected or not.

  • chrisweekly 18 hours ago

    I interpret it as including Kaby Lake.

    • autoexec 13 hours ago

      which would mean that all our intel machines have been vulnerable and defenseless for the last 9 years.

  • fwip 18 hours ago

    At least some Kaby Lake CPUs are affected, but they can't say for sure that all of them are.

    • lostmsu 15 hours ago

      No, I think they are saying that they can only demonstrate exploit on Coffee Lake Refresh and later, but the issue that let them create exploit exists all the way back to Kaby Lake. So they are also probably exploitable, but this specific exploit does not target them.

gitroom 16 hours ago

yeah this just makes me wanna see real world numbers on the slowdown, cuz honestly all these microcode fixes feel like trading off years of speed for maybe a little more peace of mind - you ever think well actually move off this cycle or is it just here to stay?

dzdt 18 hours ago

The end-user processor slowdowns from Spectre and Meltdown mitigations were fairly substantial. Has anyone seen an estimate of how much the microcode updates for this new speculative vulnerability are going to cost in terms of slowdown?

  • leonidasv 18 hours ago

    > Our performance evaluation shows up to 2.7% overhead for the microcode mitigation on Alder Lake. We have also evaluated several potential alternative mitigation strategies in software with overheads between 1.6% (Coffee Lake Refresh) and 8.3% (Rocket lake)

    https://comsec.ethz.ch/research/microarch/branch-privilege-i...

    • dzdt 18 hours ago

      Thanks, missed that! I remember seeing benchmarks showing like 15% slowdown from Spectre/Meltdown mitigations, so this is not as bad as that, but that is on top of the other too I guess...

pawanjswal 10 hours ago

Just when we thought Spectre was fading, it pulls a full sequel—Intel CPUs still keeping things spicy!

The28thDuck 16 hours ago

Haven’t we been here before? It seems like it’s very similar to the branch prediction exploits of the late 2010s. Is there something particularly novel about this class of exploits?

  • mettamage 15 hours ago

    Probably, I haven't had time to delve into the article yet. But ever I first learned about them I got the hunch that they'd never fully go away.

    Then people say "no that's not possible, we got security in place."

    So then the researchers showcase a new demo where they use their existing knowledge with the same issue (i.e. scaling-induced voltage leaks).

    I suspect this will go on and on for decades to come.

  • dboreham 11 hours ago

    It's the same exploit, just exploiting a bug in the mitigation for the first exploit.

j45 18 hours ago

Since the cloud is someone else's computer, and someone else's shared CPU, is cloud hosting (including vps) potentially impacted?

Look forward to learning how this can be meaningfully mitigated.

  • andrewla 18 hours ago

    Intel claims [1] that they already have microcode mitigation. Like Spectre and Meltdown this is likely to have performance implications.

    [1] https://www.intel.com/content/www/us/en/security-center/advi...

    • j45 17 hours ago

      Spectre and Meltdown had some pretty big performance hits in the beginning. Wonder how much it will differ here in real world, third party (and independent) testing.

  • matja 18 hours ago

    For reads across different VMs on the same CPU, theoretically TME-MK could mitigate the usefulness of the memory reads by having each VM access memory using a different memory encryption key, but I don't know of any hypervisors that implement this.

    AMD has had SEV support in QEMU for a long time, which some cloud hosting providers use already, that would mitigate any such issue if it occurred on AMD EPYC processors.

    • wahern 13 hours ago

      Memory encryption doesn't usually protect against these kinds of side channels. You're not reading memory directly, but inferring it based on discernible behavior of the privileged code. Inasmuch as SEV and TME-MK are marketed as protecting VM guest memory from host machine snooping, they've proven insufficient many times before.[1] In the end, you have to trust your VM hosting provider, and trust that they've written their hypervisors in a robust way that takes into account these unforeseen (yet predictable) issues when isolating guests from each other.

      [1] See, e.g., https://www.amd.com/en/resources/product-security/bulletin/a... and https://www.intel.com/content/www/us/en/developer/articles/t...

    • anonymousDan 5 hours ago

      Memory encryption typically doesn't keep anything encrypted within the CPU (e.g. in caches). Haven't looked at the details of this bug but I expect it wouldn't help for that reason.

    • j45 13 hours ago

      Appreciate the insight on the AMD side.

      Their new processors are quite inviting, but like with all CPU’s I’d prefer to keep the entire thing to myself.

whatever1 19 hours ago

It’s dead, can you please stop stubbing it?

  • anonymars 18 hours ago

    I thought I understand these words, yet I don't understand what you mean

arghwhat 19 hours ago

> On an up to date Ubuntu 24.04

So not very up to date, but I suppose mitigations haven't changed significantly upstream since then.

  • necubi 19 hours ago

    24.04 is the most recent LTS (long term support) release; it's what users are meant to be running for anything important

    • arghwhat 16 hours ago

      My point is that it is not representative of the current state of the kernel.

      The kernel has nothing to do with Ubuntu, its release schedule and LTS's. Distro LTS releases also often mean custom kernels, backports, hardware enablement, whatnot, which makes it a fork, so unless were analyzing Ubuntu security rather than Linux security, mainline should be used.

      • 42lux 14 hours ago

        Microcode updates have nothing to do with the kernel?

        • arghwhat 6 hours ago

          Mitigations are generally kernel workarounds, not microcode updates. Microcode updates provide some fixes or disable features, but all kernel efforts and why they specify the software at all is because of kernel mitigations.

  • thomasdziedzic 19 hours ago

    That version is significant because it is the latest LTS release. Most servers use LTS releases.

  • blueflow 19 hours ago

    Ubuntu 24.04 is the current LTS release. Our are you intending to say that Ubuntu, regardless of version, is not up to date?

    Edit: "LTS" added due to popular demand

    • pdpi 19 hours ago

      You need a qualifier there — the latest Ubuntu release is 25.04, but 24.04 is the current LTS release.

      • razemio 18 hours ago

        It is up to date, with security patches and fixes. That is obviously what is relevant here. That is why the parent comment got down voted, since it is up to date in context of a security vulnerability. It should be even more secure, since new software versions might introduce unknown attack vectors.

    • arghwhat 16 hours ago

      I am saying that any version of Ubuntu is not representative of the mainline kernel, which is what is relevant when it comes to analyzing current mitigations.

      Distro LTS releases often mean custom kernels, backports, hardware enablement, whatnot, which makes it effectively a fork.

      Unless were interested in discovering kernel variation discrepancies, its more interesting to analyze mainline.

      • fc417fc802 12 hours ago

        I'd expect an awful lot of production workloads to be running on LTS kernel versions (and likely also LTS distro releases). So the mitigations currently available in an LTS release of a mainstream distro are quite relevant.

        • arghwhat 5 hours ago

          They are running their LTS distro's LTS kernel, but that us not an upstream thing

          On an LTS, you'll be running a Canonical kernel, or a Red Hat kernel, or a SuSE kernel, or an Oracle kernel, or...

          Each will have different backports, different hardware enablement, different random patches of choice, and so different bugs and problems.

          Unless were evaluating the security of a particular distro release, mainline is what is Linux and will ultimately be the shared base for future releases.

  • 7bit 19 hours ago

    There is a difference between an up2date Ubuntu 24.04 and an up2date Ubuntu.

    And as security updates are back ported to all supported versions - and 24.04 being an LTS release, it is as up2date as it gets.

    If you're being pedantic, be the right kind of pedantic ;)

    • arghwhat 15 hours ago

      The problem is that it's downstream backports and hardware enablement - you're running an old forked artisinal kernel maintained by Canonical, you will only get bugfixes if known to be severe enough to be flagged, and all this patching deviates it from mainline and can itself introduce new security vulnerabilities not present in mainline.

      This differs from an actual later release which is closer to mainline and includes all newer fixes, including ones that are important but weren't flagged, and with less risk of having new downstream bugs.

      If you're going to fight pedantism by being pedantic, better be the right kind of pedantic. ;)

  • fwip 19 hours ago

    24.04 is an LTS (long term support) release, so it receives updates, including security updates, for much longer than a regular release. I believe it's a 5-year support window, and longer if you shell out for paid support.

    • arghwhat 15 hours ago

      These updates mean that you are no longer running a mainline kernel, but an Ubuntu fork with whatever backports and hardware enablement (and new bugs!) this might introduce. This is also true for other software.

      LTS does not mean you get all updates, it only means you get to drag your feet for longer with random bugfixes. Only the latest release has updates.

      • anyfoo 13 hours ago

        This only matters if the mainline kernel since then somehow experienced changes which would affect this hardware vulnerability (fixed through microcode), which I see no indication of?

        • arghwhat 5 hours ago

          CPU vulnerabilities are first fixed through kernel mitigations, only sometimes through microcode.

          But security research should be done against the current state. Something as simple as a performance optimization can end up affecting the exploitability, and while that doesn't change whether the CPU is vulnerable it does change the conclusion.

          Evaluering if a particular old, forked codebase is security-wise is identical is a fools errand, and then that doesn't answer whether an equivalent Red Hat kernel is vulnerable as that's a different fork with different backports and local patches. Mainline is the shared base.

          • anyfoo 5 hours ago

            I don’t quite understand how that matters here. The researchers found a CPU vulnerability. They demonstrated it on a popular Linux distribution and LTS version, Ubuntu 24.04. They likely picked that to show that the attack is not purely theoretical, but feasible on something that real users currently use for real things. There is a microcode fix available that solves this problem, presumably across all OSes and releases. Whether the kernel is current and how much it diverges is, frankly, irrelevant.