

Yes, sorry, where I live it’s pretty normal for cars to be diesel-powered. What I meant by my comparison was that a train, when measured uncritically, uses more energy to run than a car due to its size and behavior, but that when compared fairly, the train has obvious gains and tradeoffs.
DeepSeek, as a 600B model, is more efficient than the 400B Llama model (a fairer size comparison), because it’s a mixture-of-experts model with fewer active parameters. Even when run in the R1 reasoning configuration, it is probably still more efficient than a dense model of comparable intelligence.
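To put rough numbers on that, here is a back-of-the-envelope sketch. It assumes the publicly listed figures (about 671B total and 37B active parameters per token for DeepSeek, 405B for the dense Llama) and the common rule of thumb of roughly 2 FLOPs per active parameter per generated token. It ignores attention cost, memory bandwidth, and batching, so treat it as an illustration, not a benchmark.

```python
# Parameter counts in billions. These are assumptions taken from public
# model cards, not measurements.
DEEPSEEK_TOTAL_B = 671    # all experts combined
DEEPSEEK_ACTIVE_B = 37    # parameters actually used per token (MoE routing)
LLAMA_DENSE_B = 405       # dense model: every parameter is used per token


def flops_per_token(active_params_b: float) -> float:
    """Rule-of-thumb forward-pass cost: ~2 FLOPs per active parameter."""
    return 2 * active_params_b * 1e9


def break_even_token_ratio(dense_b: float, active_b: float) -> float:
    """How many times more tokens the MoE model can generate before it
    spends as much compute as the dense model does on one answer."""
    return flops_per_token(dense_b) / flops_per_token(active_b)


ratio = break_even_token_ratio(LLAMA_DENSE_B, DEEPSEEK_ACTIVE_B)
print(f"MoE can emit about {ratio:.1f}x the tokens for the same compute")
```

Under those assumptions, a reasoning run would have to produce roughly an order of magnitude more tokens than the dense model's answer before it costs more compute.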
Once they finally lock down the player so it’s impossible to block or skip ads, I look forward to coding a script which screen records each video on my sub list, feeds each recording, ads included, into a purpose-made classifier model which labels the ads, cuts the ads out with FFmpeg, and then uploads the result to my Jellyfin server.
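The cutting step is the easy part. A minimal sketch, assuming the classifier (hypothetical, not shown) has already returned ad intervals as (start, end) pairs in seconds: compute the parts to keep, then build one FFmpeg command per kept segment. The function names and file names here are made up for illustration.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def keep_segments(duration: float, ads: List[Segment]) -> List[Segment]:
    """Return the parts of [0, duration] not covered by any ad interval."""
    kept: List[Segment] = []
    cursor = 0.0
    for start, end in sorted(ads):
        # Clamp each ad interval to the bounds of the recording.
        start = min(max(start, 0.0), duration)
        end = min(max(end, 0.0), duration)
        if start > cursor:
            kept.append((cursor, start))
        # Overlapping ads just push the cursor further along.
        cursor = max(cursor, end)
    if cursor < duration:
        kept.append((cursor, duration))
    return kept


def ffmpeg_cut_command(src: str, seg: Segment, dst: str) -> List[str]:
    """Build (but do not run) an FFmpeg command that copies one segment.
    With stream copy, cuts snap to keyframes, so edges may be slightly off."""
    start, end = seg
    return ["ffmpeg", "-i", src, "-ss", str(start), "-to", str(end),
            "-c", "copy", dst]


if __name__ == "__main__":
    segments = keep_segments(100.0, [(10.0, 20.0), (50.0, 60.0)])
    for i, seg in enumerate(segments):
        print(" ".join(ffmpeg_cut_command("recording.mp4", seg, f"part{i}.mp4")))
```

The resulting parts could then be joined back together, for example with FFmpeg's concat demuxer, before uploading.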