Scoop: Four reasons Musk attacked Trump's "big beautiful bill"

brucethemoose@lemmy.world · edit-2 16 hours ago

Ughhh, I could go on forever, but to keep it short:

Tech bro enshittification: https://old.reddit.com/r/LocalLLaMA/comments/1p0u8hd/ollamas_enshitification_has_begun_opensource_is/
Hiding attribution to the actual open source project it’s based on: https://old.reddit.com/r/LocalLLaMA/comments/1jgh0kd/opinion_ollama_is_overhyped_and_its_unethical/
A huge support drain on llama.cpp, without a single cent, nor a notable contribution, given back.
Constant bugs and broken models from “quick and dirty” model support updates, just for hype.
Breaking standard GGUFs.
Deliberately misnaming models (like the Deepseek Qwen distills and “Deepseek”) for hype.
Horrible defaults (like ancient default models, 4096 context, really bad/lazy quantizations).
A bunch of spam, drama, and abuse on Linkedin, Twitter, Reddit and such.

Basically, the devs are Tech Bros. They’re scammer-adjacent. I’ve been in local inference for years, and wouldn’t touch ollama if you paid me to. I’d trust Gemini API over them any day.

I’d recommend base llama.cpp or ik_llama.cpp or kobold.cpp, but if you must use an “turnkey” and popular UI, LMStudio is way better.

But the problem is, if you want a performant local LLM, nothing about local inference is really turnkey. It’s just too hardware sensitive, and moves too fast.

brucethemoose@lemmy.world · edit-2 16 hours ago

Also, for any interested, desktop inference and quantization is my autistic interest. Ask my anything.

I don’t like Gemma 4 much so far, but if you want to try it anyway:

On Nvidia with no CPU offloading, watch this PR and run it with TabbyAPI: https://github.com/turboderp-org/exllamav3/pull/185
With CPU offloading, watch this PR and the mainline llama.cpp issues they link. Once Gemma4 inference isn’t busted, run it in IK or mainline llama.cpp: https://github.com/ikawrakow/ik_llama.cpp/issues/1572
If you’re on an AMD APU, like a Mini PC server, look at: https://github.com/lemonade-sdk/lemonade
On an AMD or Intel GPU, either use llama.cpp or kobold.cpp with the vulkan backend.
Avoid ollama like it’s the plague.
Learn chat templating and play with it in mikupad before you use a “easy” frontend, so you understand what its doing internally (and know when/how it goes wrong): https://github.com/lmg-anon/mikupad

But TBH I’d point most people to Qwen 3.5/3.6 or Step 3.5 instead. They seem big, but being sparse MoEs, they can run quite quickly on single-GPU desktops: https://huggingface.co/models?other=ik_llama.cpp&sort=modified

brucethemoose@lemmy.world · edit-2 21 hours ago

There’s a whole lot of interest in locally runnable ML. It was there even before ChatGPT 3.5 started the tech bro hype train, when tinkerers were messing with GPT-J 6B and GAN models.

In a nutshell, it’s basically Lemmy vs Reddit. Local and community-developed vs toxic and corporate.

brucethemoose@lemmy.world · edit-2 21 hours ago

They seem to have held back the “big” locally runnable model.

It’s also kinda conservative/old, architecture wise: 16-bit weights, sliding window attention interleaved with global attention. No MTP, no QAT (yet), no tightly integrated vision, no hybrid mamba like Qwen/Deepseek, nothing weird like that. It’s especially glaring since we know Google is using an exotic architecture for Gemini, and has basically infinite resources for experimentation.

It also feels kinda “deep fried” like GPT-OSS to me, see: https://github.com/ikawrakow/ik_llama.cpp/issues/1572

it is acting crazy. it can’t do anything without the proper chat template, or it goes crazy.

IMO it’s not very interesting, especially with so many other models that run really well on desktops.

brucethemoose@lemmy.world · edit-2 2 days ago

it’s a form of private journalism, private opinion, and private art

But without any of the liability hazard.

This is my issue: the big platforms having their cake and eating it. In one breath, they claim to be little open-platform garage startups that can’t possibly be responsible for the content of their users; they’re just a utility. They need protection from Congress. In another breath, they’re the stewards of generations and children, the only ones responsible enough to tame the internet’s criminality. All while making trillions.

They want to be “private content” protected from the government? Fine. Treat them like it, legally.

brucethemoose@lemmy.world · edit-2 2 days ago

It is when it warps the behavior of everyone else around you, and everything in charge of your life.

And I’m not just talking about the lost attention. The algorithms are not neutral.

brucethemoose@lemmy.world · edit-2 3 days ago

Still, what about citations of articles that themselves contain hallucinated citations? It’s a food chain problem.

I guess what I’m saying is the checking should be more… accessible? And less costly, hence machine automatable. This would increase the quality of journals that, for whatever reason, don’t do enough human verification, and it would allow bigger journals to do deeper checks “down the chain” with the same labor.

brucethemoose@lemmy.world · edit-2 3 days ago

Then machine checking should be implemented

Whatever needs to be restructured to make citations automatically verifiable needs to happen, and then this will be less of an issue.

brucethemoose@lemmy.world · edit-2 3 days ago

The citation format needs an overhaul.

Make citations hyperlinked and publicly accessible. Past a certain date (2004?), make it mandatory. And if the research is mega paywalled, well… perhaps we should do something about that, too.

Then they’d be machine-verifiable.

The system has been dysfunctional. As it is elsewhere, the convenience of AI fraud/slop is simply exacerbating the existing issue to breaking points.

brucethemoose@lemmy.world · edit-2 8 days ago

I think you mean monitor their usage.

And to be fair, this is fairly technical. Many parents aren’t very technical. They’re unaware of parental controls they have access to, and I think that’s by design (as it would be unprofitable for social media).

brucethemoose@lemmy.world · 9 days ago

Can we not quote a Polymarket tweet? WTF.

brucethemoose@lemmy.world · 9 days ago

Presumably because one needs a phone app or physical hardware (like a Yubikey) to use them.

I dunno. Shrug

brucethemoose@lemmy.world · edit-2 10 days ago

Yeah. I prefer the idea of a bunch of 9-meters unless they can really perfect a cheap folding mirror to mass produce.

A small upper stage, an ion drive or something could get them to deep space. It’s not worth flying a whole Starship out there and burning more fuel to get it back; the return trip only makes sense for LEO.

brucethemoose@lemmy.world · edit-2 10 days ago

I wonder how big you could get the mirror if you did it James Webb style in starship.

Presumably 7x ~8m hexagons folded up?

That is a good point though. And if one were to design a “budget” 9m space telescope, they could amortize the R&D dramatically by launching the same design many times, perhaps with different sensors for different purposes? Amortization is why the Falcon Heavy and such are so cheap, and why the Space Shuttle and JWST are obscenely expensive.

Okay, you’ve sold me. I hope this does happen.

brucethemoose@lemmy.world · edit-2 10 days ago

Theoretically, even if we assume SpaceX is overshooting, that’s an interesting thought:

https://www.visualcapitalist.com/the-cost-of-space-flight/

launch cost chart

In practice? I’m more concerned about interest in funding astronomy in the first place.

That, and big fat telescopes are fundamentally expensive. And (at least for the optical variety) “swarming” them with a bunch of cheaper units isn’t as effective as building a big one.

I’d love to be wrong though. There are some interesting papers on swarms of optical telescopes for a larger effective aperture, but I’m not qualified to assess them.

brucethemoose@lemmy.world · edit-2 10 days ago

They’re selectively asking for verification to do it. That’s mixed, because:

They’ll only ask to verify “suspicious” accounts. So all the bots that “behave” are going to stay, which is what the bots will now optimize for.
Verification will become another form of selective enforcement. Say the wrong then, and you get either verify or get banned.
As for the methods, see for yourself:

When confirming that there is a human behind an account, we prefer third-party tools that keep a distance between verification and Reddit itself. Any system we use will not expose your real-world identity to Reddit nor your Reddit username or activity to any third party. There are a handful of ways to do this, and I’m sure there will be more. Each have their tradeoffs:

Passkeys (which are well supported by Apple, Google, YubiKey, and various password managers) - These are lightweight, require a human to do something, and don’t require your ID. The tradeoff is that there is no proof of individuality or anything other than “a human probably did something.” Nevertheless, it’s a great starting point.

Third-party biometric services - For example, World ID (yes, the Orb company, though they have non-Orb solutions as well). This technology unlocks proof-of-individual without requiring your name, government ID, or a centralized database. I think the internet needs verification solutions like this, where your account information, usage data, and identity never mix.

Third-party government ID services - In some countries, such as the UK and Australia, governments require us to use these. These are the least secure, least private, and least preferred. When we are forced to do this, we design the integrations so that we never actually see your ID information, so your Reddit data cannot be tied to you.

Draw your own conclusion.

But my take? It’s the worst of everything: Only the most primitive, obvious bots get banned. “Transparent,” sycophantic bots will all stay on Reddit, and get even stealthier. Rebellious human users will get hit with verification, at the whim of whatever opaque algorithm determines they’re “bot-like,” which is a fantastic recipe for censorship without the appearance of doing so.

And this is all if you take Spez at his word. There’s a lot of history suggesting you should not.

brucethemoose@lemmy.world · edit-2 10 days ago

That’s the neat part. The Fediverse doesn’t eat itself; more activity on Piefed is more activity on Lemmy.

brucethemoose@lemmy.world · 10 days ago

Yeah; 100%.

brucethemoose@lemmy.world · edit-2 11 days ago

Go go China !

Bops the tankie.

Like, I have a Chinese LLM loaded right this second and follow them closely, but holy moly. Curb your enthusiasm.

Anyway, OpenAI has plenty of compute to train a Sora 2 if they want, but apparently they don’t. My guess is some combination of:

They couldn’t figure out a more efficient architecture, like you speculated. I buy that. OpenAI’s development is way more conservative than you’d think, and video generation is inherently intense, especially if Sora 1 is the baseline.
…Maybe they looked at metrics, saw Sora is mostly used for spam, scams, or worse, and pulled the plug for liability reasons?
They’re focusing on short-term profitability, as other commenters mentioned.

brucethemoose@lemmy.world · 16 days ago

In beta, available as a Flatpak.

Seemed janky for me, but I only tried it for a few minutes.

brucethemoose@lemmy.world · edit-2 10 months ago

Scoop: Four reasons Musk attacked Trump's "big beautiful bill"