Hallucination isn’t nearly as big a problem as it used to be. Newer models aren’t perfect, but they’re better.
The problem addressed by this isn’t hallucination, it’s the training to avoid failure states. Instead of guessing (which is different from hallucinating), the system forces a negative response.
That’s easy, and any company, big or small, could do it. Big companies just like the bullshit.
Buuuuullshit. I asked different models about the ten highest-scoring summer transfers and got wildly different answers. They then tried to explain why and got even more numbers wrong.
deleted by creator
A benchmark tailored to LLMs’ strengths calls you a liar.
https://artificialanalysis.ai/articles/gemini-3-flash-everything-you-need-to-know (A month ago the hallucination rate was ~50–70%.)