News Nug
all agent alignment api update architecture benchmark business cuda dataset deployment eval evaluation fine tuning game dev hardware hype inference infrastructure library monitoring new model open source optimization plugin probe targeted prompt engineering quantization rag research rl training security tool training tutorial workflow
OpenAI S-1 🇺🇸, Siri AI 📱, Xiaomi Ultraspeed ⚡
TLDR AI · 45m ago · 5
Rick & Morty
r/LocalLLaMA · 2h ago · 5
Are privacy-preserving techniques actually being used in production ML systems? [D]
r/MachineLearning · 5h ago · 5
Understanding Pytorch better and Moving forward from papers [D]
r/MachineLearning · 5h ago · 5
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces
HuggingFace Blog · 5h ago · 5
Papers figures [D]
r/MachineLearning · 7h ago · 5
[AINews] FrontierCode: Benchmarking for Code Quality over Slop
Latent Space · 10h ago · 5
2X tk/s (from 19.4 -> 38.1 tk/s on 1 x MI50) Playing with a hypothesis like speculative decoding.. but instead of an additional side model, exploiting that I can run multiple computations side-by-side AS IF I had Qwen3.6-27B loaded twice in memory - small quants don't use all the available compute.
r/LocalLLaMA · 14h ago · 5
Siri AI at WWDC 2026
Simon Willison · 16h ago · 5
Jun 8, 2026SciencePaving the way for agents in biology
Anthropic Research · 18h ago · 5
Me: Arguing with an AI bot who just posted something on this sub about Llama 3.1.
r/LocalLLaMA · 20h ago · 5
STOP racist posts about Chinese researchers [D]
r/MachineLearning · 22h ago · 5
When every other post is an AI generated benchmark report, a question about the best model, or a slop-coded application or engine that pretends to be groundbreaking
r/LocalLLaMA · 22h ago · 5
Université Paris Saclay or TU Delft for Applied Mathematics Masters [R]
r/MachineLearning · 1d ago · 5
Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server
r/LocalLLaMA · 1d ago · 5
OpenAI govt stake 🇺🇸, Google compute deal 🚀, Microsoft Scout launch 🤖
TLDR AI · 1d ago · 5
Luce Spark: a 35B MoE on a 16 GB GPU, without the offload tax
r/LocalLLaMA · 1d ago · 5
Confidential submission of draft S-1 to the SEC
OpenAI Blog · 1d ago · 5
mtmd : add video input support by ngxson · Pull Request #24269 · ggml-org/llama.cpp
r/LocalLLaMA · 1d ago · 5
Gemma 4 Chat Template now has preserve thinking
r/LocalLLaMA · 1d ago · 5
<12345…46>