News Nug
all agent benchmark deployment eval fine tuning inference new model open source research tool workflow
Proposal: Use semantic compression as input diffusion to read sessions larger than the context window [R]
r/MachineLearning · 5h ago · 5
google/tabfm-1.0.0
r/LocalLLaMA · 6h ago · 5
BaryGraph - knowledge graph where every relationship is its own embedded document (not an edge) [R]
r/MachineLearning · 8h ago · 5
I built my 'first' flow matching image generator, here's what I learned [P]
r/MachineLearning · 11h ago · 5
Open Source AI Gap Map
Simon Willison · 18h ago · 5
Quoting Josh W. Comeau
Simon Willison · 19h ago · 5
H64LM: A 249M-parameter Mixture-of-Experts Transformer built from scratch in PyTorch [P]
r/MachineLearning · 19h ago · 5
Tom Yeh's AI by hand? is it worth it? [D]
r/MachineLearning · 19h ago · 5
Uh.. Honey, how do you feel about takeout?
r/LocalLLaMA · 20h ago · 5
Contrastive Decoding Diffing (CDD): recovering verbatim finetuning data from logits alone, no weight access needed[R]
r/MachineLearning · 21h ago · 5
Fable's judgement
Simon Willison · 21h ago · 5
Small Language Model SLM [D]
r/MachineLearning · 1d ago · 5
Portugal just released their own LLM Amalia (9B)!
r/LocalLLaMA · 1d ago · 5
June 2026 newsletter
Simon Willison · 1d ago · 5
Mistral released Leanstral-1.5-119B-A6B
r/LocalLLaMA · 1d ago · 5
Google DeepMind and A24 announce first-of-its-kind research partnership
DeepMind Blog · 1d ago · 5
What does "Safe AI" look like? [D]
r/MachineLearning · 1d ago · 5
Follow-up: DeepSeek V4 Flash on 2x RTX PRO 6000 finishes real coding tasks faster than Sonnet and Opus, at about Sonnet quality
r/LocalLLaMA · 1d ago · 5
Claude Code and China: The mechanism is activated when the user sets the ANTHROPIC_BASE_URL environment variable (used for local models)
r/LocalLLaMA · 1d ago · 5
AIEWF Daily Dispatch: The great loops debate and the state of AI engineering
Latent Space · 1d ago · 5
<12345…75>