Foundation Model
Introducing Claude Opus 4.7, Anthropic
·912 words·5 mins
Articles
AI
Foundation Model
Embarrassingly Simple Self-Distillation Improves Code Generation
·588 words·3 mins
Research
Foundation Model
LLM
Show HN: 1-Bit Bonsai, the First Commercially Viable 1-Bit Large Language Models
·476 words·3 mins
Hacker News
Foundation Model
LLM
AI
PrismML — Concentrating Intelligence
·953 words·5 mins
Articles
Foundation Model
Machine Learning
AI
GitHub - z-lab/paroquant: [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning in Large Language Model Inference
·888 words·5 mins
Articles
AI
LLM
Machine Learning
Foundation Model
Python
Conditional Memory via Scalable Lookup: A New Dimension of Sparsity for Large Language Models
·774 words·4 mins
Research
Foundation Model
LLM
NVIDIA PersonaPlex: Natural Conversational AI With Any Role and Voice - NVIDIA ADLR
·999 words·5 mins
Articles
AI
Foundation Model
Ask HN: What is the best way to provide continuous context to models?
·631 words·3 mins
Hacker News
API
AI
Foundation Model
Natural Language Processing
Recursive Language Models
·677 words·4 mins
Research
AI
Foundation Model
LLM
Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time
·1047 words·5 mins
Course
Natural Language Processing
AI
Foundation Model
LLM
We present Olmo 3, our next family of fully open, leading language models
·903 words·5 mins
Articles
LLM
Foundation Model
A2UI
·921 words·5 mins
Articles
LLM
Foundation Model
Nano Banana Pro: Gemini 3 Pro Image model from Google DeepMind
·1092 words·6 mins
Articles
Go
Image Generation
Foundation Model
Supercharge your OCR Pipelines with Open Models
·516 words·3 mins
Articles
Foundation Model
AI
DevOps
Gemini 3: Introducing the latest Gemini AI model from Google
·949 words·5 mins
Articles
AI
Go
Foundation Model
said we should delete tokenizers
·467 words·3 mins
Articles
Natural Language Processing
Foundation Model
AI
🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here
·567 words·3 mins
Articles
Tool
Natural Language Processing
AI Agent
Foundation Model
Source: Thanks and Bharat for showing the world you can in fact tra...
·592 words·3 mins
Articles
AI
Foundation Model
Tongyi DeepResearch: A New Era of Open-Source AI Researchers | Tongyi DeepResearch
·525 words·3 mins
Articles
Foundation Model
AI Agent
AI
MiniMax-M2
·481 words·3 mins
GitHub
AI Agent
Open Source
Foundation Model
I quite like the new DeepSeek-OCR paper
·489 words·3 mins
Articles
Foundation Model
Go
Computer Vision
Natural Language Processing
olmOCR 2: Unit test rewards for document OCR | Ai2
·530 words·3 mins
Articles
Foundation Model
AI
How to Get Consistent Classification From Inconsistent LLMs?
·568 words·3 mins
Articles
Foundation Model
Go
LLM
Stanford's ALL FREE Courses [2024 & 2025] ❯ CS230 - Deep Learni...
·611 words·3 mins
Articles
LLM
Transformer
Deep Learning
Natural Language Processing
Foundation Model
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
·565 words·3 mins
Articles
Computer Vision
Foundation Model
LLM
NeuTTS Air
·450 words·3 mins
GitHub
Foundation Model
Python
AI
Open Source
My trick for getting consistent classification from LLMs
·550 words·3 mins
Hacker News
Foundation Model
Go
LLM
EU-funded TildeOpen LLM delivers European AI breakthrough for multilingual innovation | Shaping Europe’s digital future
·494 words·3 mins
Articles
API
AI
Foundation Model
LLM
Qwen-Image
·530 words·3 mins
GitHub
Computer Vision
Open Source
Foundation Model
Python
Image Generation
Natural Language Processing
Huge AI market opportunity in 2025
·508 words·3 mins
Articles
API
AI
Foundation Model
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model
·501 words·3 mins
GitHub
Foundation Model
LLM
Python
Open Source
Computer Vision
Small models are the future of agentic AI
·460 words·3 mins
Articles
AI
AI Agent
Foundation Model
Kimi K2: Open Agentic Intelligence
·534 words·3 mins
Articles
AI Agent
Foundation Model
Introducing Qwen3-Max-Preview (Instruct)
·440 words·3 mins
Articles
API
AI
Foundation Model
VibeVoice: A Frontier Open-Source Text-to-Speech Model
·588 words·3 mins
Hacker News
Framework
Best Practices
Foundation Model
Natural Language Processing
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS
·580 words·3 mins
Hacker News
LLM
AI
Foundation Model
Alexander Kruel - Links for 2025-08-24
·505 words·3 mins
Articles
Foundation Model
AI
DSPy
·524 words·3 mins
Articles
Framework
Best Practices
Foundation Model
LLM
Build a Large Language Model (From Scratch)
·604 words·3 mins
GitHub
Foundation Model
LLM
Open Source
CS294/194-196 Large Language Model Agents | CS 194/294-196 Large Language Model Agents
·556 words·3 mins
Articles
AI Agent
Foundation Model
LLM
Claudia – Desktop companion for Claude Code
·597 words·3 mins
Hacker News
Foundation Model
AI
The race for LLM cognitive core
·553 words·3 mins
Articles
LLM
Foundation Model
Qwen3-Coder: Agentic coding in the world
·617 words·3 mins
Hacker News
AI Agent
Foundation Model
Voxtral | Mistral AI
·443 words·3 mins
Articles
AI
Foundation Model
SymbolicAI: A neuro-symbolic perspective on LLMs
·658 words·4 mins
Hacker News
Framework
Foundation Model
Python
Best Practices
LLM
AI
Gemini for Google Workspace Prompting Guide 101
·464 words·3 mins
Articles
AI
Go
Foundation Model
MCP is eating the world—and it's here to stay
·552 words·3 mins
Articles
Natural Language Processing
AI
Foundation Model
Building Effective AI Agents
·597 words·3 mins
Hacker News
AI Agent
AI
Foundation Model
Snorting the AGI with Claude Code
·648 words·4 mins
Hacker News
Framework
Code Review
AI
Best Practices
Foundation Model
Nanonets-OCR-s – OCR model that transforms documents into structured markdown
·589 words·3 mins
Hacker News
LLM
Foundation Model
[2505.24863] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
·548 words·3 mins
Articles
Foundation Model
[2505.24864] ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
·538 words·3 mins
Course
LLM
Foundation Model
Ask HN: What is the best LLM for consumer grade hardware?
·602 words·3 mins
Hacker News
LLM
Foundation Model
Show HN: AutoThink – Boosts local LLM performance with adaptive reasoning
·609 words·3 mins
Hacker News
LLM
Foundation Model
Show HN: My LLM CLI tool can run tools now, from Python code or plugins
·674 words·4 mins
Hacker News
Tool
LLM
Foundation Model
Python
A Research Preview of Codex
·600 words·3 mins
Hacker News
AI
Foundation Model
Ollama's new engine for multimodal models
·520 words·3 mins
Articles
Foundation Model
Vision Now Available in Llama.cpp
·556 words·3 mins
Hacker News
Foundation Model
AI
Computer Vision
Token & Token Usage | DeepSeek API Docs
·486 words·3 mins
Articles
API
Natural Language Processing
Foundation Model
Gemma 3 QAT Models: Bringing state-of-the-art AI to consumer GPUs
·540 words·3 mins
Articles
Go
Foundation Model
AI
GitHub - HandsOnLLM/Hands-On-Large-Language-Models: Official code repository for the O'Reilly Book - 'Hands-On Large Language Models'
·1324 words·7 mins
GitHub
LLM
Open Source
Foundation Model
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning | Nature
·574 words·3 mins
Course
Framework
LLM
AI
Best Practices
Foundation Model
A foundation model to predict and capture human cognition | Nature
·490 words·3 mins
Articles
Go
Foundation Model
Natural Language Processing
LLM
AI
Large language models are proficient in solving and creating emotional intelligence tests | Communications Psychology
·546 words·3 mins
Articles
AI
LLM
Foundation Model