Skip to main content
  1. Tags/

Foundation Model

Introducing Claude Opus 4.7, Anthropic
·912 words·5 mins
Articoli AI Foundation Model
Embarrassingly Simple Self-Distillation Improves Code Generation
·588 words·3 mins
Research Foundation Model LLM
Show HN: 1-Bit Bonsai, the First Commercially Viable 1-Bit Large Language Models
·476 words·3 mins
Hacker News Foundation Model LLM AI
PrismML — Concentrating Intelligence
·953 words·5 mins
Articoli Foundation Model Machine Learning AI
GitHub - z-lab/paroquant: [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning in Large Language Model Inference
·888 words·5 mins
Articoli AI LLM Machine Learning Foundation Model Python
Conditional Memory via Scalable Lookup: A New Dimension of Sparsity for Large Language Models
·774 words·4 mins
Research Foundation Model LLM
NVIDIA PersonaPlex: Natural Conversational AI With Any Role and Voice - NVIDIA ADLR
·999 words·5 mins
Articoli AI Foundation Model
Ask HN: What is the best way to provide continuous context to models?
·631 words·3 mins
Hacker News API AI Foundation Model Natural Language Processing
Recursive Language Models
·677 words·4 mins
Research AI Foundation Model LLM
Reimagining LLM Memory: Using Context as Training Data Unlocks Models That Learn at Test-Time
·1047 words·5 mins
Corso Natural Language Processing AI Foundation Model LLM
We present Olmo 3, our next family of fully open, leading language models
·903 words·5 mins
Articoli LLM Foundation Model
A2UI
·921 words·5 mins
Articoli LLM Foundation Model
Nano Banana Pro: Gemini 3 Pro Image model from Google DeepMind
·1092 words·6 mins
Articoli Go Image Generation Foundation Model
Supercharge your OCR Pipelines with Open Models
·516 words·3 mins
Articoli Foundation Model AI DevOps
Gemini 3: Introducing the latest Gemini AI model from Google
·949 words·5 mins
Articoli AI Go Foundation Model
said we should delete tokenizers
·467 words·3 mins
Articoli Natural Language Processing Foundation Model AI
"🚀 Hello, Kimi K2 Thinking! The Open-Source Thinking Agent Model is here"
·567 words·3 mins
Articoli Tool Natural Language Processing AI Agent Foundation Model
Source: Thanks and Bharat for showing the world you can in fact tra...
·592 words·3 mins
Articoli AI Foundation Model
Tongyi DeepResearch: A New Era of Open-Source AI Researchers | Tongyi DeepResearch
·525 words·3 mins
Articoli Foundation Model AI Agent AI
MiniMax-M2
·481 words·3 mins
GitHub AI Agent Open Source Foundation Model
I quite like the new DeepSeek-OCR paper
·489 words·3 mins
Articoli Foundation Model Go Computer Vision Natural Language Processing
olmOCR 2: Unit test rewards for document OCR | Ai2
·530 words·3 mins
Articoli Foundation Model AI
How to Get Consistent Classification From Inconsistent LLMs? "How to Obtain Consistent Classification From Inconsistent Language Models?"
·568 words·3 mins
Articoli Foundation Model Go LLM
Stanford's ALL FREE Courses [2024 & 2025] ❯ CS230 - Deep Learni...
·611 words·3 mins
Articoli LLM Transformer Deep Learning Natural Language Processing Foundation Model
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
·565 words·3 mins
Articoli Computer Vision Foundation Model LLM
NeuTTS Air
·450 words·3 mins
GitHub Foundation Model Python AI Open Source
My trick for getting consistent classification from LLMs
·550 words·3 mins
Hacker News Foundation Model Go LLM
EU-funded TildeOpen LLM delivers European AI breakthrough for multilingual innovation | Shaping Europe’s digital future
·494 words·3 mins
Articoli API AI Foundation Model LLM
Qwen-Image
·530 words·3 mins
GitHub Computer Vision Open Source Foundation Model Python Image Generation Natural Language Processing
Huge AI market opportunity in 2025
·508 words·3 mins
Articoli API AI Foundation Model
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model
·501 words·3 mins
GitHub Foundation Model LLM Python Open Source Computer Vision
Small models are the future of agentic ai
·460 words·3 mins
Articoli AI AI Agent Foundation Model
Kimi K2: Open Agentic Intelligence
·534 words·3 mins
Articoli AI Agent Foundation Model
Introducing Qwen3-Max-Preview (Instruct)
·440 words·3 mins
Articoli API AI Foundation Model
VibeVoice: A Frontier Open-Source Text-to-Speech Model
·588 words·3 mins
Hacker News Framework Best Practices Foundation Model Natural Language Processing
Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS
·580 words·3 mins
Hacker News LLM AI Foundation Model
Alexander Kruel - Links for 2025-08-24
·505 words·3 mins
Articoli Foundation Model AI
DSPy
·524 words·3 mins
Articoli Framework Best Practices Foundation Model LLM
Build a Large Language Model (From Scratch)
·604 words·3 mins
GitHub Foundation Model LLM Open Source
CS294/194-196 Large Language Model Agents | CS 194/294-196 Large Language Model Agents
·556 words·3 mins
Articoli AI Agent Foundation Model LLM
Claudia – Desktop companion for Claude code
·597 words·3 mins
Hacker News Foundation Model AI
The race for LLM cognitive core
·553 words·3 mins
Articoli LLM Foundation Model
Qwen3-Coder: Agentic coding in the world
·617 words·3 mins
Hacker News AI Agent Foundation Model
Voxtral | Mistral AI
·443 words·3 mins
Articoli AI Foundation Model
SymbolicAI: A neuro-symbolic perspective on LLMs
·658 words·4 mins
Hacker News Framework Foundation Model Python Best Practices LLM AI
Gemini for Google Workspace Prompting Guide 101
·464 words·3 mins
Articoli AI Go Foundation Model
MCP is eating the world—and it's here to stay
·552 words·3 mins
Articoli Natural Language Processing AI Foundation Model
Building Effective AI Agents
·597 words·3 mins
Hacker News AI Agent AI Foundation Model
Snorting the AGI with Claude Code
·648 words·4 mins
Hacker News Framework Code Review AI Best Practices Foundation Model
Nanonets-OCR-s – OCR model that transforms documents into structured markdown
·589 words·3 mins
Hacker News LLM Foundation Model
[2505.24863] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
·548 words·3 mins
Articoli Foundation Model
[2505.24864] ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
·538 words·3 mins
Corso LLM Foundation Model
Ask HN: What is the best LLM for consumer grade hardware?
·602 words·3 mins
Hacker News LLM Foundation Model
Show HN: AutoThink – Boosts local LLM performance with adaptive reasoning
·609 words·3 mins
Hacker News LLM Foundation Model
Show HN: My LLM CLI tool can run tools now, from Python code or plugins
·674 words·4 mins
Hacker News Tool LLM Foundation Model Python
A Research Preview of Codex
·600 words·3 mins
Hacker News AI Foundation Model
Ollama's new engine for multimodal models
·520 words·3 mins
Articoli Foundation Model
Vision Now Available in Llama.cpp
·556 words·3 mins
Hacker News Foundation Model AI Computer Vision
Token & Token Usage | DeepSeek API Docs
·486 words·3 mins
Articoli API Natural Language Processing Foundation Model
Gemma 3 QAT Models: Bringing state-of-the-Art AI to consumer GPUs
·540 words·3 mins
Articoli Go Foundation Model AI
GitHub - HandsOnLLM/Hands-On-Large-Language-Models: Official code repository for the O'Reilly Book - 'Hands-On Large Language Models'
·1324 words·7 mins
GitHub LLM Open Source Foundation Model
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning | Nature
·574 words·3 mins
Corso Framework LLM AI Best Practices Foundation Model
A foundation model to predict and capture human cognition | Nature
·490 words·3 mins
Articoli Go Foundation Model Natural Language Processing LLM AI
Large language models are proficient in solving and creating emotional intelligence tests | Communications Psychology
·546 words·3 mins
Articoli AI LLM Foundation Model