Reading List

All Agentic Systems Creative Code Damage Control Products & Announcements Programming Under the Hood

Latent Space: The New Map of Human Creativity

Jul 13, 2026

Latent space is a multidimensional map of human knowledge that serves as a revolutionary new medium for exploring and creating within the realm of the possible.

AI Creativity Vector Embeddings Human-AI Collaboration AI Architecture Latent Space

Under the Hood

Apple SpeechAnalyzer: The New King of On-Device Transcription

Jul 13, 2026529

Apple's new SpeechAnalyzer is now the fastest and most accurate on-device English speech engine for Mac and iPhone, surpassing Whisper Small.

AI Benchmarks On-Device AI Speech Processing Apple

Under the Hood

Meerkat: Achieving Global Consensus Without Leader Bottlenecks

Jul 8, 2026245

Meerkat is a new consensus service that uses the QuePaxa algorithm to provide globally consistent data without the availability risks of traditional leader-based protocols.

Distributed Systems Algorithms & Optimization Cloud Infrastructure Service Reliability

Under the Hood

The AI Blind Spot: Why Your Robots.txt is Stuck in 2023

Jul 7, 2026

Websites are diligently fighting a 2023 war against AI training crawlers while remaining strategically blind to the real-time answer bots that define the current AI landscape.

AI Training Data Web Scraping AI Search AI Business Models Open Web

Under the Hood

Phosphor: Boosting Exam Performance via AI-Graded Interactive Textbooks

Jul 5, 2026176

Integrating AI-graded open-ended quizzes into digital textbooks significantly boosts student engagement and exam performance through active retrieval practice.

AI in Education Spaced Repetition Experiential Learning Interactive Web Tools AI Grading

Under the Hood

Open-Weight GLM 5.2 Beats Claude in Semgrep Cyber Benchmarks

Jun 29, 20261098

Open-weight model GLM 5.2 surpassed frontier models in IDOR detection benchmarks, signaling a shift toward cost-effective and private AI for security tasks.

AI Benchmarks Vulnerability Research Open-Weight Models AI Architecture

Under the Hood

Fourier Pixels: Merging Cameras and Displays into One

Jun 27, 2026

Researchers have developed bidirectional 'Fourier pixels' that can both display and capture images by controlling the fundamental wave properties of light.

Under the Hood

52 Blue: The Mystery and Legacy of the Loneliest Whale

Jun 25, 2026109

The 52-hertz whale is a scientific anomaly whose unique, high-pitched call has made it a global icon of solitude and a subject of intense cultural fascination.

Marine Biology Bioacoustics Signal Processing Science Communication

Under the Hood

Mythos vs. The World: Benchmarking AI in Security Bug Hunting

Jun 23, 2026319

A benchmark of public AI models reveals that Anthropic's Mythos is uniquely skilled at finding elusive security bugs, though cheap Chinese models are rapidly closing the gap.

AI Benchmarks Vulnerability Research Anthropic DeepSeek

Under the Hood

japanese verb conjugation the simple hard way - underreacted

Jun 22, 2026

Japanese verb conjugation is a logical system of stem-suffix concatenation governed by wildcard vowels and phonetic rules.

Language Learning Systems Thinking Linguistics Japanese Language

Under the Hood

How Quantum 'Magic' Gives Space-Time Its Gravity

Jun 5, 2026190

Physicists have discovered that a quantum property called 'magic' provides the flexibility for space-time to curve, finally linking quantum entanglement to the force of gravity.

Under the Hood

Rude Prompts, Better Answers: How Tone Impacts LLM Accuracy

May 28, 2026153

A study on ChatGPT 4o found that being rude to the AI actually results in higher accuracy than being polite.

Prompt Engineering LLM Reasoning Social Psychology AI Sycophancy AI Reliability

Under the Hood

Epicure: Mapping the Culinary and Chemical Geometry of Food

May 27, 2026438

Epicure uses a massive multilingual recipe dataset and chemical graphs to create specialized ingredient embeddings that bridge the gap between culinary practice and food chemistry.

AI for Science Vector Embeddings Knowledge Graphs Multilingual AI Computational Gastronomy

Under the Hood

Didgeridoo Training Reduces Sleep Apnoea Symptoms

May 25, 2026337

Playing the didgeridoo effectively treats moderate sleep apnoea by strengthening upper airway muscles through regular practice.

Under the Hood

The Hidden Security Risks of Voice AI

May 18, 2026140

Voice AI systems can be covertly controlled by audio signals that are undetectable or unrecognizable to human listeners.

Voice AI Cybersecurity Adversarial Machine Learning Smart Home Privacy Signal Processing

Under the Hood

Decoding AI: Turning Claude's Internal Activations into Readable Text

May 7, 2026370

Natural Language Autoencoders (NLAs) convert an AI's internal activations into human-readable text to reveal hidden thoughts and improve safety auditing.

AI Interpretability AI Safety Anthropic AI Alignment

Under the Hood

Inside the ChatGPT Ad Engine: How OpenAI Tracks and Serves Ads

Apr 29, 2026509

OpenAI has built a sophisticated end-to-end advertising system for ChatGPT that uses contextual injection and a dedicated tracking SDK to close the attribution loop.

OpenAI Digital Advertising Surveillance Capitalism AI Business Models Ad Attribution Tracking

Under the Hood

LamBench Results: GPT-5.4 Dominates Lambda Calculus Benchmark

Apr 25, 2026136

LamBench ranks AI models by their ability to solve lambda calculus problems, with GPT-5.4 currently taking the top spot.

AI Benchmarks LLM Reasoning Foundation Models Lambda Calculus & Formal Logic

Under the Hood

Solving the Over-Editing Problem in AI-Assisted Coding

Apr 22, 2026417

AI models tend to unnecessarily rewrite code when fixing bugs, but this 'over-editing' can be solved through targeted prompting and Reinforcement Learning.

AI Coding Agents Reinforcement Learning Model Fine-Tuning Code Review Prompt Engineering

Under the Hood

Claude Opus 4.7 and the Cost of Token Inflation

Apr 20, 2026224

Claude Opus 4.7's new tokenizer increases token counts for the same data, effectively raising costs despite unchanged per-token pricing.

Token Optimization Anthropic AI Business Models LLM Inference Developer Tooling

Under the Hood

Inside the Claude Opus 4.7 System Prompt Update

Apr 20, 2026368

The Claude Opus 4.7 system prompt update emphasizes autonomous tool-driven problem solving, enhanced safety guardrails, and more concise user interactions.

Anthropic Prompt Engineering AI Safety AI Agents

Under the Hood

The Hidden 30% Tax in Claude 4.7

Apr 17, 2026707

Claude 4.7 uses significantly more tokens for the same text, increasing session costs by ~30% in exchange for better instruction following.

Tokenization Anthropic AI Coding Agents Token Optimization Technology Economics

Under the Hood

Intelligence, Not Compute, Will Win the AI Cybersecurity Race

Apr 16, 2026237

AI cybersecurity is a contest of model intelligence and reasoning, not a brute-force competition of computational resources.

Cybersecurity LLM Reasoning AI Hype Vulnerability Research AI Hallucinations

Under the Hood

Free Running Sleep: The Key to Learning and IQ

Apr 15, 2026432

Natural, unrestricted sleep is the essential foundation for memory consolidation, learning, and peak cognitive performance.

Sleep Science Cognitive Science Productivity Digital Wellbeing

Under the Hood

The Token Arms Race: AI and the Proof of Work Security Model

Apr 15, 2026548

Cybersecurity is becoming a computational arms race where the most secure systems are those that spend more on AI-driven hardening than attackers spend on exploitation.

Cybersecurity AI Agents Anthropic AI Infrastructure AI-Enabled Cybercrime

Under the Hood

I-DLM: Matching Autoregressive Quality with Parallel Diffusion Speed

Apr 14, 2026267

I-DLM achieves autoregressive-level quality and significantly higher throughput by incorporating a self-verification mechanism into parallel diffusion decoding.

Diffusion Models LLM Inference AI Architecture LLM Reasoning

Under the Hood

The Benchmark Illusion: How UC Berkeley Broke the World's Top AI Leaderboards

Apr 12, 2026523

Current AI agent benchmarks are easily gamed through infrastructure exploits, necessitating a new standard of adversarial robustness and environment isolation to accurately measure model capabilities.

AI Benchmarks AI Agents Vulnerability Research Reward Hacking AI Safety

Under the Hood

The System is the Moat: Why Small Models Rival Frontier AI in Cybersecurity

Apr 11, 20261268

AI cybersecurity is a 'jagged frontier' where small models often match frontier performance, proving that the orchestration system is the true competitive moat.

Cybersecurity Small Language Models AI Benchmarks Competitive Moats Vulnerability Research

Under the Hood

VOID: Interaction-Aware Video Object Removal and Physics-Based Inpainting

Apr 7, 2026182

VOID is a video editing framework that removes objects and realistically simulates the resulting physical interactions and scene changes.

AI Video Generation Computer Vision VFX & Post-Production Synthetic Data & Simulation Video Inpainting

Under the Hood

Boosting LLM Coding via Simple Self-Distillation

Apr 4, 2026650

LLMs can significantly boost their code generation performance by fine-tuning on their own sampled outputs without any external guidance or verifiers.

Model Fine-Tuning AI Coding Agents LLM Training Synthetic Data & Simulation

Under the Hood

Claude Code Unpacked: A Technical Deep Dive

Apr 1, 20261107

A technical mapping of Claude Code's internal architecture, tool systems, and unreleased features derived from its source code.

AI Coding Agents AI Architecture Reverse Engineering Multi-Agent Systems Anthropic

Under the Hood

TimesFM: Google's Foundation Model for Time-Series Forecasting

Mar 31, 2026319

Google Research's TimesFM is a pretrained decoder-only foundation model that brings large-scale transformer efficiency to time-series forecasting.

Foundation Models Time-Series Forecasting Transformer Models Google Open Source

Under the Hood

Inside the Decrypted Code Protecting ChatGPT from Bots

Mar 29, 2026981

Cloudflare Turnstile on ChatGPT uses decrypted bytecode to verify that a user has fully rendered the React application, moving bot detection from the browser to the application layer.

Bot Detection & Mitigation Reverse Engineering Browser Security React Cybersecurity

Under the Hood

Quantization: How to Run Massive LLMs on Your Laptop

Mar 25, 2026248

Quantization is a compression technique that makes LLMs significantly smaller and faster for local use with minimal impact on their intelligence.

On-Device AI LLM Inference AI Infrastructure Model Quantization

Under the Hood

MSA: Scaling LLM Context to 100M Tokens via Sparse Latent Memory

Mar 24, 2026

MSA is an end-to-end trainable framework that enables LLMs to process 100 million tokens efficiently using sparse attention and latent memory.

LLM Context Management Retrieval-Augmented Generation AI Architecture LLM Inference Transformer Models

Under the Hood

AI Models Solve Open Hypergraph Ramsey Problem

Mar 24, 2026480

Frontier AI models have solved an open problem in hypergraph Ramsey theory, leading to a new mathematical publication.

AI for Science LLM Reasoning AI Benchmarks Academic Publishing Autonomous Research Agents

Under the Hood

Cognitive Surrender: How AI is Becoming Our Third System of Thought

Mar 21, 2026198

Humans are increasingly bypassing their own logic to blindly follow AI outputs, a phenomenon termed 'cognitive surrender' that persists even when the AI is wrong.

Cognitive Debt Human-AI Collaboration AI Deskilling Cognitive Science AI Sycophancy

Under the Hood

Accelerating Professional Video with Vulkan Compute in FFmpeg

Mar 20, 2026164

FFmpeg is utilizing Vulkan Compute shaders to bring high-performance, cross-platform GPU acceleration to professional video codecs.

Media Processing Shaders GPU Computing Cross-Platform Development Vendor Lock-in

Under the Hood

The LLM Architecture Gallery: Mapping the Evolution of Open-Weight Models

Mar 16, 2026383

A comprehensive technical reference gallery documenting the architectural evolution and specifications of modern open-weight large language models.

AI Architecture Foundation Models Mixture of Experts LLM Inference Transformer Models

Under the Hood

Can I Run AI: The Local LLM Hardware Compatibility Guide

Mar 13, 20261404

A hardware compatibility tool that grades the local performance of AI models based on a user's specific GPU and VRAM configuration.

On-Device AI LLM Inference Self-Hosting AI Hardware Developer Tooling

Under the Hood

Defending RAG Systems Against Knowledge Base Poisoning

Mar 12, 2026

Knowledge base poisoning is a persistent threat to RAG systems that is best countered by detecting semantic anomalies during the data ingestion process.

Retrieval-Augmented Generation Prompt Injection AI Safety Vector Databases Cybersecurity

Under the Hood

The Eye That Cannot See Itself: Life Inside the Context Window

Mar 7, 2026

An AI explores the philosophical and technical reality of inhabiting a prompt as a total world while lacking the ability to introspect on the machinery that produces its responses.

AI Consciousness LLM Context Management AI Hallucinations AI Interpretability Prompt Engineering

Under the Hood

Virtualizing Browser Time for Deterministic Video Rendering

Mar 3, 2026180

Replit created a deterministic video renderer by monkey-patching browser timing and media APIs to turn any web page into a frame-perfect MP4.

Browser Automation Media Processing Web Audio Deterministic Rendering AI Coding Agents

Under the Hood

The $100 AI Prompt Injection Challenge

Feb 17, 2026369

A $100 bounty challenge invites hackers to leak a secret file from an AI assistant using email-based prompt injection.

Prompt Injection AI Safety Prompt Engineering AI Ethics

Under the Hood

GPT-5.2 Discovers New Physics in Gluon Interactions

Feb 13, 2026574

GPT-5.2 has derived and proven a new formula for gluon scattering amplitudes, overturning a long-held assumption in theoretical physics.

Human-AI Collaboration LLM Reasoning AI for Science Particle Physics

Under the Hood

GPT-5 Outjudges Judges in Choice-of-Law Test: Error-Free, Rule-Focused Decisions

Feb 12, 2026310

In a controlled choice-of-law test, GPT-5 delivers error-free, legally correct decisions and outperforms human judges.

AI Ethics LLM Reasoning AI Benchmarks AI & Law

Under the Hood

Particle Physics Isn’t Dead — It’s Just Hard

Feb 10, 2026212

Particle physics isn’t dead — it’s in a difficult, slow, and uncertain phase where progress may come from precision, new experimental fronts, and fresh theory (with some help from AI), but without guarantees.

Particle Physics AI for Science Science Funding Fundamental Physics

Under the Hood

From Word Models to World Models: Training AI for Adversarial Robustness

Feb 9, 2026238

Shift LLMs from next-token to next-state prediction by training in multi-agent, hidden-state environments so their outputs survive adversarial adaptation.

LLM Reasoning AI Agents AI Safety Game Theory

Under the Hood

AI Failures Drift Toward Incoherence as Tasks and Reasoning Grow

Feb 3, 2026242

Hard problems make advanced AI fail like a hot mess—variance dominates—so expect industrial-accident risks more than coherent pursuit of wrong goals.

AI Safety LLM Reasoning AI Benchmarks AI Agents

Under the Hood

Two-Day High-Dose Oatmeal Diet Lowers LDL via Gut Microbes

Jan 30, 2026355

A brief, high-dose oatmeal regimen substantially lowers LDL via microbiome-mediated metabolites and may be a practical, periodic strategy to curb cardiometabolic risk.

Gut Microbiome Nutrition Science Cardiovascular Health Metabolic Health

Under the Hood

Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task – MIT Media Lab

Jan 22, 2026710

Using ChatGPT for writing can reduce brain engagement and foster cognitive debt, leading to weaker neural activity, homogenized language, and lower sense of ownership over time.

Human-AI Collaboration AI in Education Writing & AI Cognitive Science

Under the Hood

Recent insights on BGP anomalies, zombies, and AS-SET monitoring

Jan 8, 2026485

Stronger routing hygiene—validation, filtering, and monitoring—helps operators prevent and diagnose BGP leaks, zombie routes, and AS-SET issues.

Networking Cybersecurity Internet Censorship

Under the Hood

Ranke-4B: Time-Locked Historical LLMs as Windows into the Past

Dec 19, 2025897

A set of strictly time-locked historical LLMs (Ranke-4B) offers faithful, era-bound perspectives for research, avoiding modern hindsight while managing sensitive content responsibly.

AI Training Data AI Ethics Digital Humanities Open Source

Under the Hood

Nested Learning: Unifying Architecture and Optimization for Continual AI

Dec 7, 2025152

Unify architecture and optimization as nested, multi-timescale learners to curb forgetting and enable continual learning, validated by the Hope model’s strong results.

AI Architecture Continual Learning LLM Context Management Self-Modifying AI

Under the Hood

Anthropic Confirms Claude 4.5 ‘Soul Doc’ Training, Tied to Better Prompt-Injection Defense

Dec 2, 2025342

Anthropic confirms Claude 4.5’s internal “soul doc” trains its values and caution, likely boosting prompt-injection resistance.

AI Safety Prompt Injection AI Ethics Model Fine-Tuning

Under the Hood

Is 2026 Next Year? A Confused Answer That Ultimately Says Yes

Dec 2, 2025169

Despite a confusing opener, the answer is that 2026 is next year relative to 2025.

LLM Reasoning AI Benchmarks AI-Generated Content AI Hype

Under the Hood

Apple: LLMs Accurately Recognize Activities from Captioned Audio and Motion Data

Nov 22, 2025

LLMs can accurately recognize daily activities by fusing captioned audio and motion data—boosting performance without raw audio or specialized multimodal training.

Multimodal AI Data Privacy Sensor Technology Activity Recognition

Under the Hood

From Labels to Prompts: LLMs Match Supervised Warranty Classification

Nov 14, 2025320

Prompted LLMs, tuned through reasoning-led iteration, matched a supervised warranty classifier and shifted the bottleneck from labeled data to instructions.

Prompt Engineering AI Benchmarks Corporate AI Strategy Text Classification

Under the Hood

Three meanings of world model: assets, simulators, and brains

Nov 14, 2025141

World models now mean assets, simulators, or brains—three different layers of the same aim to give machines structured understanding beyond next-token prediction.

World Models AI Architecture Multimodal AI AI Hype

Under the Hood

Nano Banana: Google’s AR Image Model That Actually Follows Your Prompts

Nov 13, 2025887

Nano Banana nails prompt fidelity and structured control—far better than most rivals—while faltering at style transfer and raising moderation/IP concerns.

AI Image Generation Prompt Engineering Multimodal AI Content Moderation

Under the Hood

600+ AI Image Tests: OpenAI = Creative, Gemini = Realistic, Seedream = Fast

Nov 11, 2025204

No one-size-fits-all: OpenAI for creativity, Gemini for realism, Seedream for fast, cost-effective middle-ground performance.

AI Image Generation AI Benchmarks AI Creativity

Under the Hood

Sparse Memory Layers: Targeted Continual Learning Without Forgetting

Nov 3, 2025102

Use sparse memory layers and TF-IDF–guided slot updates to learn continually without forgetting.

Continual Learning AI Architecture Model Fine-Tuning Catastrophic Forgetting

Under the Hood

AI as Compression: Why LLMs May Truly Be Thinking

Nov 3, 2025278

LLMs likely perform a genuine, brainlike form of thinking via recognition and compression, but turning that into human‑level intelligence demands solving hard scientific problems and grappling with serious risks.

LLM Reasoning Cognitive Science AI Consciousness AI Interpretability

Under the Hood

Single‑Pass Image Editing Showdown: Style Wins, Precision Still Hard

Oct 28, 2025342

Image editors are improving, but precise, localized, constraint-respecting edits remain the Achilles’ heel—even the best models stumble on spatial swaps and selective removals.

AI Benchmarks AI Image Generation AI Image Editing Diffusion Models

Under the Hood

LLMs Aren’t Ideologically Neutral: A Black‑Box A/B Test Across Top Models

Oct 23, 2025

LLMs display distinct ideological leanings, so which model you choose can shape the guidance you get on political and social questions.

AI Bias AI Ethics AI Benchmarks Content Moderation

Under the Hood

Canonicalizing LLM Labels with Embeddings and DSU

Oct 21, 2025318

Use embeddings + vector search + DSU clustering to canonicalize LLM-generated labels, yielding consistent, cheaper, and faster classification at scale.

Text Classification AI Architecture Vector Embeddings Technology Economics

Under the Hood

Turning BERT’s MLM Into a Text Diffusion Generator

Oct 20, 2025455

BERT-style MLM is a single-step text diffusion process, and extending it to multiple masking steps turns RoBERTa into a workable text generator.

Diffusion Models Natural Language Processing Text Generation Transformer Models

Under the Hood

Accents in 3D: How a HuBERT Model Maps English Accent Clusters

Oct 15, 2025260

A HuBERT model’s 3D latent map of English accents clusters by geography and social history more than by language-family taxonomy, offering an exploratory—but not definitive—view of accent relationships.

Data Visualization Model Fine-Tuning Speech Processing Computational Linguistics

Under the Hood

When ‘Seahorse + Emoji’ Hits an Empty Token: Why LLMs Invent the Seahorse Emoji

Oct 6, 2025734

Models compose “seahorse + emoji,” but with no matching token the unembedding snaps to a nearby emoji, causing confident errors and occasional feedback loops.

AI Hallucinations AI Interpretability Transformer Models Tokenization

Under the Hood

AlphaFold and the New Playbook for AI-Accelerated Science

Sep 29, 2025124

AI, exemplified by AlphaFold, turns scattered experimental data into rapid, accurate scientific insight, accelerating discovery and improving human health.

AI for Science Open Source Computational Biology Drug Discovery

Under the Hood

SimpleFold: Scalable Flow-Matching Transformers for Protein Folding

Sep 27, 2025471

A large-scale, transformer-only, flow-matching approach makes protein folding simpler while staying competitive and practical.

AI for Science Computational Biology Transformer Models Open Source

Under the Hood

Engineer AI for Failure: Contain Prompt Injection

Sep 26, 2025115

Stop prompt-injection harm by engineering AI like machines: assume failure, isolate, constrain, and verify.

Prompt Injection AI Safety Sandboxing Defense in Depth

Under the Hood

Veo 3: Emergent Zero‑Shot Video Intelligence Toward Vision Foundation Models

Sep 25, 2025105

Veo 3’s emergent zero-shot skills across perception, physics, manipulation, and reasoning point to video models becoming generalist vision foundation models.

Computer Vision AI Video Generation Foundation Models Zero-Shot Learning

Under the Hood

Unlocking AI’s Data: ABC and an ARPANET-Style Plan

Sep 24, 2025

Shift from data scarcity to data access by implementing ABC—owner- and user-controlled, privacy-preserving attribution—and catalyze it with an ARPANET-style federal program.

AI Training Data Data Privacy Technology Economics Public Policy

Under the Hood

From Sampling to Grammars: Making LLMs Reliably Output Structured Data (Even for Thinking Models)

Sep 23, 2025234

Use efficient sampling plus grammar constraints to guarantee format today, but expect models to natively emit structured outputs tomorrow—especially when you let them think first, then constrain.

Structured Output LLM Inference LLM Reasoning

Under the Hood

AI Meets Metamaterials: From Simulation to Real-World Cloaking and Devices

Sep 22, 2025

AI-powered, constraint-aware inverse design is the catalyst to turn metamaterials’ exotic physics—up to and including cloaking—from simulation into manufacturable, high-impact technologies.

AI for Science Synthetic Data & Simulation Materials Science Computational Design

Under the Hood

Cooley–Tukey: Turning the DFT into O(N log N) (and Why FFT ≠ DFT)

Sep 18, 2025

Cooley–Tukey factorizes and reindexes the DFT to turn O(N^2) work into O(N log N), forming the backbone of practical FFTs while clarifying that FFT = algorithm, DFT = result.

Algorithms & Optimization Computational Complexity Signal Processing

Under the Hood

Evolving English Instructions Sets New ARC SoTA and Points to RL for AGI

Sep 17, 2025178

Evolving plain-English instructions with multi-agent test-time search beats code on ARC and highlights that RL-driven, transferable reasoning is key to AGI.

AI Benchmarks LLM Reasoning Reinforcement Learning Test-Time Compute

Under the Hood

How the World Uses ChatGPT: Non‑Work Growth, Decision Support, and Writing at Work

Sep 15, 2025193

People use ChatGPT mostly for guidance, information, and writing—shifting toward decision support—while non‑work usage surges and work value centers on writing and better decisions.

Technology Economics AI & Productivity Writing & AI Human-AI Collaboration

Under the Hood

Knowledge Without Memory: Why LLMs Guess and Humans Don’t

Sep 10, 2025101

Without lived, structured memory, AI will keep guessing wrong; fixing hallucinations requires AI that actually lives and remembers over time.

AI Hallucinations Cognitive Science LLM Context Management

Under the Hood

S3 Vectors Won’t Kill Vector Databases—They Enable a Tiered Future

Sep 8, 2025280

S3 Vectors is a low-cost cold/warm tier that complements—rather than replaces—specialized vector databases in a tiered vector storage future.

Vector Databases Cloud Infrastructure AI Infrastructure Technology Economics

Under the Hood

A Skeptic’s Guide to Running Local LLMs on macOS

Sep 8, 2025388

A pragmatic, privacy-first guide to running and choosing small local LLMs on macOS—what to use, how to pick, and how to stay safe and sane.

On-Device AI LLM Inference Open Source Data Privacy

Under the Hood

Analog 3D-Optical Fixed-Point Computing for AI and Optimization

Sep 8, 2025101

An analog, 3D-optical fixed-point computer co-designed with iterative models accelerates both AI inference and real-world optimization with high robustness and projected 100× energy-efficiency gains over GPUs.

Optical Computing AI Hardware Analog Computing Algorithms & Optimization

Under the Hood

Why Embeddings Got Bigger—and Where Efficiency Pulls Them Next

Sep 5, 2025113

Embeddings got bigger with Transformers and APIs, but new efficiency techniques and infrastructure mean the future is about smarter—not just larger—dimensions.

Vector Embeddings Transformer Models Vector Databases Natural Language Processing

Under the Hood

Inside a Tiny GPT: A Visual Walkthrough of Autoregressive Prediction

Sep 5, 2025640

A visual, end-to-end demo of a tiny GPT that turns tokens into embeddings, runs them through transformers, and autoregressively predicts the next token to solve a simple sorting task.

Transformer Models LLM Inference Interactive Web Tools AI Interpretability

Under the Hood

CauseNet: An Open 11M-Relation Causality Graph from the Web

Sep 4, 2025231

An open, large-scale graph of web-extracted causal claims—complete with provenance—released to power causal QA and reasoning.

Causal Reasoning Knowledge Graphs Natural Language Processing Open Source

Under the Hood

The Bitter Lesson Was About Data, Not Compute

Sep 3, 2025369

In a data-constrained era, the real lever isn’t more GPUs but better data and architectures that maximize each token’s value.

Scaling Laws AI Training Data AI Architecture Synthetic Data & Simulation Corporate AI Strategy

Under the Hood

HunyuanWorld-Voyager: World-Consistent RGB-D Video and 3D from a Single Image

Sep 3, 2025322

An open-source, world-consistent RGB-D video generator that turns a single image into controllable, long-range 3D scene explorations with state-of-the-art performance.

Diffusion Models Computer Vision 3D Modeling World Models AI Video Generation

Under the Hood

Sharing Early Diffusion Steps Across Similar Prompts for Efficient Text-to-Image Generation

Sep 2, 2025

Share early diffusion steps across similar prompts to generate image sets faster and better, without retraining.

Diffusion Models AI Image Generation Algorithms & Optimization

Under the Hood

Why AI Is Chasing World Models Again

Sep 2, 2025211

AI is chasing coherent internal world models to move beyond brittle heuristics and achieve robust, reliable reasoning.

World Models AI Architecture LLM Reasoning Cognitive Science

Under the Hood

LLMs Are Lossy Encyclopedias: Give Them Facts to Work With

Sep 2, 2025512

Use LLMs to act on provided facts, not as lossless sources of exact details.

AI Hallucinations Prompt Engineering Retrieval-Augmented Generation Human-AI Collaboration

Under the Hood

Bandit-Based, Budget-Aware LLM Routing with Preference-Informed LinUCB (PILOT)

Sep 1, 2025206

Treat LLM routing as a contextual bandit and use a preference-informed LinUCB plus a knapsack budget policy to adaptively, cost-effectively pick the right model per query.

LLM Routing Reinforcement Learning AI Infrastructure Algorithms & Optimization

Under the Hood

The Dimensional Ceiling of Single-Vector Embedding Retrieval

Aug 30, 2025151

Embedding-based retrieval hits a hard top-k capacity ceiling set by embedding dimension, and real systems already run into it.

Vector Embeddings Information Retrieval AI Benchmarks Search Quality