
From Pikachus to Pizza: How Pokémon Go Data Navigates Delivery Robots
Pokémon Go's massive database of crowdsourced AR images is now being used to provide centimeter-level navigation for autonomous delivery robots.

Pokémon Go's massive database of crowdsourced AR images is now being used to provide centimeter-level navigation for autonomous delivery robots.
CorridorKey is an AI-driven green screen keyer that uses neural networks to reconstruct true foreground colors and delicate transparency for professional VFX compositing.

An open-source macOS app that uses your camera to detect slouching and gently enforce better posture by blurring the screen.

GeoSpy’s SuperBolt upgrades photo geolocation from miles to meters, enabling rapid, precise, and scalable vehicle recovery.

An open-source tool that turns SVGs into real-time, browser-based puppets using PoseNet/FaceMesh and smart vector deformation.

Samsung’s 2025 Family Hub update brings a unified interface, smarter food tracking, personalized Bixby, and expanded Knox security to its smart home ecosystem.
Skyfall-GS fuses satellite imagery with diffusion-driven iterative refinement to produce real-time, city-scale 3D scenes with superior geometry and textures—without 3D annotations.

An AI gun detector misread a Doritos bag as a weapon, triggering an armed police response and renewing concerns about AI surveillance in schools.

AI checkouts at BMO Stadium made everything slower, simpler, and worse for fans—especially in the heat—despite claims they’re faster.

An LLM-focused, high-throughput OCR system that compresses visual context for efficient document and image understanding.
Focus-stacked macro photography plus COLMAP and Postshot yields sharp, photoreal 3D Gaussian splats of insects, with a free CC BY model shared.

Google’s Gemini 2.5 Computer Use brings high-accuracy, low-latency, safety-aware UI control to developers via the Gemini API.

Veo 3’s emergent zero-shot skills across perception, physics, manipulation, and reasoning point to video models becoming generalist vision foundation models.

An open-source, world-consistent RGB-D video generator that turns a single image into controllable, long-range 3D scene explorations with state-of-the-art performance.
An AR-style setup lets a fluid simulation collide with real objects by aligning a webcam feed—filtered to avoid feedback—with the digital solver.