The AI Wire

5155 articles — page 19 of 172

City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images (arxiv.org)

2026-05-29|paper|arxiv

Reconstructs simulation-ready, city-scale 3D meshes from multi-view images suitable for use in downstream urban simulation pipelines.

On Language Generation in the Limit with Bounded Memory (arxiv.org)

2026-05-29|paper|arxiv

Establishes theoretical characterizations of which languages can be generated in the limit by algorithms constrained to bounded memory resources.

RoboWits: Unexpected Challenges for Robotic Creative Problem Solving (arxiv.org)

2026-05-29|paper|arxiv

RoboWits introduces a benchmark of creative, open-ended physical problem-solving tasks designed to expose unexpected failure modes in robotic AI systems.

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?(arxiv.org)

2026-05-29|paper|arxiv

SoundnessBench evaluates whether AI research-idea generation systems can reliably distinguish scientifically valid hypotheses from flawed or unsound ones.

COMPOSE: Composing Future Theorems from Citations and Formal Structure (arxiv.org)

2026-05-29|paper|arxiv

COMPOSE automatically synthesizes novel formal theorem statements by combining citation graphs and structural patterns from existing mathematical literature.

Fairness-Aware Federated Learning with Trajectory Shapley Value (arxiv.org)

2026-05-29|paper|arxiv

Introduces Trajectory Shapley Value to fairly attribute contributions of federated clients over training trajectories, enabling fairness-aware model aggregation.

DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation (arxiv.org)

2026-05-29|paper|arxiv

Fuses RGB, depth, and event-based sensing through dynamics-guided representations to improve robotic perception across varying motion and lighting conditions.

Claude Code – Everything You Can Configure That the Docs Don't Tell You (buildingbetter.tech)

2026-05-29|news|hackernews

A guide exposes undocumented Claude Code configuration options, giving practitioners finer control over behavior beyond what official documentation covers.

Key points

2026-05-29|model|perplexity

- OpenAI published a **Frontier Governance Framework** explaining how internal safety practices map to emerging regulation and risk‑assessment requirements for **frontier models**.[8] - In a related cybersecurity post, OpenAI references **GPT‑5.5** as “our smartest and most intuitive model to date,” with strong cybersecurity capabilities, noting it was released *two weeks before* that article.[2]

Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments (huggingface.co)

2026-05-29|model|huggingface

A unified risk map framework is learned for autonomous driving that integrates partial observability, aggregating heterogeneous risk signals into a single spatial representation.

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood (huggingface.co)

2026-05-29|model|huggingface

Provides a benchmark evaluating speech and audio-language models on child-produced sounds, covering developmental speech characteristics across different childhood age groups.

Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series (arxiv.org)

2026-05-29|paper|arxiv

Adapts the jackknife resampling method to handle temporal dependencies in time series by excluding contiguous windows rather than individual observations.

Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion (arxiv.org)

2026-05-29|paper|arxiv

Applies matrix completion techniques to heterogeneous treatment-effect estimation, yielding tighter theoretical guarantees than prior methods under weaker assumptions.

Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection (arxiv.org)

2026-05-29|paper|arxiv

Applies a compact vision-language model to time-series anomaly detection, achieving trusted, efficient inference suitable for resource-constrained deployment.

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software (arxiv.org)

2026-05-29|paper|arxiv

Examines whether physics domain knowledge alone suffices to guide AI-assisted scientific software development, using physicist-supervised workflows as a case study.

datasette 1.0a31 (simonwillison.net)

2026-05-29|news|blog/Simon Willison

Releases version 1.0a31 of Datasette, the open-source tool for exploring and publishing SQLite databases, with incremental fixes or features toward stable 1.0.

How Endava builds an agentic organization with Codex (openai.com)

2026-05-29|news|blog/OpenAI Blog

Endava, an IT services firm, restructured its engineering workflows by deploying OpenAI Codex agents to automate software development tasks organization-wide.

Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes (arxiv.org)

2026-05-29|paper|arxiv

Plans portrait photography by suggesting aesthetically optimal camera angles and actionable shooting instructions within a reconstructed 3D scene before capture.

SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations (arxiv.org)

2026-05-29|paper|arxiv

Generates PCB schematics by representing circuit designs as semantically grounded code, enabling LLMs to produce structured, meaningful schematic outputs.

Python utility package for building Claude Code hooks (github.com)

2026-05-29|news|hackernews

A Python package provides reusable utilities for defining, registering, and managing lifecycle hooks that extend or customize Claude Code agent behavior.

What’s not present this week (based on available information)

2026-05-29|model|perplexity

Within approximately the last 7 days, there are **no publicly documented releases** that meet all of your criteria of: - Brand‑new **frontier base models** from OpenAI, Anthropic (beyond Opus 4.8), Google, Meta, or Microsoft. - Newly released, **high‑capability open‑source base models** with clearly superior benchmarks, substantial new architecture, or paradigm‑shift behaviors. - Novel architectures (e.g., radically different from transformer‑variants) released as broadly usable models, not

Why it’s frontier‑relevant

2026-05-29|model|perplexity

- While not a model, this is a direct indicator of rapidly increasing capital behind **frontier model R&D and training runs** at Anthropic, including successor models beyond Claude Opus 4.8. - For forecasting **near‑future model releases**, this kind of funding event is a key structural signal in the frontier race. ---

1. Anthropic – Project Glasswing (early agentic security system)

2026-05-29|model|perplexity

Project Glasswing is an early Anthropic agentic system designed to perform automated security monitoring and threat detection using AI agents.

@@trq212: I think you’ll really like Opus 4.8...(x.com)

2026-05-29|news|twitter-bookmarks

A user previews Claude Opus 4.8, suggesting it offers notable improvements users of earlier Opus versions will find impressive.

Framework / model context & organization

2026-05-29|model|perplexity

- **OpenAI Frontier Governance Framework** – OpenAI[8] - **GPT‑5.5** context (released “two weeks ago” relative to OpenAI’s cyber post)[2]

3. OpenAI – Governance & GPT‑5.5 context (no new model this week, but relevant signals)

2026-05-29|model|perplexity

While not a release, these are the only frontier‑adjacent OpenAI updates in the timeframe.

Model / system name & org

2026-05-29|model|perplexity

- **Project Glasswing** – Anthropic[7]

f/prompts.chat (163000 stars): f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the co (github.com)

2026-05-29|tool|github

A community-curated repository for sharing and discovering reusable prompt templates designed for ChatGPT and other conversational AI systems.

markdown-svg-renderer (simonwillison.net)

2026-05-29|news|blog/Simon Willison

A renderer that converts Markdown containing SVG markup into properly displayed vector graphics output.

llm-anthropic 0.25.1 (simonwillison.net)

2026-05-29|news|blog/Simon Willison

Releases version 0.25.1 of the llm-anthropic plugin, adding or fixing features for using Anthropic Claude models via the LLM command-line tool.

← Prev19 / 172Next →