The AI Wire

Quoting Karen Kwok for Reuters Breakingviews (simonwillison.net)

2026-05-31|news|blog/Simon Willison

Features a quote or commentary from Karen Kwok published in Reuters Breakingviews, likely offering financial or business analysis on a current topic.

@mr_r0b0t: Big news for @NVIDIAAI Blackwell users!... (x.com)

2026-05-31|news|twitter-bookmarks

Announces a new development, update, or feature relevant to NVIDIA Blackwell GPU users, likely related to AI inference, drivers, or software support.

Show HN: Komi-learn – continuous memory and self-improvement for coding agents (github.com)

2026-05-31|news|hackernews

Komi-learn adds persistent memory and iterative self-improvement capabilities to coding agents, allowing them to retain past solutions and refine coding strategies over time.

Notes from the Mistral AI Now Summit (koenvangilst.nl)

2026-05-30|news|hackernews

Summary notes from Mistral AI's Now Summit covering announcements, talks, and strategic directions shared at the event.

Is AI causing a repeat of frontend’s lost decade? (mastrojs.github.io)

2026-05-30|news|hackernews

Argues that AI-generated code is repeating frontend development's 'lost decade' of poor practices, technical debt, and degraded developer craft.

Liquid AI reveals 8B-A1B MoE trained on 38T (liquid.ai)

2026-05-30|news|hackernews

Liquid AI releases an 8-billion-parameter Mixture-of-Experts model with 1B active parameters, trained on 38 trillion tokens.

We should be more tired than the model (vickiboykis.com)

2026-05-30|news|hackernews

Argues that human operators accumulate cognitive fatigue faster than AI models, with implications for how AI-human collaboration should be structured.

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA (github.com)

2026-05-30|news|hackernews

Tiny-vLLM is a high-performance LLM inference engine implemented in C++ and CUDA, targeting fast, lightweight local model serving.

Shift will clean homes for free to train future robots (theverge.com)

2026-05-30|news|hackernews

Shift, a robotics company, offers free home cleaning services to collect real-world household manipulation data for training future domestic robots.

langchain-ai/langchain (138011 stars): The agent engineering platform. (github.com)

2026-05-30|tool|github

LangChain provides a framework and tooling for building LLM-powered agents, chains, and multi-step reasoning pipelines in production.

open-webui/open-webui (139225 stars): User-friendly AI Interface (Supports Ollama, OpenAI API, ...) (github.com)

2026-05-30|tool|github

Open WebUI delivers a self-hosted, browser-based chat interface supporting local models via Ollama and remote models via the OpenAI API.

langgenius/dify (143139 stars): Production-ready platform for agentic workflow development. (github.com)

2026-05-30|tool|github

Dify is a production-ready platform for visually designing, deploying, and managing agentic LLM workflows with built-in observability.

huggingface/transformers (161059 stars): 🤗 Transformers: the model-definition framework for state-of-the-art machine lear (github.com)

2026-05-30|tool|github

Hugging Face Transformers provides standardized model definitions, weights, and APIs for loading and fine-tuning state-of-the-art pretrained models.

f/prompts.chat (163046 stars): f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the co (github.com)

2026-05-30|tool|github

A community-curated repository for sharing, discovering, and collecting reusable prompt templates for ChatGPT and other LLM interfaces.

ollama/ollama (172629 stars): Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemm (github.com)

2026-05-30|tool|github

Ollama enables local download and execution of large language models including Kimi-K2.5, GLM-5, DeepSeek, Qwen, and Gemma via a simple CLI.

Significant-Gravitas/AutoGPT (184647 stars): AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our (github.com)

2026-05-30|tool|github

AutoGPT provides an open-source platform enabling users to build and run autonomous AI agents that chain GPT model calls to complete multi-step goals without continuous human input.

Hugging Face / announcement

2026-05-30|model|perplexity

- vLLM release notes mention Qwen3.5 support as a major new architecture.[2]
- The underlying Qwen3.5 models are typically published on Hugging Face under the Qwen org; vLLM’s notes are a reliable pointer to the family’s capabilities.[2]

6. Qwen3.5 Family – Gated Delta Networks & advanced decoding (open‑source)

2026-05-30|model|perplexity

Qwen3.5 introduces Gated Delta Network architectures in place of standard attention, together with advanced decoding strategies, across an open-source model family for improved efficiency and performance.

5. OpenAI – gpt‑oss‑120b & gpt‑oss‑20b (open‑weight reasoning models)

2026-05-30|model|perplexity

OpenAI releases two open-weight reasoning-capable models at 120B and 20B parameter scales, making competitive reasoning model weights publicly accessible.

4. OpenAI – GPT‑5.3‑Codex and GPT‑5.1‑Codex‑Max (frontier coding/agentic models)

2026-05-30|model|perplexity

OpenAI releases frontier-grade coding and agentic models in two tiers—GPT-5.3-Codex and GPT-5.1-Codex-Max—optimized for software generation and autonomous task execution.

3. Anthropic – Claude Opus 4.8 (frontier Claude upgrade)

2026-05-30|model|perplexity

Anthropic upgrades the Claude Opus line to version 4.8, advancing frontier-level capability, likely in reasoning, instruction following, or safety alignment over prior Opus releases.

2. OpenAI – GPT‑5.2 (enterprise‑focused frontier series)

2026-05-30|model|perplexity

OpenAI targets enterprise deployment with GPT-5.2, a frontier model series tuned for reliability, compliance, and performance in business-critical applications.

1. OpenAI – GPT‑5.5 (frontier model, cyber‑focused rollout)

2026-05-30|model|perplexity

OpenAI rolls out GPT-5.5 as a frontier model with a cyber-focused initial deployment, targeting cybersecurity-related tasks or threat analysis use cases.

PRISM: A Multi-Dimensional Benchmark for Evaluating LLM Peer Reviewers (huggingface.co)

2026-05-30|model|huggingface

PRISM evaluates LLMs acting as academic peer reviewers across multiple quality dimensions, measuring review accuracy, consistency, and alignment with human expert judgments.

Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation (huggingface.co)

2026-05-30|model|huggingface

Reformulating uniform diffusion models with a leave-one-out denoiser and absorbing-state perspective yields a theoretically cleaner, more stable training and inference framework for discrete generative models.

EarlyTom: Early Token Compression Completes Fast Video Understanding (huggingface.co)

2026-05-30|model|huggingface

EarlyTom compresses video token representations at early transformer layers, dramatically reducing computation while preserving sufficient temporal information for accurate video understanding.

CoHyDE: Iterative Co-Training of LLM Rewriter & Dense Encoder for Tool Retrieval (huggingface.co)

2026-05-30|model|huggingface

CoHyDE jointly and iteratively trains an LLM query rewriter alongside a dense encoder so both components mutually improve retrieval of the correct external tools for a given query.

Xetrieval: Mechanistically Explaining Dense Retrieval (huggingface.co)

2026-05-30|model|huggingface

Xetrieval applies mechanistic interpretability methods to dense retrieval models, identifying which internal circuits and representations drive document-query similarity scoring.

REPOT: Recoverable Program-of-Thought via Checkpoint Repair (huggingface.co)

2026-05-30|model|huggingface

Introduces checkpoint-based repair mechanisms that recover failed Program-of-Thought reasoning chains mid-execution rather than restarting from scratch.

Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view Generation (huggingface.co)

2026-05-30|model|huggingface

Builds consistent 3D Gaussian head avatars from single-view inputs by enforcing multi-view consistency without requiring multi-view generative models.
