The AI Wire

High Signal (4-5)

3173 articles — page 14 of 106

open-webui/open-webui (138948 stars): User-friendly AI Interface (Supports Ollama, OpenAI API, ...) (github.com)

2026-05-28|tool|github

Open WebUI delivers a self-hosted browser interface for interacting with local and remote LLMs including Ollama and OpenAI-compatible APIs.

langgenius/dify (142941 stars): Production-ready platform for agentic workflow development.(github.com)

2026-05-28|tool|github

Dify enables developers to build, deploy, and monitor LLM-powered agentic workflows in production environments with a visual development platform.

huggingface/transformers (161009 stars): 🤗 Transformers: the model-definition framework for state-of-the-art machine lear (github.com)

2026-05-28|tool|github

Hugging Face Transformers provides standardized model definitions, weights, and APIs for loading and fine-tuning state-of-the-art pretrained models.

f/prompts.chat (162942 stars): f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the co (github.com)

2026-05-28|tool|github

A community-curated repository for sharing and discovering reusable system and user prompts for ChatGPT and other LLM interfaces.

ollama/ollama (172474 stars): Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemm (github.com)

2026-05-28|tool|github

Ollama enables local download, quantization management, and inference serving of large language models including Qwen, DeepSeek, and Gemma via a CLI and API.

Significant-Gravitas/AutoGPT (184594 stars): AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our (github.com)

2026-05-28|tool|github

AutoGPT provides an open-source autonomous agent platform that chains GPT model calls with tool use to complete long-horizon tasks with minimal human input.

Hugging Face / repo link

2026-05-28|model|perplexity

- GitHub repository: `nousresearch/hermes-agent`.[3]

4. Nous Research – Hermes Agent v0.14.0 (Self‑Improving Agent Stack)

2026-05-28|model|perplexity

Nous Research's Hermes Agent v0.14.0 introduces a self-improving agent stack where the agent iteratively refines its own prompts, tools, or weights during operation.

3. Anthropic – Claude Sonnet 4.6 (1M‑token Context)

2026-05-28|model|perplexity

Claude Sonnet 4.6 extends Anthropic's mid-tier model to a one-million-token context window, enabling processing of entire codebases or book-length documents in a single pass.

2. Anthropic – Claude Opus 4.7

2026-05-28|model|perplexity

Claude Opus 4.7 advances Anthropic's highest-capability model tier with improved reasoning, instruction following, and performance on complex multi-step tasks.

Announcement link

2026-05-28|model|perplexity

- OpenAI blog announcement (GPT‑5.5 with Trusted Access for Cyber).[1]

Model / org

2026-05-28|model|perplexity

- **GPT‑5.5 (Cyber‑focused deployment)** – OpenAI[1]

1. OpenAI – GPT‑5.5 “Trusted Access for Cyber” Expansion

2026-05-28|model|perplexity

OpenAI expands GPT-5.5 access specifically for vetted cybersecurity professionals and organizations, enabling trusted use of the model for offensive and defensive security workflows.

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS (v.redd.it)

2026-05-28|news|reddit/LocalLLaMA

A 260,000-parameter LLM was successfully executed on an emulated 1990s-era CPU running an 18-year-old real-time operating system, demonstrating extreme-constraint on-device inference.

SWE-rebench Leaderboard (March, April and May 2026): GPT-5.5, Opus 4.7, Cursor (Composer 2.5), Kimi K2.6 and More (swe-rebench.com)

2026-05-28|news|reddit/LocalLLaMA

The SWE-rebench leaderboard ranks AI coding agents, including GPT-5.5, Opus 4.7, Cursor Composer 2.5, and Kimi K2.6, on software engineering tasks for March, April, and May 2026.

Long Live The Balance: Information Bottleneck Driven Tree-based Policy Optimization (huggingface.co)

2026-05-28|model|huggingface

Uses Information Bottleneck theory to balance exploration and exploitation in tree-based reinforcement learning policy optimization, preventing collapse toward suboptimal policies.

GEM: Generative Supervision Helps Embodied Intelligence (huggingface.co)

2026-05-28|model|huggingface

Incorporates generative model supervision signals to improve embodied agent learning, enabling better scene understanding and action planning in physical environments.

Advancing Creative Physical Intelligence in Large Multimodal Models (huggingface.co)

2026-05-28|model|huggingface

Extends large multimodal models with creative physical reasoning capabilities, enabling generation and understanding of physically plausible, imaginative real-world scenarios.

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems (huggingface.co)

2026-05-28|model|huggingface

Traces the origin of factual or reasoning errors in LLM memory systems back to specific stored memories, attributing failures to their root causes for debugging.

Rethinking Memory as Continuously Evolving Connectivity (huggingface.co)

2026-05-28|model|huggingface

Reframes memory in neural networks as dynamic connectivity patterns that evolve continuously over time rather than fixed storage, enabling adaptive long-term retention.

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation (huggingface.co)

2026-05-28|model|huggingface

Trains a proactive recommendation agent via reinforcement learning with a rectified policy gradient to correct overestimated returns and improve anticipatory item suggestion.

PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective (huggingface.co)

2026-05-28|model|huggingface

Analyzes parameter-efficient fine-tuning methods through a stability-plasticity lens, identifying which techniques best preserve pretrained knowledge while adapting to new tasks.

Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving (huggingface.co)

2026-05-28|model|huggingface

Applies block-level diffusion within a vision-language model for autonomous driving to achieve faster inference while maintaining high-quality scene understanding and planning.

AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems (huggingface.co)

2026-05-28|model|huggingface

Provides a coordination and policy substrate that manages communication, task allocation, and decision-making protocols across multiple collaborating AI agents.

Triplet-Block Diffusion RWKV (huggingface.co)

2026-05-28|model|huggingface

Combines RWKV's linear recurrent architecture with a triplet-block structure and diffusion-based generation to enable efficient sequence modeling with improved generation quality.

DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes (huggingface.co)

2026-05-28|model|huggingface

Trains reasoning models via reinforcement learning to recover correct reasoning chains after encountering corrupted or noisy input prefixes, improving robustness to prompt perturbations.

Lost in Sampling: Assessing Lexical Reachability in LLMs via the Word Coverage Score (WCS) (huggingface.co)

2026-05-28|model|huggingface

Introduces Word Coverage Score (WCS) to measure how many lexical tokens an LLM can actually generate under sampling, revealing vocabulary blind spots.

ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations (huggingface.co)

2026-05-28|model|huggingface

Presents a system that automatically discovers and iteratively refines reusable conversational skills to improve emotional support dialogue agents.

AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning (huggingface.co)

2026-05-28|model|huggingface

Scales multi-agent systems for long-horizon tasks by enabling collective reasoning across many collaborating agents acting in concert.

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders (huggingface.co)

2026-05-28|model|huggingface

Uses sparse autoencoder features from model internals to guide selection and curation of post-training data, improving LLM fine-tuning efficiency.
