ai - summarizeddata (Page 30)

2026-05-29

one mask to rule them all: hidden facts after editing

a compact binary mask reverses most knowledge edits in language models, revealing a shared mechanism behind diverse factual updates.

2026-05-29

behavior-induced mirror-prox td for faster off-policy learning

a new mirror-prox temporal-difference method uses behavior-policy transition information instead of feature covariance to speed up off-policy prediction.

2026-05-29

glean hits $300m arr as ai cost savings drive growth

enterprise ai search company glean reaches $300 million in annual recurring revenue, tripling in 15 months, with token cost reduction becoming a key selling point.

2026-05-29

federated probe-logit distillation rate limits under heterogeneous bandwidth

new lower bounds show that the bandwidth term in federated probe-logit distillation is tight, and the method extends to nodes with different upload budgets.

2026-05-29

ai agent plans molecular tweaks for drug lead optimization

a new llm-based agent called trace uses tool planning to optimize drug-like molecules over multiple steps, improving properties while keeping key structures intact.

2026-05-29

behavior-aware corrections stabilize off-policy td learning

replacing the auxiliary covariance matrix with the behavior bellman matrix improves stability in off-policy temporal-difference learning.

2026-05-29

anytime-valid federated conformal rag for llm swarms

a new method extends federated conformal rag to provide valid uncertainty estimates at any stopping time, enabling safer adaptive control in distributed language model systems.

2026-05-29

claude opus 4.8 ships with honesty improvements

anthropic releases claude opus 4.8, a minor update focusing on reduced hallucinations and mid-conversation system messages.

2026-05-29

frontier models score below 50% on enterprise it agent benchmark

itbench-aa evaluates ai agents on kubernetes incident response, with claude opus 4.7 leading at 47% accuracy.

2026-05-29

google research at i/o 2026: ai for science, health, and more

google research highlights from i/o 2026 include new ai tools for scientific discovery, health coaching, and edge computing, plus advances in weather prediction and model factuality.

2026-05-29

seven ai projects to automate real workflows in 2026

build practical ai assistants for job search, research, invoice processing, and more with step-by-step guides.

2026-05-29

aws rebuilds search for ai agents

aws launches opensearch serverless to handle spiky agent traffic, scaling compute from zero to meet bursts and cutting idle costs.

2026-05-28

ai token futures trading is coming

exchanges are developing derivatives markets for llm tokens, letting businesses hedge against compute costs.

2026-05-28

asana buys no-code agent builder stack ai

asana acquires stack ai to integrate no-code ai agents into its work management platform, aiming to automate complex business processes.

2026-05-28

anthropic raises $65b at $965b valuation before ipo

anthropic closed a $65 billion series h round, reaching a $965 billion valuation as it prepares to go public.

2026-05-28

tuning local llm settings with ollama

learn to fine-tune local language model parameters, optimize hardware, and format prompts using ollama's modelfile, environment variables, and go templates.

2026-05-28

sesame launches ios app for conversational ai agents

sesame, founded by oculus creators, releases an ios app with ai agents that talk naturally, pause to think, and adapt mid-sentence.

2026-05-28

dynaschedbench calibrates dynamic scheduling benchmarks for llm agents

a new framework controls instance difficulty to test llm-based scheduling agents, revealing an observability paradox where more information can hurt performance.