one mask to rule them all: hidden facts after editing
a compact binary mask reverses most knowledge edits in language models, revealing a shared mechanism behind diverse factual updates.
topic
a compact binary mask reverses most knowledge edits in language models, revealing a shared mechanism behind diverse factual updates.
a new mirror-prox temporal-difference method uses behavior-policy transition information instead of feature covariance to speed up off-policy prediction.
enterprise ai search company glean reaches $300 million in annual recurring revenue, tripling in 15 months, with token cost reduction becoming a key selling point.
new lower bounds show that the bandwidth term in federated probe-logit distillation is tight, and the method extends to nodes with different upload budgets.
a new llm-based agent called trace uses tool planning to optimize drug-like molecules over multiple steps, improving properties while keeping key structures intact.
replacing the auxiliary covariance matrix with the behavior bellman matrix improves stability in off-policy temporal-difference learning.
a new method extends federated conformal rag to provide valid uncertainty estimates at any stopping time, enabling safer adaptive control in distributed language model systems.
anthropic releases claude opus 4.8, a minor update focusing on reduced hallucinations and mid-conversation system messages.
itbench-aa evaluates ai agents on kubernetes incident response, with claude opus 4.7 leading at 47% accuracy.
google research highlights from i/o 2026 include new ai tools for scientific discovery, health coaching, and edge computing, plus advances in weather prediction and model factuality.
build practical ai assistants for job search, research, invoice processing, and more with step-by-step guides.
aws launches opensearch serverless to handle spiky agent traffic, scaling compute from zero to meet bursts and cutting idle costs.
exchanges are developing derivatives markets for llm tokens, letting businesses hedge against compute costs.
asana acquires stack ai to integrate no-code ai agents into its work management platform, aiming to automate complex business processes.
anthropic closed a $65 billion series h round, reaching a $965 billion valuation as it prepares to go public.
learn to fine-tune local language model parameters, optimize hardware, and format prompts using ollama's modelfile, environment variables, and go templates.
sesame, founded by oculus creators, releases an ios app with ai agents that talk naturally, pause to think, and adapt mid-sentence.
a new framework controls instance difficulty to test llm-based scheduling agents, revealing an observability paradox where more information can hurt performance.