2,000 people tried to hack an ai assistant and failed
a public challenge to leak secrets from an ai assistant via email saw 6,000 attempts but no successful breaches, highlighting improved model defenses against prompt injection.
topic
a public challenge to leak secrets from an ai assistant via email saw 6,000 attempts but no successful breaches, highlighting improved model defenses against prompt injection.
openai faces government release limits while expanding in india, plus new model tiers, custom chips, and practical data science tools.
openai appoints prabhjeet singh as first india managing director to scale consumer growth, enterprise adoption, and partnerships in a key ai battleground.
openai restricts its new gpt-5.6 models to a small group of trusted partners following a us government request, sparking debate over ai release controls.
openai launches limited preview of gpt-5.6 models sol, terra, and luna, offering varied performance and pricing tiers with new prompt caching features.
google retrofits multi-token prediction onto frozen gemini nano models to accelerate on-device inference without separate drafters.
five concrete agentic workflows automate major stages of a data science pipeline, from exploratory data analysis to feature engineering, reducing manual effort.
openai reveals jalapeño, a custom inference chip built with broadcom, joining a trend of big tech firms designing their own silicon to reduce reliance on nvidia.
openai and anthropic face similar delays as us regulators review frontier models customer by customer, raising industry-wide concerns.
fine-tune open language models locally on your mac using mlx, with no cloud gpus or costs required.
use gemini to build spreadsheets, generate formulas, and analyze data with natural language commands.
a study comparing token-level predictions of a hybrid model and a transformer reveals where each architecture excels.
refusal in ai chat models depends on a compliant persona direction, not just a standalone refusal mechanism.
a new method uses graded data samples to better isolate and control sycophancy in language models.
new tools measure how bayesian updating changes probability mass in small regions, revealing behavior missed by global divergences.
patronus ai builds simulated environments to evaluate ai agents on complex, real-world tasks, helping labs catch failures before deployment.
a new method blends maximum likelihood and empirical bayes estimators using excess mean squared error to improve second-order risk.
kg-trace combines neural networks with a knowledge graph to make antimicrobial resistance predictions that align with known biological pathways.