Microsoft Open-Sources ASSERT, a Framework for Turning Natural-Language AI Policies Into Scored Tests
Microsoft launched ASSERT on Tuesday, an open source framework that converts natural-language descriptions of AI behavior into scored tests. The tool records system paths and supports continuous monitoring for application-specific policies.
marketpulse.comMicrosoft released an open source framework called Adaptive Spec-driven Scoring for Evaluation and Regression Testing on Tuesday. The framework is abbreviated as ASSERT. ASSERT turns high-level natural-language descriptions of goals, policies, or intended behaviors into thorough scored tests.
It takes plain-language descriptions of an AI model’s expected behavior and policies and turns them into a structured set of acceptable and unacceptable behaviors. ASSERT generates problem scenarios and test cases, runs them against the target system, and scores the results. It can record the paths the AI system takes, including intermediate actions and tool calls.
Developers can provide system context, tools, and constraints to customize evaluations in ASSERT. For example, a developer could specify that a document research AI agent should not send emails to people outside the company and should limit confidential information to C-level executives. Sarah Bird, chief product officer of Responsible AI at Microsoft, said evaluations are critical.
“One of the things we’ve learned is that evaluations are absolutely critical to making good decisions,” she said. “Because if you don’t understand the behavior of the AI system, it’s really hard to know if it’s meeting your organization’s bar,” Bird said. ” Bird said ASSERT can be used to evaluate systems when they are being built, after deployment, and for continuous monitoring.
TechCrunch reported that the framework fills a gap that broader evaluations cannot address when AI models must follow application-specific policies.
Transparency
Reported by a single outlet. This score reflects source tier and factual specificity — corroboration is limited with one source.
Story details
Related Stories
abcnews.go.comTrump Signs Executive Order Prioritizing AI for Cybersecurity Innovation
President Donald J. Trump signed an executive order on June 2 directing federal agencies to accelerate artificial intelligence development for protecting critical infrastructure. The order reverses earlier emphasis on slower deployment and risk reviews.
nbcnews.comTrump Signs AI Executive Order Promoting Innovation While Requiring Security Reviews
The order directs federal agencies to promote advanced AI development while addressing security concerns and reduces government review compared with an earlier draft.
The HillTrump administration proposes expanding 401(k) alternative asset options; Democrats urge withdrawal
Top Democratic lawmakers sent a letter Monday asking the Department of Labor to drop a rule that would allow cryptocurrency, private equity and private credit in retirement plans. They said the change would expose an estimated $14.2 trillion in savings to greater risk and higher…