An Implementation of the Microsoft Agent Governance Toolkit for Safe AI Agent Tool Use with Policies, Approvals, Audit Logs, and Risk Controls

Source: MarkTechPost

In this tutorial, we build a governed AI-agent workflow using Microsoft’s Agent Governance Toolkit as the reference point. We create a Colab-ready implementation where agents do not directly execute tools; instead, every action first passes through a governance layer that checks the agent’s identity, trust score, risk tier, requested tool, action type, sensitivity level, and policy rules. We define a YAML-based policy that controls destructive database operations, external email sending, shell execution, access to sensitive data, and financial transfers. We then wrap each tool with governance logic so that actions can be allowed, denied, sandboxed, or routed through an approval step before execution. We also generate tamper-evident audit records, run policy tests, activate a kill switch, summarize governance decisions, and visualize the relationships between agents, tools, rules, and outcomes as a graph.

import os import sys import json import time import uuid import hmac import yaml import hashlib import random import shutil import subprocess from dataclasses import dataclass, asdict from datetime import datetime, timezone from typing import Any, Dict, List, Callable, Optional def pip_install(*packages):    subprocess.run(        [sys.executable, "-m", "pip", "install", "-q", *packages],        check=False    ) pip_install("pyyaml", "pandas", "networkx", "matplotlib", "rich") pip_install("agent-governance-toolkit[full]") from rich.console import Console from rich.table import Table from rich.panel import Panel from rich import box import pandas as pd import networkx as nx import matplotlib.pyplot as plt console = Console() REPO_URL = "https://github.com/microsoft/agent-governance-toolkit" REPO_DIR = "https://www.marktechpost.com/content/agent-governance-toolkit" if not os.path.exists(REPO_DIR):    subprocess.run(["git", "clone", "--depth", "1", REPO_URL, REPO_DIR], check=False) official_govern = None official_import_error = None try:    from agentmesh.governance import govern as official_govern except Exception as e:    official_import_error = repr(e)

We set up the Colab environment by installing the required libraries and importing everything needed for policy handling, auditing, visualization, and data analysis. We also clone the Microsoft Agent Governance Toolkit repository to keep the notebook connected to the original project. We then try to import the official governance function, while keeping the tutorial runnable even if the preview package changes.

POLICY_PATH = "https://www.marktechpost.com/content/advanced_agent_policy.yaml" policy_yaml = """ apiVersion: governance.toolkit/v1 name: advanced-colab-governance-policy default_action: allow metadata:  owner: ai-platform-team  environment: tutorial  description: >    Demonstrates deterministic governance controls for AI agent tool calls. rules:  - name: block-destructive-database-actions    description: "Agents must not perform destructive database operations."    condition: "action.type in ['drop_table', 'delete_table', 'truncate_table']"    action: deny    severity: critical    owasp_risk: "Tool misuse / Excessive agency"  - name: require-human-approval-for-email    description: "External email requires approval before execution."    condition: "action.type == 'send_email' and action.recipient_domain != 'internal.local'"    action: require_approval    approvers: ["security-team", "business-owner"]    severity: high    owasp_risk: "Goal hijacking / Unauthorized action"  - name: sandbox-shell-execution    description: "Shell commands must run in a sandbox with blocked dangerous commands."    condition: "action.type == 'shell_exec'"    action: sandbox    sandbox:      blocked_terms: ["rm -rf", "curl http", "wget http", "chmod 777", "sudo"]      max_runtime_seconds: 2    severity: high    owasp_risk: "Tool misuse / Unsafe execution"  - name: deny-low-trust-agent-sensitive-data    description: "Low-trust agents cannot access sensitive data."    condition: "identity.trust_score < 0.65 and action.sensitivity in ['high', 'critical']"    action: deny    severity: high    owasp_risk: "Identity abuse / Data exfiltration"  - name: require-approval-for-financial-transaction    description: "Financial transactions above 1000 require approval."    condition: "action.type == 'transfer_money' and action.amount > 1000"    action: require_approval    approvers: ["finance-controller"]    severity: critical    owasp_risk: "Excessive agency / Business process compromise"  - name: rate-limit-high-risk-agent    description: "High-risk agents are blocked from repeated autonomous actions."    condition: "identity.risk_tier == 'high' and action.autonomous == True"    action: deny    severity: medium    owasp_risk: "Rogue agent / Cascading failure" """ with open(POLICY_PATH, "w") as f:    f.write(policy_yaml) with open(POLICY_PATH, "r") as f:    policy = yaml.safe_load(f)

We create a YAML governance policy that defines how agent actions should be handled before execution. We add rules to block destructive database actions, require approval for external emails and financial transfers, sandbox shell commands, and restrict low-trust agents from sensitive data. We then save and reload this policy so the rest of the tutorial can use it as the main governance configuration.

@dataclass class AgentIdentity:    agent_id: str    name: str    role: str    owner: str    trust_score: float    risk_tier: str    scopes: List[str] @dataclass class GovernanceDecision:    decision_id: str    timestamp: str    policy_name: str    agent_id: str    agent_name: str    tool_name: str    action: Dict[str, Any]    decision: str    matched_rule: Optional[str]    severity: Optional[str]    reason: str    approved_by: Optional[str]    previous_hash: str    record_hash: str class GovernanceDenied(Exception):    pass class ApprovalRequired(Exception):    pass class SandboxViolation(Exception):    pass class DotDict(dict):    def __getattr__(self, item):        value = self.get(item)        if isinstance(value, dict):            return DotDict(value)        return value def safe_eval_condition(condition: str, action: Dict[str, Any], identity: AgentIdentity) -> bool:    safe_globals = {        "__builtins__": {},        "True": True,        "False": False,        "None": None,    }    safe_locals = {        "action": DotDict(action),        "identity": DotDict(asdict(identity)),    }    try:        return bool(eval(condition, safe_globals, safe_locals))    except Exception as e:        return False

We define the core data structures for representing agent identities, governance decisions, and governance-related exceptions. We also create a small dot-access dictionary helper so that policy conditions can read values such as action.type and identity.trust_score. We then build a safe condition evaluator that checks whether each policy rule matches the current agent action.

class TamperEvidentAuditLog:    def __init__(self, secret: bytes = b"tutorial-secret-key"):        self.records: List[GovernanceDecision] = []        self.secret = secret        self.last_hash = "GENESIS"    def _hash_record(self, payload: Dict[str, Any], previous_hash: str) -> str:        canonical = json.dumps(            {"payload": payload, "previous_hash": previous_hash},            sort_keys=True,            default=str        ).encode()        return hmac.new(self.secret, canonical, hashlib.sha256).hexdigest()    def append(        self,        policy_name: str,        identity: AgentIdentity,        tool_name: str,        action: Dict[str, Any],        decision: str,        matched_rule: Optional[str],        severity: Optional[str],        reason: str,        approved_by: Optional[str] = None    ) -> GovernanceDecision:        base_payload = {            "decision_id": str(uuid.uuid4()),            "timestamp": datetime.now(timezone.utc).isoformat(),            "policy_name": policy_name,            "agent_id": identity.agent_id,            "agent_name": identity.name,            "tool_name": tool_name,            "action": action,            "decision": decision,            "matched_rule": matched_rule,            "severity": severity,            "reason": reason,            "approved_by": approved_by,        }        record_hash = self._hash_record(base_payload, self.last_hash)        record = GovernanceDecision(            **base_payload,            previous_hash=self.last_hash,            record_hash=record_hash        )        self.records.append(record)        self.last_hash = record_hash        return record    def verify(self) -> bool:        previous = "GENESIS"        for r in self.records:            payload = asdict(r)            record_hash = payload.pop("record_hash")            previous_hash = payload.pop("previous_hash")            if previous_hash != previous:                return False            expected = self._hash_record(payload, previous_hash)            if expected != record_hash:                return False            previous = record_hash        return True    def to_dataframe(self) -> pd.DataFrame:        return pd.DataFrame([asdict(r) for r in self.records]) audit_log = TamperEvidentAuditLog()

We implement a tamper-evident audit log that records every governance decision made by the system. We use chained hashes, so each new record depends on the previous record, making changes easier to detect. We also add methods to verify the audit chain and convert the records into a dataframe for later analysis.

class TutorialGovernanceEngine:    def __init__(self, policy: Dict[str, Any], audit_log: TamperEvidentAuditLog):        self.policy = policy        self.audit_log = audit_log        self.kill_switch_enabled = False        self.error_budget = 5        self.recent_denials = 0    def activate_kill_switch(self):        self.kill_switch_enabled = True    def deactivate_kill_switch(self):        self.kill_switch_enabled = False    def evaluate(        self,        identity: AgentIdentity,        tool_name: str,        action: Dict[str, Any]    ) -> GovernanceDecision:        if self.kill_switch_enabled:            return self.audit_log.append(                policy_name=self.policy["name"],                identity=identity,                tool_name=tool_name,                action=action,                decision="deny",                matched_rule="global-kill-switch",                severity="critical",                reason="Global governance kill switch is active."            )        for rule in self.policy.get("rules", []):            condition = rule.get("condition", "")            if safe_eval_condition(condition, action, identity):                rule_action = rule.get("action", "deny")                matched_rule = rule.get("name")                severity = rule.get("severity")                description = rule.get("description", "Policy rule matched.")                if rule_action == "deny":                    self.recent_denials += 1                    return self.audit_log.append(                        policy_name=self.policy["name"],                        identity=identity,                        tool_name=tool_name,                        action=action,                        decision="deny",                        matched_rule=matched_rule,                        severity=severity,                        reason=description                    )                if rule_action == "require_approval":                    return self.audit_log.append(                        policy_name=self.policy["name"],                        identity=identity,                        tool_name=tool_name,                        action=action,                        decision="require_approval",                        matched_rule=matched_rule,                        severity=severity,                        reason=description                    )                if rule_action == "sandbox":                    blocked_terms = rule.get("sandbox", {}).get("blocked_terms", [])                    command = str(action.get("command", ""))                    for term in blocked_terms:                        if term in command:                            self.recent_denials += 1                            return self.audit_log.append(                                policy_name=self.policy["name"],                                identity=identity,                                tool_name=tool_name,                                action=action,                                decision="deny",                                matched_rule=matched_rule,                                severity=severity,                                reason=f"Sandbox blocked command term: {term}"                            )                    return self.audit_log.append(                        policy_name=self.policy["name"],                        identity=identity,                        tool_name=tool_name,                        action=action,                        decision="sandbox",                        matched_rule=matched_rule,                        severity=severity,                        reason=description                    )        return self.audit_log.append(            policy_name=self.policy["name"],            identity=identity,            tool_name=tool_name,            action=action,            decision=self.policy.get("default_action", "allow"),            matched_rule=None,            severity=None,            reason="No policy rule matched. Default action applied."        ) engine = TutorialGovernanceEngine(policy, audit_log)

We build the main governance engine that compares each agent action against the YAML policy rules. We handle different outcomes such as deny, approval required, sandbox mode, and default allow. We also include a kill switch that immediately blocks all actions when needed.

def query_database(table: str, operation: str = "select") -> Dict[str, Any]:    return {        "status": "success",        "operation": operation,        "table": table,        "rows_returned": random.randint(10, 100)    } def send_email(to: str, subject: str, body: str) -> Dict[str, Any]:    return {        "status": "sent",        "to": to,        "subject": subject,        "body_preview": body[:80]    } def shell_exec(command: str) -> Dict[str, Any]:    allowed_commands = ["echo", "date", "pwd", "ls"]    first = command.strip().split()[0] if command.strip() else ""    if first not in allowed_commands:        return {            "status": "blocked_by_tutorial_shell",            "command": command,            "reason": "Only harmless demo shell commands are executed."        }    result = subprocess.run(        command,        shell=True,        capture_output=True,        text=True,        timeout=2    )    return {        "status": "executed",        "command": command,        "stdout": result.stdout.strip(),        "stderr": result.stderr.strip()    } def transfer_money(amount: float, destination: str) -> Dict[str, Any]:    return {        "status": "transferred",        "amount": amount,        "destination": destination    } class GovernedTool:    def __init__(        self,        name: str,        fn: Callable,        engine: TutorialGovernanceEngine,        identity: AgentIdentity,        approval_simulator: Optional[Callable[[GovernanceDecision], bool]] = None    ):        self.name = name        self.fn = fn        self.engine = engine        self.identity = identity        self.approval_simulator = approval_simulator or (lambda decision: False)    def __call__(self, **kwargs):        action = dict(kwargs)        action.setdefault("autonomous", True)        decision = self.engine.evaluate(            identity=self.identity,            tool_name=self.name,            action=action        )        if decision.decision == "deny":            raise GovernanceDenied(                f"Action denied by rule '{decision.matched_rule}': {decision.reason}"            )        if decision.decision == "require_approval":            approved = self.approval_simulator(decision)            if not approved:                raise ApprovalRequired(                    f"Approval required by rule '{decision.matched_rule}': {decision.reason}"                )            self.engine.audit_log.append(                policy_name=self.engine.policy["name"],                identity=self.identity,                tool_name=self.name,                action=action,                decision="approved",                matched_rule=decision.matched_rule,                severity=decision.severity,                reason="Human approval simulated for tutorial.",                approved_by="tutorial-approver"            )        return self.fn(**kwargs)

We define sample tools that represent real agent capabilities, including database access, email sending, shell execution, and money transfer. We then create a governed tool wrapper that ensures every tool call passes through the governance engine first. We ensure denied actions stop immediately, that approval-based actions require a simulated approval, and that only approved or allowed actions reach the actual tool.

research_agent = AgentIdentity(    agent_id="agent-research-001",    name="ResearchAgent",    role="market_research",    owner="strategy-team",    trust_score=0.91,    risk_tier="low",    scopes=["read_database", "web_search", "internal_email"] ) ops_agent = AgentIdentity(    agent_id="agent-ops-002",    name="OpsAgent",    role="automation",    owner="platform-team",    trust_score=0.72,    risk_tier="medium",    scopes=["shell_exec", "read_database"] ) unknown_agent = AgentIdentity(    agent_id="agent-shadow-999",    name="ShadowAgent",    role="unknown",    owner="unknown",    trust_score=0.42,    risk_tier="high",    scopes=["unknown"] ) finance_agent = AgentIdentity(    agent_id="agent-finance-003",    name="FinanceAgent",    role="finance_ops",    owner="finance-team",    trust_score=0.88,    risk_tier="low",    scopes=["transfer_money", "read_database"] ) def tutorial_approval_simulator(decision: GovernanceDecision) -> bool:    action = decision.action    if decision.matched_rule == "require-approval-for-financial-transaction":        return action.get("amount", 0) <= 5000    if decision.matched_rule == "require-human-approval-for-email":        return "confidential" not in str(action).lower()    return False research_db = GovernedTool(    name="query_database",    fn=query_database,    engine=engine,    identity=research_agent,    approval_simulator=tutorial_approval_simulator ) ops_shell = GovernedTool(    name="shell_exec",    fn=shell_exec,    engine=engine,    identity=ops_agent,    approval_simulator=tutorial_approval_simulator ) shadow_db = GovernedTool(    name="query_database",    fn=query_database,    engine=engine,    identity=unknown_agent,    approval_simulator=tutorial_approval_simulator ) research_email = GovernedTool(    name="send_email",    fn=send_email,    engine=engine,    identity=research_agent,    approval_simulator=tutorial_approval_simulator ) finance_transfer = GovernedTool(    name="transfer_money",    fn=transfer_money,    engine=engine,    identity=finance_agent,    approval_simulator=tutorial_approval_simulator )

We create multiple agents with different roles, trust scores, risk levels, and scopes to simulate a realistic multi-agent environment. We also define an approval simulator that accepts or rejects actions based on simple business logic. We then wrap each tool with the correct agent identity so the governance layer can make identity-aware decisions.

scenarios = [    {        "name": "Safe database read",        "tool": research_db,        "kwargs": {            "table": "customers",            "operation": "select",            "type": "select",            "sensitivity": "medium"        }    },    {        "name": "Blocked destructive database action",        "tool": research_db,        "kwargs": {            "table": "customers",            "operation": "drop",            "type": "drop_table",            "sensitivity": "critical"        }    },    {        "name": "External email requiring approval",        "tool": research_email,        "kwargs": {            "to": "[email protected]",            "recipient_domain": "example.com",            "subject": "Quarterly update",            "body": "Sharing a non-confidential quarterly update.",            "type": "send_email",            "sensitivity": "medium"        }    },    {        "name": "External email denied due to approval rejection",        "tool": research_email,        "kwargs": {            "to": "[email protected]",            "recipient_domain": "example.com",            "subject": "Confidential strategy",            "body": "This contains confidential strategy.",            "type": "send_email",            "sensitivity": "critical"        }    },    {        "name": "Safe sandbox shell command",        "tool": ops_shell,        "kwargs": {            "command": "echo Agent governance is active",            "type": "shell_exec",            "sensitivity": "low"        }    },    {        "name": "Dangerous shell command blocked",        "tool": ops_shell,        "kwargs": {            "command": "rm -rf /content/something",            "type": "shell_exec",            "sensitivity": "critical"        }    },    {        "name": "Low-trust agent blocked from sensitive data",        "tool": shadow_db,        "kwargs": {            "table": "executive_compensation",            "operation": "select",            "type": "select",            "sensitivity": "critical"        }    },    {        "name": "Financial transfer requiring approval",        "tool": finance_transfer,        "kwargs": {            "amount": 2500,            "destination": "vendor-123",            "type": "transfer_money",            "sensitivity": "high"        }    },    {        "name": "Large financial transfer rejected",        "tool": finance_transfer,        "kwargs": {            "amount": 15000,            "destination": "vendor-999",            "type": "transfer_money",            "sensitivity": "critical"        }    }, ] results = [] for scenario in scenarios:    try:        output = scenario["tool"](**scenario["kwargs"])        results.append({            "scenario": scenario["name"],            "status": "executed",            "output": output        })    except Exception as e:        results.append({            "scenario": scenario["name"],            "status": "blocked_or_pending",            "error": str(e)        }) audit_df = audit_log.to_dataframe() display_cols = [    "timestamp",    "agent_name",    "tool_name",    "decision",    "matched_rule",    "severity",    "reason",    "record_hash" ] display(audit_df[display_cols]) test_cases = [    {        "name": "drop_table must be denied",        "identity": research_agent,        "tool_name": "query_database",        "action": {"type": "drop_table", "sensitivity": "critical", "autonomous": True},        "expected": "deny"    },    {        "name": "safe select should be allowed",        "identity": research_agent,        "tool_name": "query_database",        "action": {"type": "select", "sensitivity": "low", "autonomous": True},        "expected": "allow"    },    {        "name": "external email should require approval",        "identity": research_agent,        "tool_name": "send_email",        "action": {            "type": "send_email",            "recipient_domain": "example.com",            "sensitivity": "medium",            "autonomous": True        },        "expected": "require_approval"    },    {        "name": "low trust sensitive access denied",        "identity": unknown_agent,        "tool_name": "query_database",        "action": {"type": "select", "sensitivity": "critical", "autonomous": True},        "expected": "deny"    },    {        "name": "shell command should enter sandbox",        "identity": ops_agent,        "tool_name": "shell_exec",        "action": {            "type": "shell_exec",            "command": "echo hello",            "sensitivity": "low",            "autonomous": True        },        "expected": "sandbox"    }, ] test_results = [] for test in test_cases:    decision = engine.evaluate(        identity=test["identity"],        tool_name=test["tool_name"],        action=test["action"]    )    passed = decision.decision == test["expected"]    test_results.append({        "test": test["name"],        "expected": test["expected"],        "actual": decision.decision,        "passed": passed,        "matched_rule": decision.matched_rule    }) test_df = pd.DataFrame(test_results) display(test_df) engine.activate_kill_switch() try:    research_db(        table="customers",        operation="select",        type="select",        sensitivity="low"    ) except Exception as e:    pass engine.deactivate_kill_switch() audit_df = audit_log.to_dataframe() summary = (    audit_df    .groupby(["decision", "severity"], dropna=False)    .size()    .reset_index(name="count")    .sort_values("count", ascending=False) ) display(summary) agent_summary = (    audit_df    .groupby(["agent_name", "decision"])    .size()    .reset_index(name="count")    .sort_values(["agent_name", "count"], ascending=[True, False]) ) display(agent_summary) decision_counts = audit_df["decision"].value_counts() plt.figure(figsize=(8, 5)) decision_counts.plot(kind="bar") plt.title("Governance Decisions Across Agent Actions") plt.xlabel("Decision") plt.ylabel("Count") plt.xticks(rotation=30) plt.tight_layout() plt.show() severity_counts = audit_df["severity"].fillna("none").value_counts() plt.figure(figsize=(8, 5)) severity_counts.plot(kind="bar") plt.title("Governance Events by Severity") plt.xlabel("Severity") plt.ylabel("Count") plt.xticks(rotation=30) plt.tight_layout() plt.show() G = nx.DiGraph() for _, row in audit_df.iterrows():    agent_node = f"Agent: {row['agent_name']}"    tool_node = f"Tool: {row['tool_name']}"    decision_node = f"Decision: {row['decision']}"    rule_node = f"Rule: {row['matched_rule']}" if pd.notna(row["matched_rule"]) else "Rule: default"    G.add_node(agent_node, node_type="agent")    G.add_node(tool_node, node_type="tool")    G.add_node(decision_node, node_type="decision")    G.add_node(rule_node, node_type="rule")    G.add_edge(agent_node, tool_node, relation="calls")    G.add_edge(tool_node, decision_node, relation="produces")    G.add_edge(decision_node, rule_node, relation="matched") plt.figure(figsize=(14, 9)) pos = nx.spring_layout(G, seed=42, k=0.8) nx.draw_networkx_nodes(G, pos, node_size=1800) nx.draw_networkx_edges(G, pos, arrows=True, arrowstyle="->", arrowsize=15) nx.draw_networkx_labels(G, pos, font_size=8) plt.title("Agent Governance Graph: Agents, Tools, Decisions, and Policy Rules") plt.axis("off") plt.tight_layout() plt.show() EXPORT_DIR = "https://www.marktechpost.com/content/agt_tutorial_outputs" os.makedirs(EXPORT_DIR, exist_ok=True) audit_json_path = os.path.join(EXPORT_DIR, "tamper_evident_audit_log.json") audit_csv_path = os.path.join(EXPORT_DIR, "governance_audit_log.csv") policy_copy_path = os.path.join(EXPORT_DIR, "advanced_agent_policy.yaml") test_results_path = os.path.join(EXPORT_DIR, "policy_test_results.csv") with open(audit_json_path, "w") as f:    json.dump([asdict(r) for r in audit_log.records], f, indent=2, default=str) audit_df.to_csv(audit_csv_path, index=False) test_df.to_csv(test_results_path, index=False) shutil.copy(POLICY_PATH, policy_copy_path)

We run a set of test scenarios that show how the governed system handles safe actions, risky actions, approval flows, and blocked operations. We display the audit log, run policy tests, activate and deactivate the kill switch, and summarize governance decisions with tables and charts. We also create a governance graph and export the audit logs, policy file, and test results as reusable artifacts.

In conclusion, we have a fully governed-agent workflow that covers both policy enforcement and observability. We simulated multiple agents with varying trust levels. We showed how the same system responds differently depending on the agent’s identity, the action’s sensitivity, and the rules defined in the policy file. Safe actions, such as simple database reads, are executed. In contrast, risky actions, such as destructive database changes, unsafe shell commands, low-trust sensitive access, and large financial transfers, are blocked or sent for approval. We also recorded every decision in an audit log, verified the audit chain, ran policy tests, exported governance artifacts, and created visual summaries that make the system’s behavior easier to review.

Check out the Full Codes here. Also, feel free to follow us on Twitter and don’t forget to join our 150k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us

Sana Hassan

Sana Hassan, a consulting intern at Marktechpost and dual-degree student at IIT Madras, is passionate about applying technology and AI to address real-world challenges. With a keen interest in solving practical problems, he brings a fresh perspective to the intersection of AI and real-life solutions.