Agent Army MECE Audit: Quick Reference
Overall Score: 72/100 (Fair; actionable improvements exist)
Date: 2026-05-22
Agents: See AGENT_COUNTS.md; organized across 11 categories
Routing ambiguity: 50% (target: <5%)
🔴 Critical Overlaps (Fix Immediately)
| Agents | Issue | Impact | Fix Effort |
|---|---|---|---|
| devops-engineer + deployment-engineer | Both own CI/CD; no boundary rule | High: users don't know which to pick | 2 hrs |
| debugger + error-detective | Identical scope (root cause diagnosis) | Medium: duplicate capability | Merge or split by local vs. distributed |
| ml-engineer + machine-learning-engineer | RESOLVED: merged into machine-learning-engineer; ml-engineer removed | n/a | Done |
| devops-engineer + deployment-engineer | RESOLVED: boundary rule added (devops = build/operate CI/CD + infra; deployment = release/rollout strategy) | n/a | Done |
| react-specialist + frontend-developer | Both own React; no rule for greenfield vs. optimization | High: 15% of React tasks ambiguous | Add rule: "frontend-developer=greenfield, react-specialist=optimization" |
| backend-developer + node-specialist + fastapi-developer | Architecture vs. language vs. framework confusion | High: many server-side tasks ambiguous | Add rule: "backend-developer=cross-language architecture, specialists=language/framework idioms" |
⚠️ Medium Overlaps (Add Decision Rules)
Can be resolved with explicit boundary rules (no merging needed):
- api-designer vs backend-developer: "designer owns specs; backend owns API implementation"
- data-engineer vs dlt-engineer: "engineer=generic tool-agnostic; dlt-engineer=dlt-specific optimization"
- documentation-engineer vs technical-writer: "engineer designs systems; writer creates content"
- performance-engineer vs layer-specialists: "performance-engineer diagnoses bottleneck; specialist fixes in their layer"
- security-auditor vs penetration-tester: "auditor=comprehensive assessment; tester=exploitation + validation"
- platform-engineer vs kubernetes-specialist: "platform=IDP end-to-end; k8s-specialist=k8s ops"
- legacy-modernizer vs refactoring-specialist: "modernizer=strategy + sequencing; specialist=tactical code cleanup"
- Plus 6 more in full scorecard
Category Scores
| Category | Score | Status | Key Issue |
|---|---|---|---|
| 11 · Enterprise Architecture | 91/100 | ✅ Exemplary | None; TOGAF-aligned structure is MECE-perfect |
| 07 · Specialized Domains | 85/100 | ✅ Good | None; scoped by domain, clear |
| 10 · Research & Analysis | 88/100 | ✅ Good | Minor: add pipeline clarification |
| 09 · Meta & Orchestration | 87/100 | ✅ Good | Minor: orchestration vs. coordination boundary |
| 08 · Business & Product | 82/100 | ✅ Good | Minor: add rules for PM vs BA, PM vs Scrum |
| 03 · Infrastructure | 81/100 | ⚠️ Fair | 🔴 CRITICAL: devops/deployment overlap |
| 04 · Quality & Security | 79/100 | ⚠️ Fair | 🔴 debugger/error-detective merger |
| 02 · Language Specialists | 78/100 | ⚠️ Fair | Consider consolidating 4 JS agents; 5 PowerShell OK |
| 06 · Developer Experience | 76/100 | ⚠️ Fair | Minor: doc engineer vs. writer |
| 05 · Data & AI | 74/100 | ⚠️ Fair | 🔴 ml-engineer/machine-learning-engineer merger |
| 01 · Core Development | 65/100 | 🔴 Fair | 🔴 frontend-developer scope creep; fullstack redundancy |
🎯 What Blocks Semantic Distinctiveness
Problem 1: Missing Decision Rules
Most overlaps exist because descriptions lack explicit boundary conditions.
Example (current):
react-specialist: "optimize existing React applications"
frontend-developer: "build complete frontend applications across React, Vue, Angular"
→ Unclear: can frontend-developer optimize React? Does react-specialist build new React apps?
Fixed:
react-specialist: "optimize existing React codebases for performance, state management, hooks"
- Use when: you have working React code that needs performance/architecture improvements
frontend-developer: "build new full-stack frontends across React/Vue/Angular; multi-framework architecture"
- Use when: greenfield frontend work or selecting framework strategy
Rule: frontend-developer chooses framework; react-specialist optimizes React only
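The rule above can also be written as a small decision function. The following is a minimal sketch, not part of the audit: the `FrontendTask` descriptor, its two fields, and `route_frontend_task` are hypothetical names introduced for illustration, and sending non-React optimization work to frontend-developer is an assumption the rule does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrontendTask:
    """Hypothetical task descriptor; both fields are illustrative assumptions."""
    framework: str    # e.g. "react", "vue", "angular", or "" if undecided
    greenfield: bool  # True when no working codebase exists yet


def route_frontend_task(task: FrontendTask) -> str:
    """frontend-developer chooses the framework and builds new frontends;
    react-specialist only refines existing React code."""
    if task.framework == "react" and not task.greenfield:
        return "react-specialist"
    # Assumption: everything else, including undecided frameworks, falls back here.
    return "frontend-developer"


print(route_frontend_task(FrontendTask(framework="react", greenfield=False)))
print(route_frontend_task(FrontendTask(framework="react", greenfield=True)))
```

Writing the rule this way makes the gap visible: any task that does not match the react-specialist condition needs an explicit default owner.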
Problem 2: Granularity Mismatch (Diagonal Overlap)
One agent is scoped at the language level and another at the framework level, so their scopes converge on the same tasks.
Example:
- backend-developer (language-agnostic, architecture-focused)
- node-specialist (language-specific)
- fastapi-developer (framework-specific)
All three could claim a task such as "build a Node.js FastAPI equivalent".
Fix: Add hierarchy rule:
backend-developer: Cross-language architecture (microservices, API design, scalability)
node-specialist: Node.js idioms, async patterns, npm ecosystem
fastapi-developer: FastAPI-specific async patterns, Pydantic validation
Route "build Node.js API" to: backend-developer (design) → node-specialist (implement)
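The hierarchy rule can be sketched as a two-phase route: architecture first, then the matching specialist. This is an illustrative sketch only; the `SPECIALISTS` keyword table and `plan_backend_route` are hypothetical, and plain substring matching stands in for whatever matching a real router would use.

```python
# Hypothetical lookup from a stack keyword to its specialist agent.
SPECIALISTS = {
    "node.js": "node-specialist",
    "fastapi": "fastapi-developer",
}


def plan_backend_route(task: str) -> list[tuple[str, str]]:
    """Return (agent, phase) steps: backend-developer designs, a specialist implements."""
    steps = [("backend-developer", "design")]
    lowered = task.lower()
    for keyword, agent in SPECIALISTS.items():
        if keyword in lowered:
            steps.append((agent, "implement"))
    return steps


print(plan_backend_route("build Node.js API"))
```

In this sketch a task that names no known stack stays with backend-developer, and a task that names two stacks returns both specialists, which may be a hint that the task should be split before routing.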
Problem 3: Same Artifact, Different Focus
Both agents produce the same deliverable but from different angles.
Examples:
- debugger and error-detective both diagnose root causes
- data-analyst and data-scientist both analyze data
- ml-engineer and machine-learning-engineer both deploy ML systems
Fix: Either merge (if truly identical) or split explicitly by scope.
Routing Test Results: 20 Real Tasks
Ambiguity rate: 50% (10/20 tasks)
Tasks with clear single agent (40%):
- Build a GraphQL API
- Set up Kubernetes
- Audit compliance
- Develop smart contracts
- Build a game
- Implement payment processing
- etc.
Tasks with 2+ ambiguous agents (60%):
- "Optimize React app" → react-specialist OR performance-engineer
- "Build Node.js API" → node-specialist OR backend-developer
- "Set up CI/CD" → devops-engineer OR deployment-engineer
- "Debug a bug" → debugger OR error-detective
- "Build ELT pipeline" → data-engineer OR dlt-engineer
- etc.
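For re-running the routing test later, the ambiguity rate is simply the share of tasks with more than one plausible owner. A minimal sketch follows, assuming the test results are recorded as a mapping from task to candidate agents; `ambiguity_rate` and the four-task `sample` are hypothetical and do not reproduce the 20-task audit data.

```python
def ambiguity_rate(candidates_by_task: dict[str, list[str]]) -> float:
    """Fraction of tasks for which more than one agent is a plausible owner."""
    if not candidates_by_task:
        return 0.0
    ambiguous = sum(1 for agents in candidates_by_task.values() if len(agents) > 1)
    return ambiguous / len(candidates_by_task)


# Candidate pairs are copied from the ambiguous tasks listed above;
# the single-owner entry uses a placeholder name, not an audit result.
sample = {
    "Optimize React app": ["react-specialist", "performance-engineer"],
    "Build Node.js API": ["node-specialist", "backend-developer"],
    "Set up CI/CD": ["devops-engineer", "deployment-engineer"],
    "Build a GraphQL API": ["<single owner>"],
}

print(f"{ambiguity_rate(sample):.0%}")
```

Comparing the result against the <5% target after each round of boundary rules gives a simple pass/fail signal.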
How to Fix (Phase 1: 1 Week, 5-10 Hours)
Step 1: Merge or Deprecate (2 hrs)
- Merge ml-engineer + machine-learning-engineer → keep one name
- Merge debugger + error-detective → split by distributed (error-detective) vs. local (debugger), OR merge into debugger with sub-focus areas
- Merge frontend-developer + fullstack-developer → if fullstack is rare, deprecate it
Step 2: Add Boundary Rules to AGENTS.md (4 hrs)
Create a "Routing Rules" section for each overlapping pair.
Template:
### react-specialist vs frontend-developer
- **react-specialist**: optimize existing React codebases; advanced React 18+ patterns; hooks/context/state mgmt
- **frontend-developer**: greenfield multi-framework work; framework selection; full-stack integration
- **When unsure**: frontend-developer if choosing tech, react-specialist if refining tech
Step 3: Test Routing on 5 Real Tasks (1 hr)
Pick 5 ambiguous tasks from your backlog; confirm each routes unambiguously with new rules.
Step 4: Publish + Communicate (1 hr)
Update AGENTS.md with routing rules; notify team of merged agents (deprecation notices).
Before Adding New Agents to Your Backlog
Use the Agent MECE Checklist (from AGENT_MECE_AUDIT_RUBRIC.md, Part F):
---
name: [agent-name]
description: "..."
tools: [...]
model: [...]
category: [...]
---
## MECE Self-Check
☐ Primary deliverable: [artifact; distinct from 5+ existing agents?]
☐ Overlaps with: [list any agents with same/adjacent deliverable]
☐ Boundary rule: [explicit condition separating from overlapping agents]
☐ Routing test: [5 tasks; does each route unambiguously?]
Red flags for backlog candidates:
- ❌ "Like X, but for Y" (e.g., "like data-engineer, but for Apache Spark") → probably a sub-specialization, not a new agent
- ❌ No clear boundary rule vs. existing agents
- ❌ Same artifact as 2+ agents (unless you're splitting an oversized role)
- ❌ Sits in a gap but collides with 3+ existing agents (fix overlaps first)
Green flags:
- ✅ Solves a clearly unmet problem domain (e.g., edge computing, visual design systems)
- ✅ Scopes cleanly (no diagonal overlap)
- ✅ Boundary rules vs. adjacent agents are explicit
- ✅ Routing test passes (new agent always gets picked unambiguously)
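Two of the red flags can be checked mechanically before a candidate is reviewed by hand. The sketch below is an assumption-heavy illustration: `AgentSpec`, its fields, `red_flags`, the `spark-engineer` candidate, and the "data pipeline" deliverable label are all hypothetical, and exact string equality on the deliverable is a simplification of what "same artifact" means in practice.

```python
from dataclasses import dataclass, field


@dataclass
class AgentSpec:
    """Hypothetical agent record; field names are assumptions for this sketch."""
    name: str
    deliverable: str
    # Maps another agent's name to the rule separating it from this agent.
    boundary_rules: dict[str, str] = field(default_factory=dict)


def red_flags(candidate: AgentSpec, existing: list[AgentSpec]) -> list[str]:
    """Report shared-artifact and missing-boundary-rule red flags."""
    overlaps = [a.name for a in existing if a.deliverable == candidate.deliverable]
    flags = []
    if len(overlaps) >= 2:
        flags.append(f"same artifact as {len(overlaps)} agents: {', '.join(overlaps)}")
    missing = [name for name in overlaps if name not in candidate.boundary_rules]
    if missing:
        flags.append(f"no boundary rule vs: {', '.join(missing)}")
    return flags


existing = [
    AgentSpec("data-engineer", "data pipeline"),
    AgentSpec("dlt-engineer", "data pipeline"),
]
candidate = AgentSpec(
    "spark-engineer",
    "data pipeline",
    {"data-engineer": "Spark-specific tuning only"},
)
print(red_flags(candidate, existing))
```

An empty result does not mean the candidate passes; the "Like X, but for Y" and routing-test checks still need human judgment.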
Full Documentation
- AGENT_MECE_AUDIT_RUBRIC.md: Complete framework (7 parts, includes templates)
- AGENT_MECE_AUDIT_SCORECARD.md: Full audit with findings per category + improvement roadmap
- MECE_AUDIT_SUMMARY.md: you are here
Recommended Next Steps
- Review critical overlaps above; draft mergers/deprecations
- Add 5 boundary rules to AGENTS.md (highest-impact overlaps first)
- Run 10-task routing test to validate fixes
- Apply checklist to each agent in your backlog before adding
- Re-audit semi-annually (or every 5 new agents)
Questions? The rubric is designed so non-agents can evaluate distinctiveness. Use it to onboard others.