Review Workflow

The PAW Review Workflow is a structured, three-stage process for thoughtful code review of any pull request. It helps reviewers understand changes, evaluate their impact, identify gaps, and provide considerate, actionable feedback.

Overview

PAW Review applies the same principles as the implementation workflow: traceable reasoning, rewindable analysis, and human-in-the-loop decision making. Instead of building code forward from a spec, it works backward from implementation to understanding.

Property	Description
Understanding before critique	Analyze what changed and why before evaluating quality
Comprehensive feedback	Generate all findings; human filters and adjusts based on context
Artifact-based	Durable markdown documents trace reasoning from changes to comments
Rewindable	Any stage can restart if new information changes understanding
Human-controlled	Nothing is submitted automatically; comments go into a pending review and the reviewer decides what to post

Skills-Based Architecture

The review workflow uses a skills-based architecture for dynamic, maintainable orchestration:

Invocation: /paw-review <PR-number-or-URL>

How it works:

The PAW Review agent loads the paw-review-workflow skill
The workflow skill orchestrates activity skills via subagent execution
Each activity skill produces specific artifacts
The complete review runs without manual pauses

Bundled Skills:

Skill	Type	Stage	Output
`paw-review-workflow`	Workflow	—	Orchestration logic
`paw-review-understanding`	Activity	Understanding	ReviewContext.md, ResearchQuestions.md, DerivedSpec.md
`paw-review-baseline`	Activity	Understanding	CodeResearch.md
`paw-review-impact`	Activity	Evaluation	ImpactAnalysis.md
`paw-review-gap`	Activity	Evaluation	GapAnalysis.md
`paw-review-correlation`	Activity	Evaluation	CrossRepoAnalysis.md (multi-repo only)
`paw-review-feedback`	Activity	Output	ReviewComments.md (draft → finalized)
`paw-review-critic`	Activity	Output	Assessment sections
`paw-review-github`	Activity	Output	GitHub pending review
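
The table above can be read as a small lookup structure. The following is a minimal sketch, assuming a hypothetical in-memory catalog; the `Skill` class and `skills_for_stage` helper are illustrative names and not part of PAW.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Skill:
    name: str
    stage: str   # "Understanding", "Evaluation", or "Output"
    output: str


# Activity skills transcribed from the Bundled Skills table.
# The workflow skill (paw-review-workflow) is omitted because it has no stage.
ACTIVITY_SKILLS = [
    Skill("paw-review-understanding", "Understanding",
          "ReviewContext.md, ResearchQuestions.md, DerivedSpec.md"),
    Skill("paw-review-baseline", "Understanding", "CodeResearch.md"),
    Skill("paw-review-impact", "Evaluation", "ImpactAnalysis.md"),
    Skill("paw-review-gap", "Evaluation", "GapAnalysis.md"),
    Skill("paw-review-correlation", "Evaluation", "CrossRepoAnalysis.md"),
    Skill("paw-review-feedback", "Output", "ReviewComments.md"),
    Skill("paw-review-critic", "Output", "Assessment sections"),
    Skill("paw-review-github", "Output", "GitHub pending review"),
]


def skills_for_stage(stage: str) -> list[str]:
    """Return the skill names for a stage, in table order."""
    return [s.name for s in ACTIVITY_SKILLS if s.stage == stage]
```

For example, `skills_for_stage("Understanding")` returns the two skills that run in Stage R1.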

Tool Support:

paw_get_skills — Retrieves catalog of available skills
paw_get_skill — Loads specific skill content by name

Subagent Skill Loading:

Every subagent MUST call paw_get_skill FIRST before executing any work. The workflow skill requires delegation prompts to include: "First load your skill using paw_get_skill('paw-review-<skill-name>'), then execute the activity."
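
A delegation prompt that satisfies this rule could be assembled as follows. This is only a sketch: the helper name `delegation_prompt` and its `task` parameter are hypothetical, and only the quoted instruction text comes from the workflow skill.

```python
PROMPT_TEMPLATE = (
    "First load your skill using paw_get_skill('paw-review-{skill}'), "
    "then execute the activity."
)


def delegation_prompt(skill: str, task: str = "") -> str:
    """Build a subagent prompt that begins with the required skill-loading step.

    `skill` is the suffix after "paw-review-" (for example "gap").
    `task` is optional extra context (hypothetical); it is appended after
    the skill-loading instruction so the load always comes first.
    """
    prompt = PROMPT_TEMPLATE.format(skill=skill)
    return f"{prompt} {task}".strip() if task else prompt
```

Keeping the instruction in a single template makes it harder for an individual delegation to omit the skill-loading step.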

Cross-Repository Review

PAW Review supports coordinated review of multiple related PRs across repositories.

Invocation:

/paw-review https://github.com/org/api/pull/123 https://github.com/org/frontend/pull/456

Detection triggers:

Multiple PR URLs/numbers in the command
Multi-root VS Code workspace detected
PRs reference different repositories

What happens:

Creates separate artifact directories per repository (PR-123-api/, PR-456-frontend/)
Analyzes each PR through full review stages
Identifies cross-repository impacts and dependencies
Creates pending reviews on each PR with cross-references
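
The detection triggers and the per-repository directory names can be sketched as below. The regular expression, the `PullRef` type, and the function names are assumptions made for illustration; the source only shows the resulting directory names (`PR-123-api/`, `PR-456-frontend/`) and does not describe how PAW derives them.

```python
import re
from typing import NamedTuple

# Assumed URL shape, based on the invocation example above.
PR_URL = re.compile(r"^https://github\.com/([^/\s]+)/([^/\s]+)/pull/(\d+)/?$")


class PullRef(NamedTuple):
    owner: str
    repo: str
    number: int


def parse_pr_url(url: str) -> PullRef:
    match = PR_URL.match(url)
    if match is None:
        raise ValueError(f"not a GitHub PR URL: {url!r}")
    owner, repo, number = match.groups()
    return PullRef(owner, repo, int(number))


def is_multi_repo(refs: list[PullRef], multi_root_workspace: bool = False) -> bool:
    """Mirror the three detection triggers listed above."""
    different_repos = len({(r.owner, r.repo) for r in refs}) > 1
    return len(refs) > 1 or multi_root_workspace or different_repos


def artifact_dir(ref: PullRef) -> str:
    """Directory name in the PR-<number>-<repo>/ pattern shown above."""
    return f"PR-{ref.number}-{ref.repo}/"
```

With the two URLs from the invocation example, `is_multi_repo` is true and `artifact_dir` yields `PR-123-api/` and `PR-456-frontend/`.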

Artifact additions for multi-repo:

ReviewContext.md includes related_prs field linking to other PRs
ImpactAnalysis.md includes Cross-Repository Dependencies table
GapAnalysis.md includes cross-repo consistency checks
ReviewComments.md includes cross-references like (See also: org/frontend#456)

Single-PR workflows remain unchanged—multi-repo features activate only when detected.

Workflow Stages

PR → Understanding (R1) → Evaluation (R2) → Output (R3)

Stage R1 — Understanding

Goal: Comprehensively understand what changed and why

Skills: paw-review-understanding, paw-review-baseline

Inputs:

PR URL or number (GitHub context)
Base branch name (non-GitHub context)
Repository context

Outputs:

ReviewContext.md — PR metadata, changed files, flags
ResearchQuestions.md — Research questions for baseline analysis
CodeResearch.md — Pre-change baseline understanding
DerivedSpec.md — Reverse-engineered intent and acceptance criteria

Process:

Fetch PR metadata and create ReviewContext.md
- Document all changed files with additions/deletions
- Set flags: CI failures, breaking changes suspected
Research pre-change baseline
- Analyze codebase at base commit (pre-change state)
- Document how system worked before changes
Derive specification
- Use CodeResearch.md to understand before/after behavior
- Reverse-engineer author intent from code and PR description
- Document assumptions and open questions
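
The first step records additions and deletions per changed file. The source does not say how PAW collects these counts. One common way is `git diff --numstat`, which prints tab-separated `added`, `deleted`, and `path` fields per line and uses `-` for binary files. A minimal sketch under that assumption (the `FileChange` type and `parse_numstat` function are hypothetical names):

```python
from typing import NamedTuple, Optional


class FileChange(NamedTuple):
    path: str
    additions: Optional[int]  # None for binary files
    deletions: Optional[int]


def parse_numstat(output: str) -> list[FileChange]:
    """Parse `git diff --numstat` text into per-file change counts."""
    changes = []
    for line in output.splitlines():
        if not line.strip():
            continue
        added, deleted, path = line.split("\t", 2)
        changes.append(FileChange(
            path,
            None if added == "-" else int(added),
            None if deleted == "-" else int(deleted),
        ))
    return changes
```

The input would typically come from running `git diff --numstat <base>...<head>` with the base and head commits recorded in ReviewContext.md.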

Stage R2 — Evaluation

Goal: Assess impact and identify what might be missing or concerning

Review Modes:

Single-model (default): paw-review-impact, paw-review-gap
Society-of-thought: paw-sot engine (replaces both impact and gap analysis)

Inputs:

All Stage R1 artifacts
Repository codebase at base and head commits

Outputs (single-model):

ImpactAnalysis.md — System-wide effects, integration points, breaking changes
GapAnalysis.md — Findings organized by Must/Should/Could

Outputs (society-of-thought):

REVIEW-{SPECIALIST}.md — Per-specialist findings
REVIEW-SYNTHESIS.md — Confidence-weighted synthesized findings

Process (single-model):

Analyze impact
- Build integration graph of dependencies
- Detect breaking changes
- Assess performance and security implications
- Evaluate design and architecture fit
- Document deployment considerations and risk
Identify gaps
- Correctness: Logic errors, edge cases, error handling
- Safety & Security: Validation, authorization, concurrency
- Testing: Coverage and test effectiveness
- Maintainability: Code clarity, documentation
- Performance: N+1 queries, unbounded operations
- Complexity: Over-engineering concerns
- Positive Observations: Good practices to commend
Categorize findings
- Must — Correctness, safety, security issues with concrete impact
- Should — Quality, completeness, testing gaps
- Could — Optional enhancements
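
The categorization step can be expressed as a small rule. This is an illustrative sketch rather than PAW's actual logic: the `Finding` type, the area names, and the choice to downgrade a finding without concrete impact to Should are all assumptions.

```python
from dataclasses import dataclass

MUST_AREAS = {"correctness", "safety", "security"}
SHOULD_AREAS = {"quality", "completeness", "testing"}


@dataclass
class Finding:
    title: str
    area: str                      # e.g. "correctness", "testing", "maintainability"
    concrete_impact: bool = False  # True when a specific failure can be stated


def categorize(finding: Finding) -> str:
    """Assign Must/Should/Could following the definitions above.

    A correctness, safety, or security finding is a Must only when a
    concrete impact can be stated; otherwise this sketch downgrades it
    to Should (an assumption, not documented behavior).
    """
    area = finding.area.lower()
    if area in MUST_AREAS:
        return "Must" if finding.concrete_impact else "Should"
    if area in SHOULD_AREAS:
        return "Should"
    return "Could"
```

Tying Must to a stated impact keeps the top category reserved for findings the reviewer can justify to the author.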

Process (society-of-thought):

Construct review context from ReviewContext.md and pass to paw-sot
Specialists review PR diff with distinct cognitive strategies
Synthesis merges findings with confidence weighting and conflict resolution

See Society-of-Thought Review for configuration details.

Stage R3 — Output

Goal: Generate comprehensive review comments, critically assess them, and post to GitHub

Skills: paw-review-feedback, paw-review-critic, paw-review-github

Inputs:

All prior artifacts
CrossRepoAnalysis.md (multi-repo only)

Outputs:

ReviewComments.md — Complete feedback with full comment history
GitHub pending review (GitHub context) — Draft review with filtered comments

Process:

The Output stage uses an iterative feedback-critique pattern:

Initial Feedback Pass (paw-review-feedback)
- Transform all findings into review comments with rationale
- Incorporate cross-repo gaps for multi-repo reviews
- Create ReviewComments.md with status: draft
- Does NOT post to GitHub yet
Critical Assessment (paw-review-critic)
- Evaluate each comment for usefulness, accuracy, trade-offs
- Add assessment sections with Include/Modify/Skip recommendations
- Assessments help reviewer decide what to include
- Never posted to GitHub—for local reference only
Critique Response Pass (paw-review-feedback)
- Process critic recommendations
- Add **Updated Comment:** for modified comments
- Mark each comment with **Final**: status
- Update ReviewComments.md status to: finalized
GitHub Posting (paw-review-github, GitHub only)
- Filter to only comments marked "Ready for GitHub posting"
- Create pending review with filtered comments
- Skipped comments remain in artifact but are NOT posted
- Non-GitHub: provides manual posting instructions
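
The critique response and posting steps amount to setting a final status on each comment and then filtering on it. A minimal sketch, assuming a hypothetical `ReviewComment` record and assuming that Include and Modify recommendations become "Ready for GitHub posting" while Skip becomes "Skipped per critique":

```python
from dataclasses import dataclass
from typing import Optional

READY = "Ready for GitHub posting"
SKIPPED = "Skipped per critique"


@dataclass
class ReviewComment:
    original: str
    recommendation: str            # critic output: "Include", "Modify", or "Skip"
    updated: Optional[str] = None  # set when the critic recommended "Modify"
    final: Optional[str] = None    # READY or SKIPPED once finalized


def finalize(comment: ReviewComment) -> ReviewComment:
    """Critique response pass: turn the recommendation into a final status."""
    comment.final = SKIPPED if comment.recommendation == "Skip" else READY
    return comment


def comments_to_post(comments: list[ReviewComment]) -> list[str]:
    """Posting step: keep READY comments, preferring the updated text."""
    return [c.updated or c.original for c in comments if c.final == READY]
```

Skipped comments stay in the list with their status set, which matches the rule that they remain in the artifact but are not posted.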

Comment Evolution in ReviewComments.md:

Each comment shows its complete history:

- Original — Initial feedback from first pass
- Assessment — Critic evaluation
- Updated — Refined version if modification was recommended
- Final — Ready for posting or skipped per critique
- Posted — GitHub pending review ID
- Regenerate with adjusted tone if requested

Review Artifacts

ReviewContext.md

Authoritative parameter source for the review workflow.

Contents:

PR Number/Branch
Base and Head commits
Changed files summary
CI Status and flags
Description and metadata

DerivedSpec.md

Reverse-engineered specification from implementation.

Contents:

Intent Summary — What problem this appears to solve
Scope — What's in and out of scope
Assumptions — Inferences from the code
Measurable Outcomes — Before/after behavior
Changed Interfaces — APIs, routes, schemas
Risks & Invariants
Open Questions

ImpactAnalysis.md

System-wide effects and design assessment.

Contents:

Integration Points and dependencies
Breaking Changes
Performance Implications
Security & Authorization Changes
Design & Architecture Assessment
User Impact (end-user and developer-user)
Risk Assessment

GapAnalysis.md

Findings organized by severity.

Structure:

## Must

### [Finding Title]
**File:** `path/to/file.ts:123`
**Finding:** [What the issue is]
**Impact:** [Why this matters]
**Suggestion:** [How to fix it]

## Should
...

## Could
...

## Positive Observations
...
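
A document in this shape could be generated from structured findings as sketched below. The `GapFinding` type and `render_gap_analysis` function are hypothetical, and the Positive Observations section is left out for brevity.

```python
from dataclasses import dataclass

SEVERITIES = ("Must", "Should", "Could")


@dataclass
class GapFinding:
    severity: str    # one of SEVERITIES
    title: str
    location: str    # e.g. "path/to/file.ts:123"
    finding: str
    impact: str
    suggestion: str


def render_gap_analysis(findings: list[GapFinding]) -> str:
    """Render findings as markdown grouped under Must, Should, and Could."""
    lines = []
    for severity in SEVERITIES:
        lines.append(f"## {severity}")
        lines.append("")
        for f in findings:
            if f.severity != severity:
                continue
            lines += [
                f"### {f.title}",
                f"**File:** `{f.location}`",
                f"**Finding:** {f.finding}",
                f"**Impact:** {f.impact}",
                f"**Suggestion:** {f.suggestion}",
                "",
            ]
    return "\n".join(lines)
```

Every severity heading is emitted even when it has no findings, so the three sections always appear in the same order.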

Note: ImpactAnalysis.md and GapAnalysis.md are produced in single-model mode only. In society-of-thought mode, the Evaluation stage instead produces one REVIEW-{SPECIALIST}.md per specialist plus REVIEW-SYNTHESIS.md (see Stage R2).

ReviewComments.md

Complete feedback with full comment history showing evolution from original to posted.

Status field: draft (awaiting critique) or finalized (ready for posting)

For each comment:

Original comment text and suggestions
Rationale (Evidence, Baseline Pattern, Impact, Best Practice)
Assessment (Usefulness, Accuracy, Trade-offs, Recommendation)
Updated comment/suggestion (if modified per critique)
Final status (Ready for GitHub posting or Skipped per critique)
Posted status (pending review ID after GitHub posting)

Human Workflow Summary

Invoke: Run /paw-review <PR-number-or-URL> in Copilot Chat
Review: All artifacts are created in .paw/reviews/<identifier>/
Consult: Check ReviewComments.md for full comment history (original → assessment → updated)
Edit: Open GitHub pending review, edit/delete comments as needed
Recover: Manually add skipped comments if you disagree with critique
Submit: Submit review manually (Approve/Comment/Request Changes)
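
Before consulting the artifacts, it can help to confirm that a run produced all of them. The sketch below assumes single-model mode, a single-PR review, and a flat layout in which every artifact sits directly inside the review directory; the source does not specify the layout, and `missing_artifacts` is a hypothetical helper.

```python
from pathlib import Path

# Artifacts named in the stage descriptions for single-model mode.
SINGLE_MODEL_ARTIFACTS = (
    "ReviewContext.md",
    "ResearchQuestions.md",
    "CodeResearch.md",
    "DerivedSpec.md",
    "ImpactAnalysis.md",
    "GapAnalysis.md",
    "ReviewComments.md",
)


def missing_artifacts(review_dir: Path) -> list[str]:
    """List expected single-model artifacts not present in review_dir."""
    return [name for name in SINGLE_MODEL_ARTIFACTS
            if not (review_dir / name).is_file()]
```

An empty result means every expected file exists; it says nothing about whether ReviewComments.md has reached the finalized status.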

Key principle: Comments are filtered by critique before posting. You retain full control: review the pending review, consult the complete history in ReviewComments.md, and manually add any skipped comments you want to include.

Next Steps

Implementation Workflow — The complementary implementation workflow
Agents Reference — Complete agent documentation