Developer Guide

This guide is for people who change, review, operate, or release the code. User onboarding stays in the User Guide; IRB/auditor evidence stays in the IRB/Auditor Profile.

Start by Role

Each entry lists a reader, that reader's goal, and where to start.

  • Pipeline developer
    Goal: Change extraction, PHI scrub, cleanup, publish, variables, or lineage behavior.
    Start here: Architecture, then Dataset Extraction

  • PDF pipeline developer
    Goal: Change PDF extraction, redaction, merge, or snapshot fallback.
    Start here: PDF Extraction, then PHI Architecture

  • Agent/tool developer
    Goal: Add or change assistant tools without breaking file-zone and PHI gates.
    Start here: Agent Instructions (for AI Coding Assistants), then API Reference

  • Privacy or security reviewer
    Goal: Inspect load-bearing controls, invariants, and tests.
    Start here: PHI Architecture, Sandbox: Subprocess-Isolated Code Execution, then Testing

  • Maintainer
    Goal: Run verification, restore reviewed snapshots, and prepare releases.
    Start here: Operations, then Production Readiness

  • Documentation contributor
    Goal: Keep README and Sphinx organized by audience.
    Start here: Documentation Style, then Contributing

Contents

Architecture & Decisions

Working Rules

  • Preserve the raw → staging → published bundle → agent-boundary PHI model described in PHI Architecture.

  • Keep implementation changes, tests, and documentation in the same PR when behavior changes.

  • Run the smallest focused tests first, then the repo gates required by Testing.

  • Keep the README brief. Put durable detail in the Sphinx docs and link to it.
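
As a rough illustration of the first rule above, the stage ordering raw → staging → published bundle → agent boundary can be modeled as an ordered enum with a check that data only moves forward one stage at a time. This is a minimal sketch, not the project's implementation: the names Zone and is_allowed_transition are hypothetical, and the "one step forward only" rule is an assumption made for the example. PHI Architecture remains the authoritative description of the model.

```python
from enum import IntEnum


class Zone(IntEnum):
    """Hypothetical stage names, ordered as in the working rule above."""
    RAW = 0
    STAGING = 1
    PUBLISHED_BUNDLE = 2
    AGENT_BOUNDARY = 3


def is_allowed_transition(src: Zone, dst: Zone) -> bool:
    """Assumed rule for this sketch: data may only advance to the next stage.

    Skipping a stage (for example raw straight to published bundle) or
    moving backwards is rejected.
    """
    return dst == src + 1


if __name__ == "__main__":
    # Walk every pair of zones and print which moves the sketch would accept.
    for src in Zone:
        for dst in Zone:
            if is_allowed_transition(src, dst):
                print(f"{src.name} -> {dst.name}: allowed")
```

A check of this shape could back a focused unit test, so that a change which lets data bypass a stage fails before the wider repo gates run.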