Terminology Simplification Audit
For Developers: Documentation Language Review
This document tracks the simplification of technical terminology in user-facing documentation to ensure users have an easy-to-understand experience while developers retain access to precise technical details.
Date: October 23, 2025 Version: 0.8.8
Overview
The documentation has been audited to ensure:
User guides use simple, non-technical language
Developer guides maintain technical precision
Code examples preserve technical accuracy
Cross-references appropriately label technical content
Changes Made
Terminology Replacements in User Guides
The following technical terms were replaced with user-friendly alternatives:
General Technical Terms
module→tool/script(context-dependent)function→ kept only in code examplesAPI→technical documentation/using in Python codeparameter→setting/optionalgorithm→methodarchitecture→design/technical designparse→recognize/readpattern→detection rule/ kept in code examplescompile→ kept only in code examplesREPL→interactive environmentsdataclass→configuration optionsthread-safe→designed to work reliably
Specific Replacements
troubleshooting.rst
Module Import Errors→Missing Package ErrorsModuleNotFoundError→ Error message saying a package is not foundLogging module enhanced→Logging system enhancedthread-safe and optimized→designed to work reliablySystem architecturelink →Technical system design
introduction.rst
Dynamic dataset detection→Works with any dataset folder automaticallyModular architecture→Easy to customizeCommand-line options→Options to run specific parts
configuration.rst
detection algorithm→automatic detectionREPL/notebook compatibility→interactive environments like Jupyternormalize_dataset_name() function→automatically cleans up dataset namesDeidentificationConfig dataclass→configuration optionsUtility Functionssection →Helper ToolsImport from config module→Use the configuration fileModule not found→Config file not foundAPI documentation→technical documentation
installation.rst
De-identification module→De-identification scriptModuleNotFoundError→Package not found
quickstart.rst
Module not found→Package not foundModuleNotFoundError→Package not found
deidentification.rst
pattern detection→detection/smart detectionExtensible pattern support→Easy to extend with new detection rulesmodule automatically processes→tool automatically processesAlgorithm: AES-128→Encryption method: AES-128Algorithm: SHA-256→Hash method: SHA-256Regular pattern updates→Regular updates to detection rulesmodule implements→tool followsPattern Matching→Detection SpeedPattern Priority→Detection Orderpattern priorities→detection orderintelligent multi-format date parsing→smart multi-format date recognitionAPI Referencesection →Technical Reference
country_regulations.rst
parsed→recognizedPython APIsection →Using in Python CodeAPI Referencesection →Technical Reference
Preserved Technical Terms
The following remain in user guides as they appear only in:
Code examples - Necessary for accurate Python code
pattern=re.compile()DetectionPatterncustom_patternVariable and function names in code blocks
Error messages - Exact Python error text for searchability
Kept in quotes for user recognition
File/directory names - Exact system paths
.logs/config.py__pycache__
Cross-references to developer docs - Labeled appropriately
Links to
../api/sectionsLinks to
../developer_guide/sections
Style Checker Enhancements
Updated scripts/utils/check_docs_style.sh to detect technical jargon:
Previous Detection:
Only checked for “For Users:” and “For Developers:” headers
Limited technical term detection
Enhanced Detection:
Checks for specific technical term phrases
Excludes code blocks from jargon detection
Provides detailed warnings with term locations
Focus on prose text, not code examples
New Technical Terms List:
TECH_TERMS=(
"module reference"
"function call"
"class method"
" API documentation"
"parameter list"
"decorator pattern"
"singleton instance"
"algorithm implementation"
"dataclass definition"
"instantiate object"
"thread-safe implementation"
"REPL environment"
"__init__ method"
)
Verification Results
Sphinx Build Status:
Build succeeded: ✓
Warnings: 0
Errors: 0
HTML pages generated: 39
Style Checker Results:
User guide headers: ✓ All pass (8/8 files)
Developer guide headers: ✓ All pass (9/9 files)
Technical jargon in user guides: ✓ None found
Sphinx build: ✓ Success
Files Modified:
docs/sphinx/user_guide/troubleshooting.rstdocs/sphinx/user_guide/introduction.rstdocs/sphinx/user_guide/configuration.rstdocs/sphinx/user_guide/installation.rstdocs/sphinx/user_guide/quickstart.rstdocs/sphinx/user_guide/deidentification.rstdocs/sphinx/user_guide/country_regulations.rstscripts/utils/check_docs_style.sh
Best Practices
Guidelines for Future Documentation
For User Guide Content:
Use plain language
❌ “The module instantiates a singleton pattern”
✅ “The tool creates one instance automatically”
Explain technical concepts simply
❌ “Uses SHA-256 cryptographic hashing algorithm”
✅ “Uses SHA-256 hash method for security”
Avoid jargon in headings
❌ “API Reference”
✅ “Technical Reference” (in user guides)
Code examples can use technical terms
Keep variable names and Python keywords as-is
Add explanatory text in plain language
For Developer Guide Content:
Use precise technical terminology
✅ “Implements singleton pattern with lazy initialization”
✅ “Uses decorator pattern for cross-cutting concerns”
Include implementation details
Algorithm names, complexity analysis
Design patterns, architectural decisions
API contracts, type signatures
Reference specific code elements
Module names, class names, function signatures
Parameter types, return values
Testing Documentation Changes
Before committing documentation changes:
Run style checker:
bash scripts/utils/check_docs_style.shBuild documentation:
cd docs/sphinx make clean make html
Verify no warnings/errors:
Check build output for 0 warnings
Verify HTML generates correctly
Manual review:
Read as a non-technical user
Check code examples still work
Verify cross-references are accurate
Automation
The style checker now runs automatically to enforce standards:
What It Checks:
✓ All user guides have “For Users:” header
✓ All developer guides have “For Developers:” header
✓ User guides don’t contain technical jargon (except in code)
✓ Sphinx builds without warnings or errors
How to Run:
# From project root
bash scripts/utils/check_docs_style.sh
Exit Codes:
0- All checks passed1- Errors found (must fix)Warnings reported but don’t fail the check
Compliance Status
Current Status: ✅ FULLY COMPLIANT
All user guides use simple, accessible language
All developer guides maintain technical precision
Automated enforcement in place
Documentation builds without errors
Style guide documented and followed
Enforcement Layers:
Policy & Style Guide -
documentation_style_guide.rstdefines requirements and provides examplesQuick Automation -
scripts/utils/check_docs_style.shvalidates basic compliance (daily use)Comprehensive Automation -
scripts/utils/check_documentation_quality.pyperforms deep quality checks (quarterly)CI/CD Integration -
.github/workflows/docs-quality-check.ymlruns automated checksGit Control -
.gitignoreblocks non-compliant file types
See Also
Documentation Style Guide - Documentation style guide and policy
Historical Verification Archive - Archived verification and audit records
Changelog - Version history and changes