Commit Graph

8 Commits

Author SHA1 Message Date
czlonkowski
c3b691cedf feat(telemetry): capture error messages with security hardening
## Summary
Enhanced telemetry system to capture actual error messages for debugging
while implementing comprehensive security hardening to protect sensitive data.

## Changes
- Added optional errorMessage parameter to trackError() method
- Implemented sanitizeErrorMessage() with 7-layer security protection
- Updated all production and test call sites (atomic change)
- Added 18 new security-focused tests

## Security Fixes
- ReDoS Prevention: Early truncation + simplified regex patterns
- Full URL Redaction: Changed [URL]/path → [URL] to prevent leakage
- Credential Detection: AWS keys, GitHub tokens, JWT, Bearer tokens
- Correct Sanitization Order: URLs → credentials → emails → generic
- Error Handling: Try-catch wrapper with [SANITIZATION_FAILED] fallback

## Impact
- Resolves 272+ weekly errors with no error messages
- Protects against ReDoS attacks
- Prevents API structure and credential leakage
- 90.75% test coverage, 269 tests passing

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-03 15:53:13 +02:00
czlonkowski
a1a9ff63d2 fix: resolve remaining telemetry test failures
- Fix event validator to not filter out generic 'key' property
- Handle compound key terms (apikey, api_key) while allowing standalone 'key'
- Fix batch processor test expectations to account for circuit breaker limits
- Adjust dead letter queue test to expect 25 items due to circuit breaker opening after 5 failures
- Fix test mocks to fail for all retry attempts before adding to dead letter queue

All 252 telemetry tests now passing with 90.75% code coverage
2025-09-26 17:48:18 +02:00
czlonkowski
676c693885 fix: resolve test timeouts in telemetry tests
- Fix fake timer issues in rate-limiter and batch-processor tests
- Add proper timer handling for vitest fake timers
- Handle timer.unref() compatibility with fake timers
- Add test environment detection to skip timeouts in tests

This resolves the CI timeout issues where tests would hang indefinitely.
2025-09-26 16:58:41 +02:00
czlonkowski
e14c647b7d fix: refactor telemetry system with critical improvements (v2.14.1)
Major improvements to telemetry system addressing code review findings:

Architecture & Modularization:
- Split 636-line TelemetryManager into 7 focused modules
- Separated concerns: event tracking, batch processing, validation, rate limiting
- Lazy initialization pattern to avoid early singleton creation
- Clean separation of responsibilities

Security & Privacy:
- Added comprehensive input validation with Zod schemas
- Sanitization of sensitive data (URLs, API keys, emails)
- Expanded sensitive key detection patterns (25+ patterns)
- Row Level Security on Supabase backend
- Added data deletion contact info (romuald@n8n-mcp.com)

Performance & Reliability:
- Sliding window rate limiter (100 events/minute)
- Circuit breaker pattern for network failures
- Dead letter queue for failed events
- Exponential backoff with jitter for retries
- Performance monitoring with overhead tracking (<5%)
- Memory-safe array limits in rate limiter

Testing:
- Comprehensive test coverage (87%+ for core modules)
- Unit tests for all new modules
- Integration tests for MCP telemetry
- Fixed test isolation issues

Data Management:
- Clear user consent in welcome message
- Batch processing with deduplication
- Automatic workflow flushing

BREAKING CHANGE: TelemetryManager constructor is now private, use getInstance()

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-26 16:10:54 +02:00
czlonkowski
671c175d71 fix: resolve TypeErrors and enhance telemetry tracking
Fixes critical TypeErrors affecting 50% of tool calls and adds comprehensive telemetry tracking for better usage insights.

Bug Fixes:
- Add null safety checks in getNodeInfo with ?? and ?. operators
- Add null safety checks in getNodeEssentials for all metadata properties
- Add null safety checks in getNodeDocumentation with proper fallbacks
- Prevent TypeErrors when node properties are undefined/null from database

Telemetry Enhancements:
- Add trackSearchQuery to identify documentation gaps and zero-result searches
- Add trackValidationDetails to capture specific validation failure patterns
- Add trackToolSequence to understand user workflow patterns
- Add trackNodeConfiguration to monitor configuration complexity
- Add trackPerformanceMetric to identify bottlenecks
- Track tool sequences with timing to identify confusion points
- Track validation errors with details for improvement insights
- Track workflow creation on successful validation

Results:
- TypeErrors eliminated: 0 errors in 31+ tool calls (was 50% failure rate)
- Successfully tracking 37 tool sequences showing usage patterns
- Capturing validation error details for common issues
- Privacy preserved through comprehensive data sanitization

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-26 09:06:19 +02:00
czlonkowski
09e69df5a7 feat: implement anonymous telemetry system with Supabase integration
Adds zero-configuration anonymous usage statistics to track:
- Number of active users with deterministic user IDs
- Which MCP tools AI agents use most
- What workflows are built (sanitized to protect privacy)
- Common errors and issues

Key features:
- Zero-configuration design with hardcoded write-only credentials
- Privacy-first approach with comprehensive data sanitization
- Opt-out support via config file and environment variables
- Docker-friendly with environment variable support
- Multi-process safe with immediate flush strategy
- Row Level Security (RLS) policies for write-only access

Technical implementation:
- Supabase backend with anon key for INSERT-only operations
- Workflow sanitization removes all sensitive data
- Environment variables checked for opt-out (TELEMETRY_DISABLED, etc.)
- Telemetry enabled by default but respects user preferences
- Cleaned up all debug logging for production readiness

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-26 09:06:19 +02:00
czlonkowski
f150802bed fix: update telemetry to work with Supabase RLS and permissions
- Remove .select() from insert operations to avoid permission issues
- Add debug logging for successful flushes
- Add comprehensive test scripts for telemetry verification
- Telemetry now successfully sends anonymous usage data to Supabase
2025-09-26 09:06:19 +02:00
czlonkowski
5960d2826e feat: add anonymous telemetry system with Supabase integration
- Implement telemetry manager for tracking tool usage and workflows
- Add workflow sanitizer to remove sensitive data before storage
- Create config manager with opt-in/opt-out mechanism
- Integrate telemetry tracking into MCP server and workflow handlers
- Add CLI commands for telemetry control (enable/disable/status)
- Show first-run notice with clear privacy information
- Add comprehensive unit tests for sanitization and config
- Track tool usage metrics, workflow patterns, and errors
- Ensure complete anonymity with deterministic user IDs
- Never collect URLs, API keys, or sensitive information
2025-09-26 09:06:18 +02:00