autocoder

mirror of https://github.com/leonvanzyl/autocoder.git synced 2026-03-18 03:13:08 +00:00

Author	SHA1	Message	Date
cabana8471	8e23fee094	fix: Prevent mock data implementations with infrastructure features Problem: The coding agent can implement in-memory storage (e.g., `dev-store.ts` with `globalThis`) instead of a real database. These implementations pass all tests because data persists during runtime, but data is lost on server restart. This is a root cause for #68 - agent "passes" features that don't actually work. Solution: 1. Add 5 mandatory Infrastructure Features (indices 0-4) that run first: - Feature 0: Database connection established - Feature 1: Database schema applied correctly - Feature 2: Data persists across server restart (CRITICAL) - Feature 3: No mock data patterns in codebase - Feature 4: Backend API queries real database 2. Add STEP 5.7: Server Restart Persistence Test to coding prompt: - Create test data, stop server, restart, verify data still exists 3. Extend grep patterns for mock detection in STEP 5.6: - globalThis., devStore, dev-store, mockData, fakeData - TODO.*real, STUB, MOCK, new Map() as data stores Changes: - .claude/templates/initializer_prompt.template.md - Infrastructure features - .claude/templates/coding_prompt.template.md - STEP 5.6/5.7 enhancements - .claude/commands/create-spec.md - Phase 3b database question Backwards Compatible: - Works with YOLO mode (uses bash/grep, not browser automation) - Stateless apps can skip database features via create-spec question Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 08:01:30 +01:00
Auto	486979c3d9	refactor: remove testing agent claim mechanism for concurrent testing Remove the testing_in_progress claim/release mechanism from the testing agent architecture. Multiple testing agents can now test the same feature concurrently, simplifying the system and eliminating potential stale lock issues. Changes: - parallel_orchestrator.py: - Remove claim_feature_for_testing() and release_testing_claim() methods - Remove _cleanup_stale_testing_locks() periodic cleanup - Replace with simple _get_random_passing_feature() selection - Remove startup stale lock cleanup code - Remove STALE_TESTING_LOCK_MINUTES constant - Remove unused imports (timedelta, text) - api/database.py: - Remove testing_in_progress and last_tested_at columns from Feature model - Update to_dict() to exclude these fields - Convert _migrate_add_testing_columns() to no-op for backwards compat - mcp_server/feature_mcp.py: - Remove feature_release_testing tool entirely - Remove unused datetime import - prompts.py: - Update testing prompt to remove feature_release_testing instruction - Testing agents now just verify and exit (no cleanup needed) - server/websocket.py: - Update AgentTracker to use composite keys (feature_id, agent_type) - Prevents ghost agent creation from ambiguous [Feature #X] messages - Proper separation of coding vs testing agent tracking Benefits: - Eliminates artificial bottleneck from claim coordination - No stale locks to clean up after crashes - Simpler crash recovery (no testing state to restore) - Reduced database writes (no claim/release transactions) - Matches intended design: random, concurrent regression testing Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 15:30:31 +02:00
Auto	874359fcf6	improve performance	2026-01-23 14:37:43 +02:00
Auto	1be42cc734	fix: ensure agents are removed from Mission Control UI on completion Previously, agents that completed their work would remain visible in the Mission Control UI until a manual page refresh. This occurred because the AgentTracker._handle_agent_complete method silently dropped completion messages when an agent wasn't tracked (e.g., due to missed start messages from WebSocket connection issues). Backend changes: - Modified _handle_agent_complete in server/websocket.py to always emit completion messages, even for untracked agents - Synthetic completions use agentIndex=-1 and agentName='Unknown' as sentinel values to indicate untracked agents Frontend changes: - Updated useWebSocket.ts to handle synthetic completions by removing agents by featureId when agentIndex is -1 - Added 30-minute stale agent cleanup as defense-in-depth for users who leave the UI open for extended periods - Updated TypeScript types to allow 'Unknown' as valid agent name Component updates: - AgentAvatar.tsx: Added UNKNOWN_COLORS and UnknownSVG fallback for rendering unknown agents with a neutral gray question mark icon - CelebrationOverlay.tsx, DependencyGraph.tsx: Updated interfaces to accept 'Unknown' agent names Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 13:19:45 +02:00
Auto	a03d945fcd	feat: add orchestrator observability to Mission Control Add real-time visibility into the parallel orchestrator's decisions and state in the Mission Control UI. The orchestrator now has its own avatar ("Maestro") and displays capacity/queue information. Backend changes (server/websocket.py): - Add OrchestratorTracker class that parses orchestrator stdout - Define regex patterns for key orchestrator events (spawn, complete, capacity) - Track coding/testing agent counts, ready queue, blocked features - Emit orchestrator_update WebSocket messages - Reset tracker state when agent stops or crashes Frontend changes: - Add OrchestratorState, OrchestratorStatus, OrchestratorEvent types - Add WSOrchestratorUpdateMessage to WSMessage union - Handle orchestrator_update in useWebSocket hook - Create OrchestratorAvatar component (Maestro - robot conductor) - Create OrchestratorStatusCard with capacity badges and event ticker - Update AgentMissionControl to show orchestrator above agent cards - Add conducting/baton-tap CSS animations for Maestro The orchestrator status card shows: - Maestro avatar with state-based animations - Current orchestrator state and message - Coding agents, testing agents, ready queue badges - Blocked features count (when > 0) - Collapsible recent events list Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 13:02:36 +02:00
Auto	b21d2e3adc	fix: add Windows compatibility to security unit tests Add cross-platform temporary_home() context manager to handle environment variable differences between Unix and Windows systems. Changes: - Add temporary_home() context manager that handles both HOME (Unix) and USERPROFILE/HOMEDRIVE/HOMEPATH (Windows) environment variables - Update test_org_config_loading() to use temporary_home() - Update test_hierarchy_resolution() to use temporary_home() - Update test_org_blocklist_enforcement() to use temporary_home() - Add missing imports: os, contextmanager Why: The unit tests for org config loading were failing on Windows because they only set the HOME environment variable, but Windows uses USERPROFILE instead. The integration tests already had this fix via a similar context manager. Result: All 148 unit tests now pass on both Windows and Unix systems. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 12:24:50 +02:00
Leon van Zyl	1fe47736cc	Merge pull request #80 from ipodishima/feature/custom-commands Per-Project Bash Command Allowlist System	2026-01-23 12:18:30 +02:00
Auto	751ab01438	style: fix import order in settings.py for ruff compliance Move mimetypes import to the top of the import block to satisfy ruff's import sorting rules (I001). The Windows mimetype fix from PR #82 placed the import after other imports, which violated the project's linting standards. Changes: - Move `import mimetypes` to alphabetically correct position - Update comment to clarify timing requirement Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 12:12:44 +02:00
Leon van Zyl	307eba5bc9	Merge pull request #82 from paperlinguist/patch-1 fix: Windows Mimetype errors in server module (Issue#49)	2026-01-23 12:10:42 +02:00
Abigail Green	89e3b7af99	Update settings.py Adding in fix for windows issue	2026-01-22 14:13:52 -07:00
Marian Paul	edff398fe6	test: add safe environment variable handling in integration tests Changes: - Add temporary_home() context manager for safe HOME manipulation - Handle both Unix (HOME) and Windows (USERPROFILE, HOMEDRIVE, HOMEPATH) - Update test_org_blocklist_enforcement to use context manager - Update test_org_allowlist_inheritance to use context manager Benefits: - Environment variables always restored, even on exceptions - Prevents test pollution across test runs - Cross-platform compatibility (Windows + Unix) All 9 integration tests passing.	2026-01-22 16:31:50 +01:00
Marian Paul	996ac0065c	fix: improve path matching and org config validation Changes: - Support path patterns without ./ prefix (e.g., 'scripts/test.sh') - Reject non-string or empty command names in org config - Add 8 new test cases (5 for path patterns, 3 for validation) Details: - matches_pattern() now treats any pattern with '/' as a path pattern - load_org_config() validates that cmd['name'] is a non-empty string - All 148 unit tests + 9 integration tests passing Security hardening: Prevents invalid command names from reaching pattern matching logic, reducing attack surface.	2026-01-22 15:35:00 +01:00
Auto	b00eef5eca	refactor: orchestrator pre-selects features for all agents Replace agent-initiated feature selection with orchestrator pre-selection for both coding and testing agents. This ensures Mission Control displays correct feature numbers for testing agents (previously showed "Feature #0"). Key changes: MCP Server (mcp_server/feature_mcp.py): - Add feature_get_by_id tool for agents to fetch assigned feature details - Remove obsolete tools: feature_get_next, feature_claim_next, feature_claim_for_testing, feature_get_for_regression - Remove helper functions and unused imports (text, OperationalError, func) Orchestrator (parallel_orchestrator.py): - Change running_testing_agents from list to dict[int, Popen] - Add claim_feature_for_testing() with random selection - Add release_testing_claim() method - Pass --testing-feature-id to spawned testing agents - Use unified [Feature #X] output format for both agent types Agent Entry Points: - autonomous_agent_demo.py: Add --testing-feature-id CLI argument - agent.py: Pass testing_feature_id to get_testing_prompt() Prompt Templates: - coding_prompt.template.md: Update to use feature_get_by_id - testing_prompt.template.md: Update workflow for pre-assigned features - prompts.py: Update pre-claimed headers for both agent types WebSocket (server/websocket.py): - Simplify tracking with unified [Feature #X] pattern - Remove testing-specific parsing code Assistant (server/services/assistant_chat_session.py): - Update help text with current available tools Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 16:24:48 +02:00
Auto	357083dbae	feat: decouple regression testing agents from coding agents Major refactoring of the parallel orchestrator to run regression testing agents independently from coding agents. This improves system reliability and provides better control over testing behavior. Key changes: Database & MCP Layer: - Add testing_in_progress and last_tested_at columns to Feature model - Add feature_claim_for_testing() for atomic test claim with retry - Add feature_release_testing() to release claims after testing - Refactor claim functions to iterative loops (no recursion) - Add OperationalError retry handling for transient DB errors - Reduce MAX_CLAIM_RETRIES from 10 to 5 Orchestrator: - Decouple testing agent lifecycle from coding agents - Add _maintain_testing_agents() for continuous testing maintenance - Fix TOCTOU race in _spawn_testing_agent() - hold lock during spawn - Add _cleanup_stale_testing_locks() with 30-min timeout - Fix log ordering - start_session() before stale flag cleanup - Add stale testing_in_progress cleanup on startup Dead Code Removal: - Remove count_testing_in_concurrency from entire stack (12+ files) - Remove ineffective with_for_update() from features router API & UI: - Pass testing_agent_ratio via CLI to orchestrator - Update testing prompt template to use new claim/release tools - Rename UI label to "Regression Agents" with clearer description - Add process_utils.py for cross-platform process tree management Testing agents now: - Run continuously as long as passing features exist - Can re-test features multiple times to catch regressions - Are controlled by fixed count (0-3) via testing_agent_ratio setting - Have atomic claiming to prevent concurrent testing of same feature Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 15:22:48 +02:00
Marian Paul	f1b48be10e	feat: increase command limit to 100 and add optimization guide Changes: - Increase command limit from 50 to 100 per project - Add examples/OPTIMIZE_CONFIG.md with optimization strategies - Update all documentation references (50 → 100) - Update tests for new limit Rationale: - 50 was too restrictive for projects with many tools (Flutter, etc.) - Users were unknowingly exceeding limit by listing subcommands - 100 provides headroom while maintaining security - New guide teaches wildcard optimization (flutter* vs listing each subcommand) UI feedback idea: Show command count and optimization suggestions (tracked for Phase 3 or future enhancement)	2026-01-22 13:29:33 +01:00
Marian Paul	d1dac1383d	security: prevent bare wildcard '' from matching all commands Add validation to reject bare wildcards for security: - matches_pattern(): return False if pattern == '' - validate_project_command(): reject name == '*' with clear error - Added 4 new tests for bare wildcard rejection This prevents a config with from matching every command, which would be a major security risk. Tests: 140 unit tests passing (added 4 bare wildcard tests)	2026-01-22 12:40:31 +01:00
Marian Paul	a9a0fcd865	feat: add per-project bash command allowlist system Implement hierarchical command security with project and org-level configs: WHAT'S NEW: - Project-level YAML config (.autocoder/allowed_commands.yaml) - Organization-level config (~/.autocoder/config.yaml) - Pattern matching (exact, wildcards, local scripts) - Hardcoded blocklist (sudo, dd, shutdown - never allowed) - Org blocklist (terraform, kubectl - configurable) - Helpful error messages with config hints - Comprehensive documentation and examples ARCHITECTURE: - Hierarchical resolution: Hardcoded → Org Block → Org Allow → Global → Project - YAML validation with 50 command limit per project - Pattern matching: exact ("swift"), wildcards ("swift"), scripts ("./build.sh") - Secure by default: all examples commented out TESTING: - 136 unit tests (pattern matching, YAML, hierarchy, validation) - 9 integration tests (real security hook flows) - All tests passing, 100% backward compatible DOCUMENTATION: - examples/README.md - comprehensive guide with use cases - examples/project_allowed_commands.yaml - template (all commented) - examples/org_config.yaml - org config template (all commented) - PHASE3_SPEC.md - mid-session approval spec (future enhancement) - Updated CLAUDE.md with security model documentation USE CASES: - iOS projects: Add Swift toolchain (xcodebuild, swift, etc.) - Rust projects: Add cargo, rustc, clippy - Enterprise: Block aws, kubectl, terraform org-wide - Custom scripts: Allow ./scripts/build.sh PHASES: ✅ Phase 1: Project YAML + blocklist (implemented) ✅ Phase 2: Org config + hierarchy (implemented) 📋 Phase 3: Mid-session approval (spec ready, not implemented) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-22 12:29:20 +01:00
Auto	29c6b252a9	fix: correct SDK import and clear stale agent UI on stop Changes: - Revert incorrect import from claude_code_sdk to claude_agent_sdk in agent.py (PR #50 introduced an undocumented change to a deprecated package) - Clear activeAgents and recentActivity in useWebSocket when agent stops to prevent stale UI state The claude_code_sdk package is deprecated (last updated Sep 2025) while claude_agent_sdk is the active, maintained package. The import change in PR #50 was undocumented and would have caused ImportError since only claude-agent-sdk is specified in requirements.txt. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 09:39:24 +02:00
Leon van Zyl	a71406c2b5	Merge pull request #50 from kunalnano/fix/agent-completion-exit fix: exit agent loop when all features pass	2026-01-22 09:35:56 +02:00
Auto	9039108e82	perf: split bundle into smaller chunks for better caching Configure Vite's manualChunks to split the 1MB monolithic bundle into separate vendor chunks: - vendor-react (141 kB): React core libraries - vendor-query (42 kB): TanStack React Query - vendor-flow (270 kB): React Flow and dagre for graph visualization - vendor-xterm (334 kB): Terminal emulator - vendor-ui (27 kB): Radix UI components and Lucide icons - index (210 kB): Application code Benefits: - All chunks now under 500 kB warning threshold - Vendor chunks cache independently from app code - Parallel loading of chunks improves initial load time Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 09:16:43 +02:00
Auto	28e8bd6da8	fix: address performance and code quality issues in conversation history Performance improvements: - Fix N+1 query in get_conversations() using COUNT subquery instead of len(c.messages) which triggered lazy loading for each conversation - Add SQLAlchemy engine caching to avoid creating new database connections on every request - Add React.memo to ChatMessage component to prevent unnecessary re-renders during message streaming - Move BOLD_REGEX to module scope to avoid recreating on each render Code quality improvements: - Remove 10+ console.log debug statements from AssistantChat.tsx and AssistantPanel.tsx that were left from development - Add user feedback for delete errors in ConversationHistory - dialog now stays open and shows error message instead of silently failing - Update ConfirmDialog to accept ReactNode for message prop to support rich error content These changes address issues identified in the code review of PR #74 (conversation history feature). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 09:09:05 +02:00
Leon van Zyl	35ed14dfe3	Merge pull request #74 from lirielgozi/feature-conversation-history feature: add conversation history feature to AI assistant	2026-01-22 09:00:52 +02:00
Auto	0736b5ec6b	fix: address critical issues in PR #75 agent scheduling feature This commit fixes several issues identified in the agent scheduling feature from PR #75: Frontend Fixes: - Add day boundary handling in timeUtils.ts for timezone conversions - Add utcToLocalWithDayShift/localToUTCWithDayShift functions - Add shiftDaysForward/shiftDaysBackward helpers for bitfield adjustment - Update ScheduleModal to correctly adjust days_of_week when crossing day boundaries during UTC conversion (fixes schedules running on wrong days for users in extreme timezones like UTC+9) Backend Fixes: - Add MAX_SCHEDULES_PER_PROJECT (50) limit to prevent resource exhaustion - Wire up crash recovery callback in scheduler_service._start_agent() - Convert schedules.py endpoints to use context manager for DB sessions - Fix race condition in override creation with atomic delete-then-create - Replace deprecated datetime.utcnow with datetime.now(timezone.utc) - Add DB-level CHECK constraints for Schedule model fields Files Modified: - api/database.py: Add _utc_now helper, CheckConstraint imports, constraints - progress.py: Replace deprecated datetime.utcnow - server/routers/schedules.py: Add context manager, schedule limits - server/services/assistant_database.py: Replace deprecated datetime.utcnow - server/services/scheduler_service.py: Wire crash recovery, fix race condition - ui/src/components/ScheduleModal.tsx: Use day shift functions - ui/src/lib/timeUtils.ts: Add day boundary handling functions Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 08:35:57 +02:00
Leon van Zyl	44e333d034	Merge pull request #75 from ipodishima/feature/agent-scheduling feat: add time-based agent scheduling with APScheduler	2026-01-22 08:24:52 +02:00
Auto	33e9f38633	fix: prevent testing agents from running indefinitely This fix addresses two root causes that caused testing agents to accumulate (10-12 agents) instead of maintaining a 1:1 ratio with coding agents: 1. Testing agents now exit after one session (agent.py) - Added `or agent_type == "testing"` to the exit condition - Previously, testing agents never hit the exit condition since they're spawned with feature_id=None 2. Testing agents now spawn when coding agents START, not complete - Moved spawn logic from _on_agent_complete() to start_feature() - Removed the old spawn logic from _on_agent_complete() - This ensures proper 1:1 ratio and prevents accumulation Expected behavior after fix: - First coding agent: no testing agent (no passing features yet) - Subsequent coding agents: one testing agent spawns per start - Each testing agent tests ONE feature then terminates immediately Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 08:23:47 +02:00
Auto	f9d9ad9b85	fix: revert unsafe permission changes from PR #78 Security fixes to restore defense-in-depth after merging PR #78: client.py: - Revert permission mode from "bypassPermissions" to "acceptEdits" - Remove redundant web_tools_auto_approve_hook from PreToolUse hooks - Remove unused import of web_tools_auto_approve_hook security.py: - Remove web_tools_auto_approve_hook function (was redundant and returned {} for ALL tools, not just WebFetch/WebSearch) server/services/spec_chat_session.py: - Restore allowed_tools restriction: [Read, Write, Edit, Glob, WebFetch, WebSearch] - Revert permission mode from "bypassPermissions" to "acceptEdits" - Keeps setting_sources=["project", "user"] for global skills access ui/src/components/AgentAvatar.tsx: - Remove unused getMascotName export to fix React Fast Refresh warning - File now only exports AgentAvatar component as expected The bypassPermissions mode combined with unrestricted tool access in spec_chat_session.py created a security gap where Bash commands could execute without validation (sandbox disabled, no bash_security_hook). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 08:04:53 +02:00
Leon van Zyl	47dabb5f08	Merge pull request #78 from mmereu/master feat: add Create Spec button and fix Windows asyncio subprocess	2026-01-22 07:58:29 +02:00
Marian Paul	409560fb97	Add concurrent agent	2026-01-21 15:17:01 +01:00
mmereu	5937205bf8	Merge branch 'leonvanzyl:master' into master	2026-01-19 22:03:29 +01:00
mmereu	245cc5b7ad	feat: add "Create Spec" button and fix Windows asyncio subprocess UI Changes: - Add "Create Spec with AI" button in empty kanban when project has no spec - Button opens SpecCreationChat to guide users through spec creation - Shows in Pending column when has_spec=false and no features exist Windows Fixes: - Fix asyncio subprocess NotImplementedError on Windows - Set WindowsProactorEventLoopPolicy in server/__init__.py - Remove --reload from uvicorn (incompatible with Windows subprocess) - Add process cleanup on startup in start_ui.bat Spec Chat Improvements: - Enable full tool access (remove allowed_tools restriction) - Add "user" to setting_sources for global skills access - Use bypassPermissions mode for auto-approval - Add WebFetch/WebSearch auto-approve hook Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-19 21:53:09 +01:00
Marian Paul	71f3271274	Fix dark mode	2026-01-19 16:43:05 +01:00
Marian Paul	b34a116467	Fix review	2026-01-19 11:06:04 +01:00
Marian Paul	bd304b3878	Fix Ruff	2026-01-19 10:38:47 +01:00
Marian Paul	a6fe2ef633	Review	2026-01-19 10:31:23 +01:00
Marian Paul	0bab585630	feat: add time-based agent scheduling with APScheduler Add comprehensive scheduling system that allows agents to automatically start and stop during configured time windows, helping users manage Claude API token limits by running agents during off-hours. Backend Changes: - Add Schedule and ScheduleOverride database models for persistent storage - Implement APScheduler-based SchedulerService with UTC timezone support - Add schedule CRUD API endpoints (/api/projects/{name}/schedules) - Add manual override tracking to prevent unwanted auto-start/stop - Integrate scheduler lifecycle with FastAPI startup/shutdown - Fix timezone bug: explicitly set timezone=timezone.utc on CronTrigger to ensure correct UTC scheduling (critical fix) Frontend Changes: - Add ScheduleModal component for creating and managing schedules - Add clock button and schedule status display to AgentControl - Add timezone utilities for converting between UTC and local time - Add React Query hooks for schedule data fetching - Fix 204 No Content handling in fetchJSON for delete operations - Invalidate nextRun cache when manually stopping agent during window - Add TypeScript type annotations to Terminal component callbacks Features: - Multiple overlapping schedules per project supported - Auto-start at scheduled time via APScheduler cron jobs - Auto-stop after configured duration - Manual start/stop creates persistent overrides in database - Crash recovery with exponential backoff (max 3 retries) - Server restart preserves schedules and active overrides - Times displayed in user's local timezone, stored as UTC - Immediate start if schedule created during active window Dependencies: - Add APScheduler for reliable cron-like scheduling Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-19 10:31:23 +01:00
Auto	fbe4c399ac	fix: improve build_frontend reliability and cross-platform compatibility Addresses concerns from PR #76 code review: - Add exception handling for stat() calls to prevent crashes from race conditions when files are deleted/modified during iteration - Add 2-second timestamp tolerance for FAT32 filesystem compatibility (FAT32 has 2-second mtime precision on USB drives/SD cards) - Add config file checks (package.json, vite.config.ts, tailwind.config.ts, tsconfig.json, etc.) that also require rebuilds when changed - Add logging to show which file triggered the rebuild for debugging Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-19 10:26:01 +02:00
Leon van Zyl	3c80611b59	Merge pull request #76 from ipodishima/fix/ui-build-detection fix: improve UI build detection to check source file timestamps	2026-01-19 10:23:10 +02:00
Auto	6c8b463891	fix: use is_initializer instead of undefined is_first_run variable The PR #77 introduced a bug where `is_first_run` was used in the completion detection check, but this variable is only defined when `agent_type is None`. When the orchestrator runs agents with explicit `--agent-type` or `--feature-id`, the variable is undefined causing a NameError crash. Changed to use `is_initializer` which is always defined and has the correct semantic meaning for this check. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-19 09:03:38 +02:00
Leon van Zyl	0391df8fb2	Merge pull request #77 from rohitpalod/fix/agent-completion-detection fix: add completion detection to prevent infinite loop when all features done	2026-01-19 09:00:30 +02:00
Auto	13128361b0	feat: add dedicated testing agents and enhanced parallel orchestration Introduce a new testing agent architecture that runs regression tests independently from coding agents, improving quality assurance in parallel mode. Key changes: Testing Agent System: - Add testing_prompt.template.md for dedicated testing agent role - Add feature_mark_failing MCP tool for regression detection - Add --agent-type flag to select initializer/coding/testing mode - Remove regression testing from coding prompt (now handled by testing agents) Parallel Orchestrator Enhancements: - Add testing agent spawning with configurable ratio (--testing-agent-ratio) - Add comprehensive debug logging system (DebugLog class) - Improve database session management to prevent stale reads - Add engine.dispose() calls to refresh connections after subprocess commits - Fix f-string linting issues (remove unnecessary f-prefixes) UI Improvements: - Add testing agent mascot (Chip) to AgentAvatar - Enhance AgentCard to display testing agent status - Add testing agent ratio slider in SettingsModal - Update WebSocket handling for testing agent updates - Improve ActivityFeed to show testing agent activity API & Server Updates: - Add testing_agent_ratio to settings schema and endpoints - Update process manager to support testing agent type - Enhance WebSocket messages for agent_update events Template Changes: - Delete coding_prompt_yolo.template.md (consolidated into main prompt) - Update initializer_prompt.template.md with improved structure - Streamline coding_prompt.template.md workflow Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-18 13:49:50 +02:00
Rohit Palod	ffdd97a3f7	fix: add completion detection to prevent infinite loop when all features done The agent was running in an infinite loop when all kanban features were completed. This happened because: 1. The main loop in agent.py had no completion detection 2. The coding prompt instructs Claude to run regression tests BEFORE checking for new features 3. feature_get_next() returns "All features passing!" but nothing acted on it This fix adds three completion checks: 1. Pre-loop check: Exits immediately if project is already 100% complete when the agent starts (avoids running unnecessary sessions) 2. Post-session check: After each session, checks if all features are now passing and exits gracefully with a success message 3. Single-feature mode: Exits after one session since the parallel orchestrator manages spawning new agents for other features Tested with a project that had 240/240 features passing - agent now exits immediately with "ALL FEATURES ALREADY COMPLETE!" message. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-18 12:59:37 +05:30
Marian Paul	32fb4dce09	fix: improve UI build detection to check source file timestamps Enhance the build_frontend() function to detect when source files have been modified more recently than the newest file in dist/ directory. This ensures the UI is rebuilt automatically when source code changes, preventing stale UI from being served after pulling updates or switching branches. Changes: - Find newest modification time among all files in ui/dist/ - Compare each source file in ui/src/ against newest dist file - Trigger rebuild if any source file is newer than newest dist file - Handle edge case when dist/ exists but contains no files - Prevent serving outdated JavaScript bundles after code changes This fix applies to all UI launch methods (start_ui.sh, start_ui.bat) since they all call start_ui.py which contains the build logic. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-17 21:45:14 +01:00
Auto	5f786078fa	fix: prevent orchestrator early exit due to stale session cache The parallel orchestrator was exiting prematurely with "All features complete!" while pending features remained. This was caused by SQLAlchemy session caching not seeing commits made by agent subprocesses. Changes: - Add session.expire_all() to get_resumable_features() to force fresh reads - Add session.expire_all() to get_ready_features() to force fresh reads - Add session.expire_all() to get_all_complete() to force fresh reads - Add defensive retry logic in run_loop() when no features are ready but nothing is running - now forces a fresh check before declaring blocked - Add debug logging to get_all_complete() and get_ready_features() to track passing/pending/in_progress counts for easier diagnosis The root cause was cross-process database visibility: when an agent subprocess committed feature completion, the orchestrator's session had cached the old state and didn't see the update. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 15:25:12 +02:00
Auto	64b65311fe	chore: clean up unused imports and sort import blocks Remove unused imports and organize import statements to pass ruff linting checks: - mcp_server/feature_mcp.py: Remove unused imports (are_dependencies_satisfied, get_blocking_dependencies) and alphabetize import block - parallel_orchestrator.py: Remove unused imports (time, Awaitable) and add blank lines between import groups per PEP 8 - server/routers/features.py: Alphabetize imports in dependency resolver These changes were identified by running `ruff check .` and auto-fixed with `--fix` flag. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 15:05:25 +02:00
Auto	126151dccd	fix: production readiness fixes for dependency trees and parallel agents Critical fixes: - Lock file TOCTOU race condition: Use atomic O_CREAT\|O_EXCL for lock creation - PID reuse vulnerability on Windows: Store PID:CREATE_TIME in lock file to detect when a different process has reused the same PID - WAL mode on network drives: Detect network paths (UNC, mapped drives, NFS, CIFS) and fall back to DELETE journal mode to prevent corruption High priority fixes: - JSON migration now preserves dependencies field during legacy migration - Process tree termination on Windows: Use psutil to kill child processes recursively to prevent orphaned browser instances - Retry backoff jitter: Add random 30% jitter to prevent synchronized retries under high contention with 5 concurrent agents Files changed: - server/services/process_manager.py: Atomic lock creation, PID+create_time - api/database.py: Network filesystem detection for WAL mode fallback - api/migration.py: Add dependencies field to JSON migration - parallel_orchestrator.py: _kill_process_tree helper function - mcp_server/feature_mcp.py: Add jitter to exponential backoff Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 14:45:27 +02:00
Auto	92450a0029	fix graph refresh issue	2026-01-17 14:31:00 +02:00
Auto	76e6521331	fix: prevent dependency graph from going blank during agent activity - Memoize onNodeClick callback in App.tsx to prevent unnecessary re-renders - Add useRef pattern in DependencyGraph to store callback without triggering useMemo recalculation when callback identity changes - Add hash-based change detection to only update ReactFlow state when actual graph data changes (node status, edges), not on every parent render - Add GraphErrorBoundary class component to catch ReactFlow rendering errors and provide a "Reload Graph" recovery button instead of blank screen - Wrap DependencyGraph with error boundary and resetKey for graceful recovery The root cause was frequent WebSocket updates during active agent sessions causing parent re-renders, which created new inline callback functions, triggering useMemo/useEffect chains that corrupted ReactFlow's internal state over time (approximately 1 minute of continuous updates). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 14:19:56 +02:00
Auto	bf3a6b0b73	feat: add per-agent logging UI and fix stuck agent issues Changes: - Add per-agent log viewer with copy-to-clipboard functionality - New AgentLogEntry type for structured log entries - Logs stored per-agent in WebSocket state (up to 500 entries) - Log modal rendered via React Portal to avoid overflow issues - Click log icon on agent card to view full activity history - Fix agents getting stuck in "failed" state - Wrap client context manager in try/except (agent.py) - Remove failed agents from UI on error state (useWebSocket.ts) - Handle permanently failed features in get_all_complete() - Add friendlier agent state labels - "Hit an issue" → "Trying plan B..." - "Retrying..." → "Being persistent..." - Softer colors (yellow/orange instead of red) - Add scheduling scores for smarter feature ordering - compute_scheduling_scores() in dependency_resolver.py - Features that unblock others get higher priority - Update CLAUDE.md with parallel mode documentation Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 14:11:24 +02:00
Auto	85f6940a54	feat: add concurrent agents with dependency system and delightful UI Major feature implementation for parallel agent execution with dependency-aware scheduling and an engaging multi-agent UI experience. Backend Changes: - Add parallel_orchestrator.py for concurrent feature processing - Add api/dependency_resolver.py with cycle detection (Kahn's algorithm + DFS) - Add atomic feature_claim_next() with retry limit and exponential backoff - Fix circular dependency check arguments in 4 locations - Add AgentTracker class for parsing agent output and emitting updates - Add browser isolation with --isolated flag for Playwright MCP - Extend WebSocket protocol with agent_update messages and log attribution - Add WSAgentUpdateMessage schema with agent states and mascot names - Fix WSProgressMessage to include in_progress field New UI Components: - AgentMissionControl: Dashboard showing active agents with collapsible activity - AgentCard: Individual agent status with avatar and thought bubble - AgentAvatar: SVG mascots (Spark, Fizz, Octo, Hoot, Buzz) with animations - ActivityFeed: Recent activity stream with stable keys (no flickering) - CelebrationOverlay: Confetti animation with click/Escape dismiss - DependencyGraph: Interactive node graph visualization with dagre layout - DependencyBadge: Visual indicator for feature dependencies - ViewToggle: Switch between Kanban and Graph views - KeyboardShortcutsHelp: Help overlay accessible via ? key UI/UX Improvements: - Celebration queue system to handle rapid success messages - Accessibility attributes on AgentAvatar (role, aria-label, aria-live) - Collapsible Recent Activity section with persisted preference - Agent count display in header - Keyboard shortcut G to toggle Kanban/Graph view - Real-time thought bubbles and state animations Bug Fixes: - Fix circular dependency validation (swapped source/target arguments) - Add MAX_CLAIM_RETRIES=10 to prevent stack overflow under contention - Fix THOUGHT_PATTERNS to match actual [Tool: name] format - Fix ActivityFeed key prop to prevent re-renders on new items - Add featureId/agentIndex to log messages for proper attribution Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 12:59:42 +02:00
liri	c229e2b39b	fix: address CodeRabbitAI review comments for conversation history - Fix duplicate onConversationCreated callbacks by tracking activeConversationId - Fix history loss when switching conversations with Map-based deduplication - Disable input while conversation is loading to prevent message routing issues - Gate WebSocket debug logs behind DEV flag (import.meta.env.DEV) - Downgrade server logging from info to debug level for reduced noise - Fix .gitignore prefixes for playwright paths (ui/playwright-report/, ui/test-results/) - Remove debug console.log from ConversationHistory.tsx - Add staleTime (30s) to single conversation query for better caching - Increase history message cap from 20 to 35 for better context - Replace fixed timeouts with condition-based waits in e2e tests	2026-01-16 22:43:15 +00:00

1 2 3

130 Commits