autocoder

mirror of https://github.com/leonvanzyl/autocoder.git synced 2026-03-21 12:53:09 +00:00

Author	SHA1	Message	Date
Auto	8b2251331d	feat: increase batch size limits to 15 and add testing_batch_size setting Batch size configuration: - Increase coding agent batch size limit from 1-3 to 1-15 - Increase testing agent batch size limit from 1-5 to 1-15 - Add separate `testing_batch_size` setting (previously only CLI-configurable) - Pass testing_batch_size through full stack: schema → settings router → agent router → process manager → CLI flag UI changes: - Replace 3-button batch size selector with range slider (1-15) - Add new Slider component (ui/src/components/ui/slider.tsx) - Add "Features per Testing Agent" slider in settings panel - Add custom slider CSS styling for webkit and mozilla Updated across: CLAUDE.md, autonomous_agent_demo.py, parallel_orchestrator.py, server/{schemas,routers,services}, and UI types/hooks/components. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 13:39:19 +02:00
Auto	41c1a14ae3	feat: add scaffold router and project template selection step Add a new scaffold system that lets users choose a project template (blank or agentic starter) during project creation. This inserts a template selection step between folder selection and spec method choice. Backend: - New server/routers/scaffold.py with SSE streaming endpoint for running hardcoded scaffold commands (npx create-agentic-app) - Path validation, security checks, and cross-platform npx resolution - Registered scaffold_router in server/main.py and routers/__init__.py Frontend (NewProjectModal.tsx): - New "template" step with Blank Project and Agentic Starter cards - Real-time scaffold output streaming with auto-scroll log viewer - Success, error, and retry states with proper back-navigation - Updated step flow: name → folder → template → method → chat/complete Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:18:55 +02:00
Auto	819ebcd112	Merge remote-tracking branch 'origin/master' into feature/blocked-for-human-input # Conflicts: # server/services/process_manager.py	2026-02-12 07:36:11 +02:00
Caitlyn Byrne	656df0fd9a	feat: add "blocked for human input" feature across full stack Agents can now request structured human input when they encounter genuine blockers (API keys, design choices, external configs). The request is displayed in the UI with a dynamic form, and the human's response is stored and made available when the agent resumes. Changes span 21 files + 1 new component: - Database: 3 new columns (needs_human_input, human_input_request, human_input_response) with migration - MCP: new feature_request_human_input tool + guards on existing tools - API: new resolve-human-input endpoint, 4th feature bucket - Orchestrator: skip needs_human_input features in scheduling - Progress: 4-tuple return from count_passing_tests - WebSocket: needs_human_input count in progress messages - UI: conditional 4th Kanban column, HumanInputForm component, amber status indicators, dependency graph support Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 14:11:35 -05:00
Caitlyn Byrne	9721368188	feat: add graceful pause (drain mode) for running agents File-based signal (.pause_drain) lets the orchestrator finish current work before pausing instead of hard-freezing the process tree. New status states pausing/paused_graceful flow through WebSocket to the UI where a Pause button, draining indicator, and Resume button are shown. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 13:37:22 -05:00
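The drain behaviour described in this commit can be sketched as a loop that checks for the signal file before starting each new unit of work. This is a simplified illustration, not the orchestrator's code: `run_until_drained` and the task callables are hypothetical, and only the `.pause_drain` file name and the `paused_graceful` status come from the commit message.

```python
from pathlib import Path
import tempfile

PAUSE_SIGNAL = ".pause_drain"  # file name taken from the commit message


def run_until_drained(project_dir, tasks):
    """Run tasks in order; stop starting new ones once the signal file exists."""
    done = []
    for task in tasks:
        # Checked only between tasks, so work already running is never interrupted.
        if (project_dir / PAUSE_SIGNAL).exists():
            return done, "paused_graceful"
        done.append(task())
    return done, "finished"


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)

    def first():
        return "a"

    def second():
        (root / PAUSE_SIGNAL).touch()  # pause requested while this task runs
        return "b"

    def third():
        return "c"

    outcome = run_until_drained(root, [first, second, third])
```

The second task completes even though the pause was requested during it, and the third task is never started.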
Auto	71f17c73c2	feat: add structured questions (AskUserQuestion) to assistant chat Add interactive multiple-choice question support to the project assistant, allowing it to present clickable options when clarification is needed. Backend changes: - Add ask_user MCP tool to feature_mcp.py with input validation - Add mcp__features__ask_user to assistant allowed tools list - Intercept ask_user tool calls in _query_claude() to yield question messages - Add answer WebSocket message handler in assistant_chat router - Document ask_user tool in assistant system prompt Frontend changes: - Add AssistantChatQuestionMessage type and update server message union - Add currentQuestions state and sendAnswer() to useAssistantChat hook - Handle question WebSocket messages by attaching to last assistant message - Render QuestionOptions component between messages and input area - Disable text input while structured questions are active Flow: Claude calls ask_user → backend intercepts → WebSocket question message → frontend renders QuestionOptions → user clicks options → answer sent back → Claude receives formatted answer and continues conversation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 15:26:36 +02:00
Auto	a52f191a54	refactor: make Settings UI the single source of truth for API provider Remove legacy env-var-based provider/mode detection that caused misleading UI badges (e.g., GLM badge showing when Settings was set to Claude). Key changes: - Remove _is_glm_mode() and _is_ollama_mode() env-var sniffing functions from server/routers/settings.py; derive glm_mode/ollama_mode purely from the api_provider setting - Remove `import os` from settings router (no longer needed) - Update schema comments to reflect settings-based derivation - Remove "(configured via .env)" from badge tooltips in App.tsx - Remove Kimi/GLM/Ollama/Playwright-headless sections from .env.example; add note pointing to Settings UI - Update CLAUDE.md and README.md documentation to reference Settings UI for alternative provider configuration - Update model IDs from claude-opus-4-5-20251101 to claude-opus-4-6 across registry, client, chat sessions, tests, and UI defaults - Add LEGACY_MODEL_MAP with auto-migration in get_all_settings() - Show model ID subtitle in SettingsModal model selector - Add Vertex passthrough test for claude-opus-4-6 (no date suffix) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 09:23:06 +02:00
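The auto-migration mentioned in this commit can be pictured as a lookup applied when settings are read. This is a minimal sketch under assumptions: the map contents are inferred from the model ID change named in the message, and the `migrate_model` helper and the `"model"` key are hypothetical.

```python
# Assumed contents: the commit names this one ID change but does not list the map.
LEGACY_MODEL_MAP = {"claude-opus-4-5-20251101": "claude-opus-4-6"}


def migrate_model(settings: dict) -> dict:
    """Return a copy of settings with a legacy model ID replaced by its successor."""
    migrated = dict(settings)
    model = migrated.get("model")  # hypothetical key name
    if model in LEGACY_MODEL_MAP:
        migrated["model"] = LEGACY_MODEL_MAP[model]
    return migrated
```

Unknown or already-current model IDs pass through unchanged, so the migration is safe to run on every read.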
Auto	73d6cfcd36	fix: address PR #163 review findings - Fix model selection regression: _get_settings_defaults() now checks api_model (set by new provider UI) before falling back to legacy model setting, ensuring Claude model selection works end-to-end - Add input validation for provider settings: api_base_url must start with http:// or https:// (max 500 chars), api_auth_token max 500 chars, api_model max 200 chars - Fix terminal.py misleading import alias: replace is_valid_project_name aliased as validate_project_name with direct is_valid_project_name import across all 5 call sites Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 08:10:18 +02:00
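The provider-setting limits listed in this commit can be expressed as a small validator. This is a sketch only: the function name, the error strings, and the choice to treat empty values as valid are assumptions; the prefixes and length limits are the ones stated in the message.

```python
def validate_provider_settings(api_base_url="", api_auth_token="", api_model=""):
    """Return a list of error strings; an empty list means the values pass."""
    errors = []
    if api_base_url:  # assumption: an empty value means "not set" and is allowed
        if not api_base_url.startswith(("http://", "https://")):
            errors.append("api_base_url must start with http:// or https://")
        if len(api_base_url) > 500:
            errors.append("api_base_url exceeds 500 characters")
    if len(api_auth_token) > 500:
        errors.append("api_auth_token exceeds 500 characters")
    if len(api_model) > 200:
        errors.append("api_model exceeds 200 characters")
    return errors
```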
nioasoft	a752ece70c	fix: wrong import alias overwrote project_name with bool assistant_chat.py and spec_creation.py imported is_valid_project_name (returns bool) aliased as validate_project_name. When used as `project_name = validate_project_name(project_name)`, the project name was replaced with True, causing "Project not found in registry" errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 06:20:03 +02:00
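The failure mode described in this commit can be reproduced in a few lines. This is a minimal sketch, not the project's code: the validator body and its name pattern are assumptions; only the names `is_valid_project_name` and `validate_project_name` and the assignment pattern come from the commit message.

```python
import re


def is_valid_project_name(name: str) -> bool:
    """Hypothetical validator: returns a bool, never the name itself."""
    return bool(re.fullmatch(r"[A-Za-z0-9_-]{1,50}", name))


# The misleading alias: the name suggests it returns a validated string.
validate_project_name = is_valid_project_name

project_name = "my-app"
buggy = validate_project_name(project_name)  # the name is replaced by a bool

# Corrected usage: test the bool and keep the original string.
if not is_valid_project_name(project_name):
    raise ValueError("invalid project name")
fixed = project_name
```

With the alias, any later registry lookup receives `True` instead of the project name, which matches the "Project not found in registry" symptom.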
nioasoft	13785325d7	feat: add API provider selection UI and fix stuck features on agent crash API Provider Selection: - Add provider switcher in Settings modal (Claude, Kimi, GLM, Ollama, Custom) - Auth tokens stored locally only (registry.db), never returned by API - get_effective_sdk_env() builds provider-specific env vars for agent subprocess - All chat sessions (spec, expand, assistant) use provider settings - Backward compatible: defaults to Claude, env vars still work as override Fix Stuck Features: - Add _cleanup_stale_features() to process_manager.py - Reset in_progress features when agent stops, crashes, or fails healthcheck - Prevents features from being permanently stuck after rate limit crashes - Uses separate SQLAlchemy engine to avoid session conflicts with subprocess Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 05:55:51 +02:00
nioasoft	70131f2271	fix: accept WebSocket before validation to prevent opaque 403 errors All WebSocket endpoints now call websocket.accept() before any validation checks. Previously, closing the connection before accepting caused Starlette to return an opaque HTTP 403 instead of a meaningful error message. Changes: - Server: Accept WebSocket first, then send JSON error + close with 4xxx code if validation fails (expand, spec, assistant, terminal, main project WS) - Server: ConnectionManager.connect() no longer calls accept() to avoid double-accept - UI: Gate expand button and keyboard shortcut on hasSpec - UI: Skip WebSocket reconnection on application error codes (4000-4999) - UI: Update keyboard shortcuts help text Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-06 05:46:24 +02:00
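The client-side rule in this commit (do not reconnect after an application error) reduces to a range check on the close code. The UI hooks are written in TypeScript; the Python sketch below only illustrates the decision, and `should_reconnect` is a hypothetical name.

```python
# Close codes 4000-4999 are the application-defined range named in the commit.
APP_ERROR_CODES = range(4000, 5000)


def should_reconnect(close_code: int) -> bool:
    """Retry on transport-level closes; give up on application errors."""
    return close_code not in APP_ERROR_CODES
```

An application error such as a failed validation will not resolve on its own, so retrying would only repeat the same rejection.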
nioasoft	035e8fdfca	fix: accept WebSocket before validation to prevent opaque 403 errors All 5 WebSocket endpoints (expand, spec, assistant, terminal, project) were closing the connection before calling accept() when validation failed. Starlette converts pre-accept close into an HTTP 403, giving clients no meaningful error information. Server changes: - Move websocket.accept() before all validation checks in every WS handler - Send JSON error message before closing so clients get actionable errors - Fix validate_project_name usage (raises HTTPException, not returns bool) - ConnectionManager.connect() no longer calls accept() (caller's job) Client changes: - All 3 WS hooks (useWebSocket, useExpandChat, useSpecChat) skip reconnection on 4xxx close codes (application errors won't self-resolve) - Gate expand button, keyboard shortcut, and modal on hasSpec - Add hasSpec to useEffect dependency array to prevent stale closure - Update keyboard shortcuts help text for E key context Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 21:08:46 +02:00
Auto	c55a1a0182	fix: harden dev server RCE mitigations from PR #153 Address security gaps and improve validation in the dev server command execution path introduced by PR #153: Security fixes (critical): - Add missing shell metacharacters to dangerous_ops blocklist: single & (Windows cmd.exe command separator), >, <, ^, %, \n, \r - The single & gap was a confirmed RCE bypass on Windows where .cmd files are always executed via cmd.exe even with shell=False (CPython limitation documented in issue #77696) - Apply validate_custom_command_strict at /start endpoint for defense-in-depth against config file tampering Validation improvements: - Fix uvicorn --flag=value syntax (split on = before comparing) - Expand Python support: Django (manage.py), Flask, custom .py scripts - Add runners: flask, poetry, cargo, go, npx - Expand npm script allowlist: serve, develop, server, preview - Reorder PATCH /config validation to run strict check first (fail fast) - Extract constants: ALLOWED_NPM_SCRIPTS, ALLOWED_PYTHON_MODULES, BLOCKED_SHELLS for reuse and testability Cleanup: - Remove unused security.py imports from dev_server_manager.py - Fix deprecated datetime.utcnow() -> datetime.now(timezone.utc) - Remove unnecessary _remove_lock() in exception handlers where lock was never created (Popen failure path) Tests: - Add test_devserver_security.py with 78 tests covering valid commands, blocked shells, blocked commands, injection attempts, dangerous_ops blocking, and constant verification Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-05 08:52:47 +02:00
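Two of the checks described in this commit can be sketched briefly. This is an illustration under assumptions: `DANGEROUS_OPS` below contains only the characters the message says were added (the full blocklist is not shown in the log), and `has_dangerous_ops` and `flag_name` are hypothetical helper names.

```python
# Only the metacharacters this commit reports adding; the real list is longer.
DANGEROUS_OPS = ("&", ">", "<", "^", "%", "\n", "\r")


def has_dangerous_ops(command: str) -> bool:
    """True if the command contains any blocked shell metacharacter."""
    return any(op in command for op in DANGEROUS_OPS)


def flag_name(arg: str) -> str:
    """Split '--flag=value' on the first '=' so it compares equal to '--flag'."""
    return arg.split("=", 1)[0]
```

Blocking a single `&` matters on Windows because `cmd.exe` treats it as a command separator, which is the bypass the commit reports.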
Leon van Zyl	75766a433a	Merge pull request #153 from syphonetic/master Implemented RCE mitigation measures	2026-02-05 08:31:28 +02:00
Auto	c2ad993e75	rebrand: rename AutoCoder to AutoForge across entire codebase Complete project rebrand from AutoCoder to AutoForge, touching 62 files across Python backend, FastAPI server, React UI, documentation, config, and CI/CD. Key changes: - Rename autocoder_paths.py -> autoforge_paths.py with backward-compat migration from .autocoder/ -> .autoforge/ directories - Update registry.py to migrate ~/.autocoder/ -> ~/.autoforge/ global config directory with fallback support - Update security.py with fallback reads from legacy .autocoder/ paths - Rename .claude/commands and skills from gsd-to-autocoder-spec to gsd-to-autoforge-spec - Update all Python modules: client, prompts, progress, agent, orchestrator, server routers and services - Update React UI: package.json name, index.html title, localStorage keys, all documentation sections, component references - Update start scripts (bat/sh/py), examples, and .env.example - Update CLAUDE.md and README.md with new branding and paths - Update test files for new .autoforge/ directory structure - Transfer git remote from leonvanzyl/autocoder to AutoForgeAI/autoforge Backward compatibility preserved: legacy .autocoder/ directories are auto-detected and migrated on next agent start. Config fallback chain checks both new and old paths. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-04 12:02:06 +02:00
syphonetic	81d2f0cbe0	Merge branch 'master' into master	2026-02-04 05:50:35 +08:00
syphonetic	c7c88449ad	Remove unused dev server management functions Removed unused functions and endpoints related to dev server management, including command validation and configuration updates.	2026-02-04 02:34:29 +08:00
syphonetic	83d2182107	Refactor dev server API for security and validation Refactor dev server API to enhance security and command validation. Added logging and improved command handling.	2026-02-04 02:19:19 +08:00
Auto	1607fc8175	feat: add multi-feature batching for coding agents Enable the orchestrator to assign 1-3 features per coding agent subprocess, selected via dependency chain extension + same-category fill. This reduces cold-start overhead and leverages shared context across related features. Orchestrator (parallel_orchestrator.py): - Add batch tracking: _batch_features and _feature_to_primary data structures - Add build_feature_batches() with dependency chain + category fill algorithm - Add start_feature_batch() and _spawn_coding_agent_batch() methods - Update _on_agent_complete() for batch cleanup across all features - Update stop_feature() with _feature_to_primary lookup - Update get_ready_features() to exclude all batch feature IDs - Update main loop to build batches then spawn per available slot CLI and agent layer: - Add --feature-ids (comma-separated) and --batch-size CLI args - Add feature_ids parameter to run_autonomous_agent() with batch prompt selection - Add get_batch_feature_prompt() with sequential workflow instructions WebSocket layer (server/websocket.py): - Add BATCH_CODING_AGENT_START_PATTERN and BATCH_FEATURES_COMPLETE_PATTERN - Add _handle_batch_agent_start() and _handle_batch_agent_complete() methods - Add featureIds field to all agent_update messages - Track current_feature_id updates as agent moves through batch Frontend (React UI): - Add featureIds to ActiveAgent and WSAgentUpdateMessage types - Update KanbanColumn and DependencyGraph agent-feature maps for batch - Update AgentCard to show "Batch: #X, #Y, #Z" with active feature highlight - Add "Features per Agent" segmented control (1-3) in SettingsModal Settings integration (full stack): - Add batch_size to schemas, settings router, agent router, process manager - Default batch_size=3, user-configurable 1-3 via settings UI - batch_size=1 is functionally identical to pre-batching behavior Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 16:35:07 +02:00
Auto	24481d474d	feat: add headless browser toggle to settings UI Replace the PLAYWRIGHT_HEADLESS environment variable with a global setting toggle in the Settings modal. The setting is persisted in the registry DB and injected as an env var into agent subprocesses, so client.py reads it unchanged. Backend: - Add playwright_headless field to SettingsResponse/SettingsUpdate schemas - Read/write the setting in settings router via existing _parse_bool helper - Pass playwright_headless from agent router through to process manager - Inject PLAYWRIGHT_HEADLESS env var into subprocess environment Frontend: - Add playwright_headless to Settings/SettingsUpdate TypeScript types - Add "Headless Browser" Switch toggle below YOLO mode in SettingsModal - Add default value to DEFAULT_SETTINGS in useProjects Also fix CSS build warning: change @import url("tw-animate-css") to bare @import "tw-animate-css" so Tailwind v4 inlines it during compilation instead of leaving it for Vite/Lightning CSS post-processing. Remove stale summary.md from previous refactoring session. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 13:40:46 +02:00
Auto	94e0b05cb1	refactor: optimize token usage, deduplicate code, fix bugs across agents Token reduction (~40% per session, ~2.3M fewer tokens per 200-feature project): - Agent-type-specific tool lists: coding 9, testing 5, init 5 (was 19 for all) - Right-sized max_turns: coding 300, testing 100 (was 1000 for all) - Trimmed coding prompt template (~150 lines removed) - Streamlined testing prompt with batch support - YOLO mode now strips browser testing instructions from prompt - Added Grep, WebFetch, WebSearch to expand project session Performance improvements: - Rate limit retries start at ~15s with jitter (was fixed 60s) - Post-spawn delay reduced to 0.5s (was 2s) - Orchestrator consolidated to 1 DB query per loop (was 5-7) - Testing agents batch 3 features per session (was 1) - Smart context compaction preserves critical state, discards noise Bug fixes: - Removed ghost feature_release_testing MCP tool (wasted tokens every test session) - Forward all 9 Vertex AI env vars to chat sessions (was missing 3) - Fix DetachedInstanceError risk in test batch ORM access - Prevent duplicate testing of same features in parallel mode Code deduplication: - _get_project_path(): 9 copies -> 1 shared utility (project_helpers.py) - validate_project_name(): 9 copies -> 2 variants in 1 file (validation.py) - ROOT_DIR: 10 copies -> 1 definition (chat_constants.py) - API_ENV_VARS: 4 copies -> 1 source of truth (env_constants.py) Security hardening: - Unified sensitive directory blocklist (14 dirs, was two divergent lists) - Cached get_blocked_paths() for O(1) directory listing checks - Terminal security warning when ALLOW_REMOTE=1 exposes WebSocket - 20 new security tests for EXTRA_READ_PATHS blocking - Extracted _validate_command_list() and _validate_pkill_processes() helpers Type safety: - 87 mypy errors -> 0 across 58 source files - Installed types-PyYAML for proper yaml stub types - Fixed SQLAlchemy Column[T] coercions across all routers Dead code removed: - 13 files deleted (~2,679 lines): unused UI components, debug logs, outdated docs - 7 unused npm packages removed (Radix UI components with 0 imports) - AgentAvatar.tsx reduced from 615 -> 119 lines (SVGs extracted to mascotData.tsx) New CLI options: - --testing-batch-size (1-5) for parallel mode test batching - --testing-feature-ids for direct multi-feature testing Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 13:16:24 +02:00
Auto	dc5bcc4ae9	feat: move autocoder runtime files into .autocoder/ subdirectory Add centralized path resolution module (autocoder_paths.py) that consolidates all autocoder-generated file paths behind a dual-path strategy: check .autocoder/X first, fall back to root-level X for backward compatibility, default to .autocoder/X for new projects. Key changes: - New autocoder_paths.py with dual-path resolution for features.db, assistant.db, lock files, settings, prompts dir, and progress cache - migrate_project_layout() safely moves old-layout projects to new layout with SQLite WAL flush and integrity verification - Updated 22 files to delegate path construction to autocoder_paths - Reset/delete logic cleans both old and new file locations - Orphan lock cleanup checks both locations per project - Migration called automatically at agent start in autonomous_agent_demo.py - Updated markdown commands/skills to reference .autocoder/prompts/ - CLAUDE.md documentation updated with new project structure Files at project root that remain unchanged: - CLAUDE.md (Claude SDK reads from cwd via setting_sources=["project"]) - app_spec.txt root copy (agent templates reference it via cat) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 11:32:06 +02:00
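The dual-path strategy described in this commit (check `.autocoder/X` first, fall back to root-level `X`, default to `.autocoder/X`) can be sketched as follows. The function name `resolve_path` is hypothetical and the real `autocoder_paths.py` may be organised differently; only the lookup order is taken from the message.

```python
from pathlib import Path
import tempfile


def resolve_path(project_dir: Path, filename: str) -> Path:
    """Prefer the new layout, fall back to the legacy root-level file."""
    new = project_dir / ".autocoder" / filename
    legacy = project_dir / filename
    if new.exists():
        return new
    if legacy.exists():
        return legacy
    return new  # nothing exists yet: new projects default to the new layout


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    fresh = resolve_path(root, "features.db")       # empty project
    (root / "features.db").touch()
    old_layout = resolve_path(root, "features.db")  # legacy file only
    (root / ".autocoder").mkdir()
    (root / ".autocoder" / "features.db").touch()
    migrated = resolve_path(root, "features.db")    # both present
    results = (fresh.parent.name, old_layout.parent == root, migrated.parent.name)
```

When both copies exist the new location wins, which keeps a half-migrated project pointing at the migrated file.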
Auto	6609a0f7d6	fix: prevent PendingRollbackError and add MCP tool support for sessions - Add explicit session.rollback() in exception handlers for database context managers in features.py, schedules.py, and database.py get_db() to prevent SQLAlchemy PendingRollbackError on failed transactions - Add EXPAND_FEATURE_TOOLS to expand session security settings allow list so the expand skill can use the MCP tools it references - Update assistant session prompt to direct the LLM to call MCP tools directly for feature creation instead of suggesting CLI commands Cherry-picked fixes from PR #92 (closed) with cleaner implementation. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-01 09:15:24 +02:00
Auto	cf62885e83	feat: add project reset functionality with quick and full reset options Add the ability to reset a project to its initial state with two options: - Quick Reset: Clears features.db, assistant.db, and settings files while preserving app spec and prompts - Full Reset: Deletes everything including prompts directory, triggering the setup wizard for project reconfiguration Backend changes: - Add POST /{name}/reset endpoint to projects router with full_reset query param - Validate agent lock file to prevent reset while agent is running (409 Conflict) - Dispose database engines before deleting files to release Windows file locks - Add engine caching to api/database.py for better connection management - Add dispose_engine() functions to both database modules - Delete WAL mode journal files (.db-wal, .db-shm) during reset Frontend changes: - Add ResetProjectModal component with toggle between Quick/Full reset modes - Add ProjectSetupRequired component shown when has_spec is false - Add resetProject API function and useResetProject React Query hook - Integrate reset button in header (disabled when agent running) - Add 'R' keyboard shortcut to open reset modal - Show ProjectSetupRequired when project needs setup after full reset This implements the feature from PR #4 directly on master to avoid merge conflicts. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-29 10:42:05 +02:00
Auto	f6ddffa6e2	feat: persist concurrent agents slider at project level Add `default_concurrency` column to the projects table in the registry database, allowing each project to remember its preferred concurrency setting (1-5 agents). The value persists across page refreshes and app restarts. Backend changes: - Add `default_concurrency` column to Project model in registry.py - Add database migration for existing databases (ALTER TABLE) - Add get/set_project_concurrency() CRUD functions - Add ProjectSettingsUpdate schema with validation - Add PATCH /{name}/settings endpoint in projects router - Include default_concurrency in ProjectSummary/ProjectDetail responses Frontend changes: - Add default_concurrency to ProjectSummary TypeScript interface - Add ProjectSettingsUpdate type and updateProjectSettings API function - Add useUpdateProjectSettings React Query mutation hook - Update AgentControl to accept defaultConcurrency prop - Sync local state when project changes via useEffect - Debounce slider changes (500ms) before saving to backend - Pass defaultConcurrency from selectedProjectData in App.tsx Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-29 09:08:17 +02:00
Leon van Zyl	ccfd1aa73e	Merge pull request #97 from cabana8471-arch/fix/pydantic-datetime-serialization fix: Pydantic datetime serialization for API endpoints	2026-01-26 16:07:03 +02:00
Leon van Zyl	d5e423b805	Merge pull request #98 from cabana8471-arch/fix/skip-priority-consistency fix: use consistent priority increment when skipping features	2026-01-26 15:59:00 +02:00
Auto	095d248a66	add ollama support	2026-01-26 09:42:01 +02:00
cabana8471	d6ba075ac4	style: align priority calculation pattern with rest of file Address CodeRabbit feedback - use consistent conditional pattern: `(max_priority.priority + 1) if max_priority else 1` This matches the pattern used in create_feature and create_features_bulk. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 12:36:54 +01:00
cabana8471	6731ef44ea	fix: use consistent priority increment when skipping features (#65) The REST API skip endpoint was using max_priority + 1000, while the MCP server used max_priority + 1. This caused priority inflation where values could reach 10,000+ after multiple skips. Changed to use + 1 for consistency with mcp_server/feature_mcp.py:345. Fixes: leonvanzyl/autocoder#65 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 12:07:36 +01:00
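The priority inflation described in this commit is easy to see with a little arithmetic. In this sketch the `Feature` class, `next_skip_priority`, and `simulate_skips` are hypothetical stand-ins; the conditional expression is the pattern quoted in the follow-up style commit.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Feature:
    priority: int


def next_skip_priority(max_priority: Optional[Feature]) -> int:
    """Priority for a skipped feature: one past the current maximum."""
    return (max_priority.priority + 1) if max_priority else 1


def simulate_skips(start: int, increment: int, skips: int) -> int:
    """Highest priority after repeatedly skipping with a given increment."""
    top = start
    for _ in range(skips):
        top += increment
    return top
```

Starting from a maximum priority of 5, ten skips at + 1000 give 10005, while ten skips at + 1 give 15.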
cabana8471	43c37c52fe	fix: Pydantic datetime serialization for API endpoints Problem: Several API endpoints return 500 Internal Server Error because datetime objects are not serializable by Pydantic. The error occurs when: - GET /agent/{project}/status - GET /devserver/{project}/status - GET /schedules/{project}/next Root cause: Pydantic models expect strings for Optional datetime fields, but the code was passing raw datetime objects. Solution: Convert datetime objects to ISO 8601 strings using .isoformat() before returning in Pydantic response models. Changes: - server/routers/agent.py: Fix started_at serialization - server/routers/devserver.py: Fix started_at serialization - server/routers/schedules.py: Fix next_start/next_end serialization Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-25 08:04:14 +01:00
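The fix described in this commit converts datetime values to ISO 8601 strings before they reach a response model that declares an optional string field. A minimal sketch, assuming a hypothetical helper named `to_iso`:

```python
from datetime import datetime, timezone
from typing import Optional


def to_iso(value: Optional[datetime]) -> Optional[str]:
    """Return an ISO 8601 string, or None when the timestamp is unset."""
    return value.isoformat() if value is not None else None


started_at = datetime(2026, 1, 25, 8, 4, 14, tzinfo=timezone.utc)
serialized = to_iso(started_at)  # "2026-01-25T08:04:14+00:00"
```

Passing `None` through unchanged keeps the field optional, so a process that has not started yet still serializes cleanly.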
Auto	751ab01438	style: fix import order in settings.py for ruff compliance Move mimetypes import to the top of the import block to satisfy ruff's import sorting rules (I001). The Windows mimetype fix from PR #82 placed the import after other imports, which violated the project's linting standards. Changes: - Move `import mimetypes` to alphabetically correct position - Update comment to clarify timing requirement Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-23 12:12:44 +02:00
Abigail Green	89e3b7af99	Update settings.py Adding in fix for windows issue	2026-01-22 14:13:52 -07:00
Auto	357083dbae	feat: decouple regression testing agents from coding agents Major refactoring of the parallel orchestrator to run regression testing agents independently from coding agents. This improves system reliability and provides better control over testing behavior. Key changes: Database & MCP Layer: - Add testing_in_progress and last_tested_at columns to Feature model - Add feature_claim_for_testing() for atomic test claim with retry - Add feature_release_testing() to release claims after testing - Refactor claim functions to iterative loops (no recursion) - Add OperationalError retry handling for transient DB errors - Reduce MAX_CLAIM_RETRIES from 10 to 5 Orchestrator: - Decouple testing agent lifecycle from coding agents - Add _maintain_testing_agents() for continuous testing maintenance - Fix TOCTOU race in _spawn_testing_agent() - hold lock during spawn - Add _cleanup_stale_testing_locks() with 30-min timeout - Fix log ordering - start_session() before stale flag cleanup - Add stale testing_in_progress cleanup on startup Dead Code Removal: - Remove count_testing_in_concurrency from entire stack (12+ files) - Remove ineffective with_for_update() from features router API & UI: - Pass testing_agent_ratio via CLI to orchestrator - Update testing prompt template to use new claim/release tools - Rename UI label to "Regression Agents" with clearer description - Add process_utils.py for cross-platform process tree management Testing agents now: - Run continuously as long as passing features exist - Can re-test features multiple times to catch regressions - Are controlled by fixed count (0-3) via testing_agent_ratio setting - Have atomic claiming to prevent concurrent testing of same feature Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 15:22:48 +02:00
Leon van Zyl	35ed14dfe3	Merge pull request #74 from lirielgozi/feature-conversation-history feature: add conversation history feature to AI assistant	2026-01-22 09:00:52 +02:00
Auto	0736b5ec6b	fix: address critical issues in PR #75 agent scheduling feature This commit fixes several issues identified in the agent scheduling feature from PR #75: Frontend Fixes: - Add day boundary handling in timeUtils.ts for timezone conversions - Add utcToLocalWithDayShift/localToUTCWithDayShift functions - Add shiftDaysForward/shiftDaysBackward helpers for bitfield adjustment - Update ScheduleModal to correctly adjust days_of_week when crossing day boundaries during UTC conversion (fixes schedules running on wrong days for users in extreme timezones like UTC+9) Backend Fixes: - Add MAX_SCHEDULES_PER_PROJECT (50) limit to prevent resource exhaustion - Wire up crash recovery callback in scheduler_service._start_agent() - Convert schedules.py endpoints to use context manager for DB sessions - Fix race condition in override creation with atomic delete-then-create - Replace deprecated datetime.utcnow with datetime.now(timezone.utc) - Add DB-level CHECK constraints for Schedule model fields Files Modified: - api/database.py: Add _utc_now helper, CheckConstraint imports, constraints - progress.py: Replace deprecated datetime.utcnow - server/routers/schedules.py: Add context manager, schedule limits - server/services/assistant_database.py: Replace deprecated datetime.utcnow - server/services/scheduler_service.py: Wire crash recovery, fix race condition - ui/src/components/ScheduleModal.tsx: Use day shift functions - ui/src/lib/timeUtils.ts: Add day boundary handling functions Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-22 08:35:57 +02:00
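The day-boundary handling described in this commit can be sketched as a 7-bit rotation plus a conversion that reports how many days the time moved. The real helpers live in TypeScript (`timeUtils.ts`); the Python names below are ports for illustration, and the bit layout (bit 0 as the first day of the week) and whole-hour offsets are assumptions.

```python
WEEK_MASK = 0b1111111  # seven bits, one per day of the week


def shift_days_forward(days: int) -> int:
    """Rotate the bitfield so every selected day moves to the next day."""
    return ((days << 1) | (days >> 6)) & WEEK_MASK


def shift_days_backward(days: int) -> int:
    """Rotate the bitfield so every selected day moves to the previous day."""
    return ((days >> 1) | (days << 6)) & WEEK_MASK


def local_to_utc_with_day_shift(hour: int, utc_offset_hours: int) -> tuple[int, int]:
    """Return (utc_hour, day_shift), where day_shift is -1, 0, or +1."""
    raw = hour - utc_offset_hours
    return raw % 24, raw // 24
```

For a user at UTC+9, 02:00 local is 17:00 UTC on the previous day, so the stored `days_of_week` bitfield has to be shifted backward by one day to keep the schedule on the intended local day.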
Marian Paul	a6fe2ef633	Review	2026-01-19 10:31:23 +01:00
Marian Paul	0bab585630	feat: add time-based agent scheduling with APScheduler Add comprehensive scheduling system that allows agents to automatically start and stop during configured time windows, helping users manage Claude API token limits by running agents during off-hours. Backend Changes: - Add Schedule and ScheduleOverride database models for persistent storage - Implement APScheduler-based SchedulerService with UTC timezone support - Add schedule CRUD API endpoints (/api/projects/{name}/schedules) - Add manual override tracking to prevent unwanted auto-start/stop - Integrate scheduler lifecycle with FastAPI startup/shutdown - Fix timezone bug: explicitly set timezone=timezone.utc on CronTrigger to ensure correct UTC scheduling (critical fix) Frontend Changes: - Add ScheduleModal component for creating and managing schedules - Add clock button and schedule status display to AgentControl - Add timezone utilities for converting between UTC and local time - Add React Query hooks for schedule data fetching - Fix 204 No Content handling in fetchJSON for delete operations - Invalidate nextRun cache when manually stopping agent during window - Add TypeScript type annotations to Terminal component callbacks Features: - Multiple overlapping schedules per project supported - Auto-start at scheduled time via APScheduler cron jobs - Auto-stop after configured duration - Manual start/stop creates persistent overrides in database - Crash recovery with exponential backoff (max 3 retries) - Server restart preserves schedules and active overrides - Times displayed in user's local timezone, stored as UTC - Immediate start if schedule created during active window Dependencies: - Add APScheduler for reliable cron-like scheduling Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-19 10:31:23 +01:00
Auto	13128361b0	feat: add dedicated testing agents and enhanced parallel orchestration Introduce a new testing agent architecture that runs regression tests independently from coding agents, improving quality assurance in parallel mode. Key changes: Testing Agent System: - Add testing_prompt.template.md for dedicated testing agent role - Add feature_mark_failing MCP tool for regression detection - Add --agent-type flag to select initializer/coding/testing mode - Remove regression testing from coding prompt (now handled by testing agents) Parallel Orchestrator Enhancements: - Add testing agent spawning with configurable ratio (--testing-agent-ratio) - Add comprehensive debug logging system (DebugLog class) - Improve database session management to prevent stale reads - Add engine.dispose() calls to refresh connections after subprocess commits - Fix f-string linting issues (remove unnecessary f-prefixes) UI Improvements: - Add testing agent mascot (Chip) to AgentAvatar - Enhance AgentCard to display testing agent status - Add testing agent ratio slider in SettingsModal - Update WebSocket handling for testing agent updates - Improve ActivityFeed to show testing agent activity API & Server Updates: - Add testing_agent_ratio to settings schema and endpoints - Update process manager to support testing agent type - Enhance WebSocket messages for agent_update events Template Changes: - Delete coding_prompt_yolo.template.md (consolidated into main prompt) - Update initializer_prompt.template.md with improved structure - Streamline coding_prompt.template.md workflow Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-18 13:49:50 +02:00
Auto	64b65311fe	chore: clean up unused imports and sort import blocks Remove unused imports and organize import statements to pass ruff linting checks: - mcp_server/feature_mcp.py: Remove unused imports (are_dependencies_satisfied, get_blocking_dependencies) and alphabetize import block - parallel_orchestrator.py: Remove unused imports (time, Awaitable) and add blank lines between import groups per PEP 8 - server/routers/features.py: Alphabetize imports in dependency resolver These changes were identified by running `ruff check .` and auto-fixed with `--fix` flag. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 15:05:25 +02:00
Auto	85f6940a54	feat: add concurrent agents with dependency system and delightful UI Major feature implementation for parallel agent execution with dependency-aware scheduling and an engaging multi-agent UI experience. Backend Changes: - Add parallel_orchestrator.py for concurrent feature processing - Add api/dependency_resolver.py with cycle detection (Kahn's algorithm + DFS) - Add atomic feature_claim_next() with retry limit and exponential backoff - Fix circular dependency check arguments in 4 locations - Add AgentTracker class for parsing agent output and emitting updates - Add browser isolation with --isolated flag for Playwright MCP - Extend WebSocket protocol with agent_update messages and log attribution - Add WSAgentUpdateMessage schema with agent states and mascot names - Fix WSProgressMessage to include in_progress field New UI Components: - AgentMissionControl: Dashboard showing active agents with collapsible activity - AgentCard: Individual agent status with avatar and thought bubble - AgentAvatar: SVG mascots (Spark, Fizz, Octo, Hoot, Buzz) with animations - ActivityFeed: Recent activity stream with stable keys (no flickering) - CelebrationOverlay: Confetti animation with click/Escape dismiss - DependencyGraph: Interactive node graph visualization with dagre layout - DependencyBadge: Visual indicator for feature dependencies - ViewToggle: Switch between Kanban and Graph views - KeyboardShortcutsHelp: Help overlay accessible via ? key UI/UX Improvements: - Celebration queue system to handle rapid success messages - Accessibility attributes on AgentAvatar (role, aria-label, aria-live) - Collapsible Recent Activity section with persisted preference - Agent count display in header - Keyboard shortcut G to toggle Kanban/Graph view - Real-time thought bubbles and state animations Bug Fixes: - Fix circular dependency validation (swapped source/target arguments) - Add MAX_CLAIM_RETRIES=10 to prevent stack overflow under contention - Fix THOUGHT_PATTERNS to match actual [Tool: name] format - Fix ActivityFeed key prop to prevent re-renders on new items - Add featureId/agentIndex to log messages for proper attribution Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-17 12:59:42 +02:00
liri	c229e2b39b	fix: address CodeRabbitAI review comments for conversation history - Fix duplicate onConversationCreated callbacks by tracking activeConversationId - Fix history loss when switching conversations with Map-based deduplication - Disable input while conversation is loading to prevent message routing issues - Gate WebSocket debug logs behind DEV flag (import.meta.env.DEV) - Downgrade server logging from info to debug level for reduced noise - Fix .gitignore prefixes for playwright paths (ui/playwright-report/, ui/test-results/) - Remove debug console.log from ConversationHistory.tsx - Add staleTime (30s) to single conversation query for better caching - Increase history message cap from 20 to 35 for better context - Replace fixed timeouts with condition-based waits in e2e tests	2026-01-16 22:43:15 +00:00
liri	7d761cb8d0	feat: add conversation history feature to AI assistant - Add ConversationHistory dropdown component with list of past conversations - Add useConversations hook for fetching and managing conversations via React Query - Implement conversation switching with proper state management - Fix bug where reopening panel showed new greeting instead of resuming conversation - Fix bug where selecting from history caused conversation ID to revert - Add server-side history context loading for resumed conversations - Add Playwright E2E tests for conversation history feature - Add logging for debugging conversation flow Key changes: - AssistantPanel: manages conversation state with localStorage persistence - AssistantChat: header with [+] New Chat and [History] buttons - Server: skips greeting for resumed conversations, loads history context on first message - Fixed race condition in onConversationCreated callback	2026-01-16 21:47:58 +00:00
Auto	91cc00a9d0	fix: add explicit in_progress=False to all feature creation paths Complete the defense-in-depth approach from PR #53 by adding explicit in_progress=False to all remaining feature creation locations. This ensures consistency with the MCP server pattern and prevents potential NULL values in the in_progress field. Changes: - server/routers/features.py: Add in_progress=False to create_feature() and create_features_bulk() endpoints - server/services/expand_chat_session.py: Add in_progress=False to _create_features_bulk() in the expand chat session - api/migration.py: Add in_progress field handling in JSON migration, reading from source data with False as default This follows up on PR #53 which added nullable=False constraints and fixed existing NULL values, but only updated the MCP server creation paths. Now all 6 feature creation locations explicitly set both passes=False and in_progress=False. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-15 15:14:24 +02:00
Leon van Zyl	ab51bb6089	Merge pull request #53 from Quenos/fix/null-boolean-fields-resilience fix: make boolean fields resilient to NULL values	2026-01-15 15:09:16 +02:00
Auto	d1b8eb5f99	feat: add feature editing capability for pending/in-progress features Add the ability for users to edit features that are not yet completed, allowing them to provide corrections or additional instructions when the agent is stuck or implementing a feature incorrectly. Backend changes: - Add FeatureUpdate schema in server/schemas.py with optional fields - Add PATCH /api/projects/{project_name}/features/{feature_id} endpoint - Validate that completed features (passes=True) cannot be edited Frontend changes: - Add FeatureUpdate type in ui/src/lib/types.ts - Add updateFeature() API function in ui/src/lib/api.ts - Add useUpdateFeature() React Query mutation hook - Create EditFeatureForm.tsx component with pre-filled form values - Update FeatureModal.tsx with Edit button for non-completed features The edit form allows modifying category, name, description, priority, and test steps. Save button is disabled until changes are detected. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-14 14:54:53 +02:00
Quenos	3c97051122	fix: make boolean fields resilient to NULL values Problem: Features with NULL values in passes/in_progress fields caused Pydantic validation errors in the API. Solution - defense in depth: 1. Database model: Add nullable=False to passes and in_progress columns 2. Migration: Auto-fix existing NULL values to False on database connect 3. API layer: Handle NULL gracefully in feature_to_response (treat as False) 4. MCP server: Explicitly set in_progress=False when creating features This ensures: - New databases cannot have NULL boolean fields - Existing databases are auto-migrated on connect - Even if NULL values exist, they're handled gracefully at runtime Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 11:47:46 +01:00
Auto	f31ea403ea	feat: add GLM/alternative API support via environment variables Add support for using alternative API endpoints (like Zhipu AI's GLM models) without affecting the user's global Claude Code settings. Configuration is done via AutoCoder's .env file. Changes: - Add API_ENV_VARS constant and pass through ClaudeAgentOptions.env parameter in client.py and all server service files (spec, expand, assistant sessions) - Add glm_mode to settings API response to indicate when GLM is configured - Add purple "GLM" badge in UI header when GLM mode is active - Update setup status to accept GLM credentials as valid authentication - Update .env.example with GLM configuration documentation - Update README.md with AutoCoder-scoped GLM setup instructions Supported environment variables: - ANTHROPIC_BASE_URL: Custom API endpoint (e.g., https://api.z.ai/api/anthropic) - ANTHROPIC_AUTH_TOKEN: API authentication token - API_TIMEOUT_MS: Request timeout in milliseconds - ANTHROPIC_DEFAULT_SONNET_MODEL: Model override for Sonnet - ANTHROPIC_DEFAULT_OPUS_MODEL: Model override for Opus - ANTHROPIC_DEFAULT_HAIKU_MODEL: Model override for Haiku This approach routes API requests through the alternative endpoint while keeping all Claude Code features (MCP servers, hooks, permissions) intact. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 12:25:13 +02:00
Auto	a7f8c3aa8d	feat: add multiple terminal tabs with rename capability Add support for multiple terminal instances per project with tabbed navigation in the debug panel. Each terminal maintains its own PTY session and WebSocket connection. Backend changes: - Add terminal metadata storage (id, name, created_at) per project - Update terminal_manager.py with create, list, rename, delete functions - Extend WebSocket endpoint to /api/terminal/ws/{project}/{terminal_id} - Add REST endpoints for terminal CRUD operations - Implement deferred PTY start with initial resize message Frontend changes: - Create TerminalTabs component with neobrutalism styling - Support double-click rename and right-click context menu - Fix terminal switching issues with transform-based hiding - Use isActiveRef to prevent stale closure bugs in connect() - Add double requestAnimationFrame for reliable activation timing - Implement proper dimension validation in fitTerminal() Other updates: - Add GLM model configuration documentation to README - Simplify client.py by removing CLI_COMMAND support - Update chat session services with consistent patterns Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 11:55:50 +02:00
Auto	c1985eb285	feat: add interactive terminal and dev server management Add new features for interactive terminal sessions and dev server control: Terminal Component: - New Terminal.tsx component using xterm.js for full terminal emulation - WebSocket-based PTY communication with bidirectional I/O - Cross-platform support (Windows via winpty, Unix via built-in pty) - Auto-reconnection with exponential backoff - Fix duplicate WebSocket connection bug by checking CONNECTING state - Add manual close flag to prevent auto-reconnect race conditions - Add project tracking to avoid duplicate connects on initial activation Dev Server Management: - New DevServerControl.tsx for starting/stopping dev servers - DevServerManager service for subprocess management - WebSocket streaming of dev server output - Project configuration service for reading package.json scripts Backend Infrastructure: - Terminal router with WebSocket endpoint for PTY I/O - DevServer router for server lifecycle management - Terminal session manager with callback-based output streaming - Enhanced WebSocket schemas for terminal and dev server messages UI Integration: - New Terminal and Dev Server tabs in the main application - Updated DebugLogViewer with improved UI and functionality - Extended useWebSocket hook for terminal message handling Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 10:35:36 +02:00

68 Commits