trading_bot_v4

Author	SHA1	Message	Date
mindesbunister	1f83a7d7c4	feat: Add coordinator log viewer to cluster UI - Created /api/cluster/logs endpoint to read coordinator.log - Added real-time log display in cluster UI (updates every 3s) - Shows last 100 lines of coordinator.log in terminal-style display - Includes manual refresh button - Improves debugging experience - no need to SSH for logs User feedback: 'why dont we add the output of the log at the bottom of the page so i know whats going on' This addresses poor visibility into coordinator errors and failures. Next step: Fix SSH timeout issue blocking worker execution.	2025-12-01 11:49:23 +01:00
mindesbunister	db33af9f17	fix: Stop button database reset + UI state display (DATABASE-FIRST ARCHITECTURE) CRITICAL FIXES: 1. Stop button now resets database FIRST (before pkill) - Database cleanup happens even if coordinator crashed - Prevents stale 'running' chunks blocking restart - Uses Node.js sqlite library (not CLI - Docker compatible) 2. UI enhancement - 4-state display - ⚡ Processing (running > 0) - ⏳ Pending (pending > 0, running = 0) - ✅ Complete (all completed) - ⏸️ Idle (no work queued) [NEW] - Shows pending chunk count when present TECHNICAL DETAILS: - Replaced sqlite3 CLI calls with proper Node.js API - Fixed permissions: chown 1001:1001 cluster/ for container write - Database-first logic: reset → pkill → verify - Detailed logging for each operation step FILES CHANGED: - app/api/cluster/control/route.ts (database operations refactored) - app/cluster/page.tsx (4-state UI display) VERIFIED: - Stop button successfully reset 3 'running' chunks → 'pending' - UI correctly shows Idle state after Stop - Container logs show detailed operation flow - Database operations work in Docker environment DEPLOYMENT: - Container rebuilt with fixed code - Tested with real stale database (3 running chunks) - All operations working correctly	2025-12-01 11:34:47 +01:00
mindesbunister	c343daeb44	docs: Document EPYC cluster SSH timeout fix in Common Pitfalls - Added Common Pitfall #64: SSH timeout for nested hop scenarios - Documented 30s→60s timeout increase rationale - Explained SSH options: StrictHostKeyChecking, ConnectTimeout, ServerAliveInterval - Included verification data: 23-24 processes per worker at 99% CPU - Provided formula for calculating minimum timeouts for multi-hop SSH - Cross-referenced commit `ef371a1` (the actual code fix) - Added future prevention guidance (timeout formulas, SSH multiplexing) This documentation update accompanies the cluster fix deployed earlier.	2025-12-01 09:46:17 +01:00
mindesbunister	ef371a19b9	fix: EPYC cluster SSH timeout - increase timeout 30s→60s + add SSH options CRITICAL FIX (Dec 1, 2025): Cluster start was failing with 'operation failed' Problem: - SSH commands timing out after 30s (too short for 2-hop SSH to worker2) - Missing SSH options caused prompts/delays - Result: Coordinator failed to start worker processes Solution: - Increased timeout from 30s to 60s for nested SSH hops - Added SSH options: -o StrictHostKeyChecking=no -o ConnectTimeout=10 - Applied options to both ssh_command() and worker startup commands Verification (Dec 1, 09:40): - Worker1: 23 processes running (chunk 0-2000) - Worker2: 24 processes running (chunk 2000-4000) - Cluster status: ACTIVE with 2 workers - Both chunks processing successfully Files changed: - cluster/distributed_coordinator.py (lines 302-314, 388-414)	2025-12-01 09:41:42 +01:00
mindesbunister	549fe8e077	docs: CRITICAL - Make documentation + git commit hand-in-hand #1 PRIORITY USER MANDATE (Dec 1, 2025): Documentation MUST go hand-in-hand with EVERY git commit. This is NOT optional. This is NOT a suggestion. This is MANDATORY. Changes: - Elevated documentation section to #1 PRIORITY status - Added user's direct quote: 'this HAS to go hand in hand' - Expanded from 15 lines to 100+ lines with comprehensive guidelines - Added 'Why This is #1 Priority' section with user's frustration quote - Added explicit 'When Documentation is MANDATORY' checklist - Added 'The Correct Mindset' section emphasizing it's part of the work - Added 4 scenario examples showing what MUST be documented - Added 'Red Flags' section to catch missing documentation - Added 'Integration with Existing Sections' guide - Made it crystal clear: Code without documentation = INCOMPLETE WORK This addresses user's repeated reminders about documentation being mandatory. Future AI agents will now see this as the #1 priority it is. NO MORE PUSHING CODE WITHOUT DOCUMENTATION UPDATES.	2025-12-01 09:17:51 +01:00
mindesbunister	b1a41733b8	docs: Document Dec 1 adaptive leverage UI enhancements - Updated adaptive leverage configuration section with current values (10x/5x) - Added Settings UI documentation with 5 configurable fields - Documented direction-specific thresholds (LONG/SHORT split) - Added dynamic collateral display implementation details - Documented new /api/drift/account-health endpoint - Added commit history for Dec 1 changes (`2e511ce`, `21c13b9`, `a294f44`, `67ef5b1`) - Updated API endpoints section with account-health route Changes reflect full UI implementation completed Dec 1, 2025: - Independent LONG (95) and SHORT (90) quality threshold controls - Real-time collateral fetching from Drift Protocol - Position size calculator with dynamic balance updates - Complete production-ready adaptive leverage system	2025-12-01 09:15:03 +01:00
mindesbunister	67ef5b1ac6	feat: Add direction-specific quality thresholds and dynamic collateral display - Split QUALITY_LEVERAGE_THRESHOLD into separate LONG and SHORT variants - Added /api/drift/account-health endpoint for real-time collateral data - Updated settings UI to show separate controls for LONG/SHORT thresholds - Position size calculations now use dynamic collateral from Drift account - Updated .env and docker-compose.yml with new environment variables - LONG threshold: 95, SHORT threshold: 90 (configurable independently) Files changed: - app/api/drift/account-health/route.ts (NEW) - Account health API endpoint - app/settings/page.tsx - Added collateral state, separate threshold inputs - app/api/settings/route.ts - GET/POST handlers for LONG/SHORT thresholds - .env - Added QUALITY_LEVERAGE_THRESHOLD_LONG/SHORT variables - docker-compose.yml - Added new env vars with fallback defaults Impact: - Users can now configure quality thresholds independently for LONG vs SHORT signals - Position size display dynamically updates based on actual Drift account collateral - More flexible risk management with direction-specific leverage tiers	2025-12-01 09:09:30 +01:00
mindesbunister	a294f44a06	fix: Add adaptive leverage env vars to docker-compose.yml Added 4 adaptive leverage environment variables to docker-compose.yml so they are properly passed to the container: - USE_ADAPTIVE_LEVERAGE (default: true) - HIGH_QUALITY_LEVERAGE (default: 5) - LOW_QUALITY_LEVERAGE (default: 1) - QUALITY_LEVERAGE_THRESHOLD (default: 95) Without these in the environment section, the container couldn't access them via process.env, causing the settings API to return null. Now the settings UI can properly load and save adaptive leverage configuration via the web interface.	2025-12-01 08:52:07 +01:00
mindesbunister	21c13b915a	feat: Add adaptive leverage controls to settings UI Complete implementation of adaptive leverage configuration via web interface: Frontend (app/settings/page.tsx): - Added 4 fields to TradingSettings interface: * USE_ADAPTIVE_LEVERAGE: boolean * HIGH_QUALITY_LEVERAGE: number * LOW_QUALITY_LEVERAGE: number * QUALITY_LEVERAGE_THRESHOLD: number - Added complete Adaptive Leverage section with: * Purple-themed informational box explaining quality-based leverage * Toggle switch for enabling/disabling (🎯 Enable Adaptive Leverage) * Number inputs for high leverage (1-20), low leverage (1-20), threshold (80-100) * Visual tier display showing leverage multipliers and position sizes * Dynamic calculation based on $560 free collateral Backend (app/api/settings/route.ts): - GET handler: Load 4 adaptive leverage fields from environment variables - POST handler: Save 4 adaptive leverage fields to .env file - Proper type conversion (boolean from 'true', numbers from parseInt/parseFloat) Visual Tier Display Example: Below Threshold: Blocked (no trade) Changes enable users to adjust leverage settings via web UI instead of manually editing .env file and restarting container.	2025-12-01 08:47:38 +01:00
mindesbunister	2e511ceddc	config: Update adaptive leverage to 10x high-quality, 5x low-quality User requirements (Dec 1, 2025): - Base leverage: 5x (SOLANA_LEVERAGE=5, unchanged) - High-quality signals (Q90+ SHORT, Q95+ LONG): 10x leverage - Low-quality signals (Q80-89 SHORT, Q90-94 LONG): 5x leverage Changes: - HIGH_QUALITY_LEVERAGE: 5 → 10 - LOW_QUALITY_LEVERAGE: 1 → 5 Expected behavior: - Regular signals: 5x leverage (60 × 5 = ,800 position) - High-quality signals: 10x leverage (60 × 10 = ,600 position) Container restarted and config active.	2025-12-01 08:39:09 +01:00
mindesbunister	203eedd33e	docs: Update cluster start button fix documentation with Dec 1 database cleanup solution	2025-12-01 08:29:37 +01:00
mindesbunister	5d07fbbd28	critical: Fix EPYC cluster start button - database cleanup before start Problem: - Start button showed 'already running' when cluster wasn't actually running - Database had stale chunks in 'running' state from crashed/killed coordinator - Control endpoint checked process but not database state Solution: 1. Reset stale 'running' chunks to 'pending' before starting coordinator 2. Verify coordinator not running before starting (prevent duplicates) 3. Add database cleanup to stop action as well (prevent future stale states) 4. Enhanced error reporting with coordinator log output Changes: - app/api/cluster/control/route.ts - Added database cleanup in start action (reset running chunks) - Added process check before start (prevent duplicates) - Added database cleanup in stop action (cleanup orphaned state) - Added coordinator log output on start failure - Improved error messages and logging Impact: - Start button now works correctly even after unclean coordinator shutdown - Prevents false 'already running' reports - Automatic cleanup of stale database state - Better error diagnostics Verified: - Container rebuilt and restarted successfully - Cluster status shows 'idle' after database cleanup - Ready for user to test start button functionality	2025-12-01 08:28:05 +01:00
mindesbunister	d4ecbcd168	docs: Add Smart Validation threshold optimization findings (n=200 backtest) - Backtested 200 random DATA_COLLECTION_ONLY signals - Validated initial n=11 finding at scale - CURRENT (±0.3%): +0.169% avg, 67.9% WR, 14% entry rate (WINNER) - OPTION 1 (±0.2%): -0.363% avg, 43.1% WR, 26% entry rate - OPTION 2 (±0.15%): -0.524% avg, 35.6% WR, 36% entry rate - Key insight: Lower thresholds catch more losers than winners - Decision: Keep current ±0.3% thresholds (statistically validated)	2025-12-01 00:42:58 +01:00
mindesbunister	9d2055e59c	docs: Add mandatory documentation workflow - git commit must go hand-in-hand with documentation	2025-12-01 00:12:28 +01:00
mindesbunister	56feef723b	docs: Add Smart Entry Validation System to Common Pitfall #63	2025-12-01 00:07:21 +01:00
mindesbunister	7367673e4d	feat: Complete Smart Entry Validation System with Telegram notifications Implementation: - Smart validation queue monitors quality 50-89 signals - Block & Watch strategy: queue → validate → enter if confirmed - Validation thresholds: LONG +0.3% confirms / -0.4% abandons - Validation thresholds: SHORT -0.3% confirms / +0.4% abandons - Monitoring: Every 30 seconds for 10 minute window - Auto-execution via API when price confirms direction Telegram Notifications: - ⏰ Queued: Alert when signal enters validation queue - ✅ Confirmed: Alert when price validates entry (with slippage) - ❌ Abandoned: Alert when price invalidates (saved from loser) - ⏱️ Expired: Alert when 10min window passes without confirmation - ✅ Executed: Alert when validated trade opens (with delay time) Files: - lib/trading/smart-validation-queue.ts (NEW - 460+ lines) - lib/notifications/telegram.ts (added sendValidationNotification) - app/api/trading/check-risk/route.ts (await async addSignal) Integration: - check-risk endpoint already queues signals (lines 433-452) - Startup initialization already exists - Market data cache provides 1-min price updates Expected Impact: - Recover 77% of moves from quality 50-89 false negatives - Example: +1.79% move → entry at +0.41% → capture +1.38% - Protect from weak signals that fail validation - User visibility into validation activity via Telegram Status: READY FOR DEPLOYMENT	2025-11-30 23:48:36 +01:00
mindesbunister	e6cd6c836d	feat: Smart Entry Validation System - COMPLETE - Created lib/trading/smart-validation-queue.ts (270 lines) - Queue marginal quality signals (50-89) for validation - Monitor 1-minute price action for 10 minutes - Enter if +0.3% confirms direction (LONG up, SHORT down) - Abandon if -0.4% invalidates direction - Auto-execute via /api/trading/execute when confirmed - Integrated into check-risk endpoint (queues blocked signals) - Integrated into startup initialization (boots with container) - Expected: Catch ~30% of blocked winners, filter ~70% of losers - Estimated profit recovery: +$1,823/month Files changed: - lib/trading/smart-validation-queue.ts (NEW - 270 lines) - app/api/trading/check-risk/route.ts (import + queue call) - lib/startup/init-position-manager.ts (import + startup call) User approval: 'sounds like we can not loose anymore with this system. go for it'	2025-11-30 23:37:31 +01:00
mindesbunister	78757d2111	critical: Fix FALSE TP1 detection - add price verification (Pitfall #63 ) CRITICAL BUG FIXED (Nov 30, 2025): Position Manager was setting tp1Hit=true based ONLY on size mismatch, without verifying price actually reached TP1 target. This caused: - Premature order cancellation (on-chain TP1 removed before fill) - Lost profit potential (optimal exits missed) - Ghost orders after container restarts ROOT CAUSE (line 1086 in position-manager.ts): trade.tp1Hit = true // Set without checking this.shouldTakeProfit1() FIX IMPLEMENTED: - Added price verification: this.shouldTakeProfit1(currentPrice, trade) - Only set tp1Hit when BOTH conditions met: 1. Size reduced by 5%+ (positionSizeUSD < trade.currentSize * 0.95) 2. Price crossed TP1 target (this.shouldTakeProfit1 returns true) - Verbose logging for debugging (shows price vs target, size ratio) - Fallback: Update tracked size but don't trigger TP1 logic REAL INCIDENT: - Trade cmim4ggkr00canv07pgve2to9 (SHORT SOL-PERP Nov 30) - TP1 target: $137.07, actual exit: $136.84 - False detection triggered premature order cancellation - Position closed successfully but system integrity compromised FILES CHANGED: - lib/trading/position-manager.ts (lines 1082-1111) - CRITICAL_TP1_FALSE_DETECTION_BUG.md (comprehensive incident report) TESTING REQUIRED: - Monitor next trade with TP1 for correct detection - Verify logs show TP1 VERIFIED or TP1 price NOT reached - Confirm no premature order cancellation ALSO FIXED: - Restarted telegram-trade-bot to fix /status command conflict See: Common Pitfall #63 in copilot-instructions.md (to be added)	2025-11-30 23:08:34 +01:00
mindesbunister	887ae3b924	docs: Add comprehensive cluster status detection to copilot instructions - Document database-first architecture pattern - Include problem, root cause, and solution details - Add verification methodology with before/after examples - Document cluster control system (Start/Stop buttons) - Include database schema and operational state - Add lessons learned about infrastructure vs business logic - Reference STATUS_DETECTION_FIX_COMPLETE.md for full details - Current state: 2 workers active, processing 4000 combinations	2025-11-30 22:38:06 +01:00
mindesbunister	c5a8f5e32d	docs: Add comprehensive status detection fix documentation	2025-11-30 22:27:08 +01:00
mindesbunister	cc56b72df2	fix: Database-first cluster status detection + Stop button clarification CRITICAL FIX (Nov 30, 2025): - Dashboard showed 'idle' despite 22+ worker processes running - Root cause: SSH-based worker detection timing out - Solution: Check database for running chunks FIRST Changes: 1. app/api/cluster/status/route.ts: - Query exploration database before SSH detection - If running chunks exist, mark workers 'active' even if SSH fails - Override worker status: 'offline' → 'active' when chunks running - Log: '✅ Cluster status: ACTIVE (database shows running chunks)' - Database is source of truth, SSH only for supplementary metrics 2. app/cluster/page.tsx: - Stop button ALREADY EXISTS (conditionally shown) - Shows Start when status='idle', Stop when status='active' - No code changes needed - fixed by status detection Result: - Dashboard now shows 'ACTIVE' with 2 workers (correct) - Workers show 'active' status (was 'offline') - Stop button automatically visible when cluster active - System resilient to SSH timeouts/network issues Verified: - Container restarted: Nov 30 21:18 UTC - API tested: Returns status='active', activeWorkers=2 - Logs confirm: Database-first logic working - Workers confirmed running: 22+ processes on worker1, workers on worker2	2025-11-30 22:23:01 +01:00
mindesbunister	83b4915d98	fix: Reduce coordinator chunk_size from 10k to 2k for small explorations - Changed default chunk_size from 10,000 to 2,000 - Fixes bug where coordinator exited immediately for 4,096 combo exploration - Coordinator was calculating: chunk 1 starts at 10,000 > 4,096 total = 'all done' - Now creates 2-3 appropriately-sized chunks for distribution - Verified: Workers now start and process assigned chunks - Status: ✅ Docker rebuilt and deployed to port 3001	2025-11-30 22:07:59 +01:00
mindesbunister	8a3141e793	feat: Add cluster page navigation - Add EPYC Cluster card to landing page (first position, purple/pink gradient) - Add back button to cluster page (animated left arrow, links to dashboard) - Update landing page grid layout (lg:grid-cols-3 xl:grid-cols-4 for 7 cards) - Complete bidirectional navigation: dashboard ↔ cluster monitoring Navigation features: - Cluster card: 🖥️ icon, "Monitor distributed parameter exploration" description - Back button: Animated hover effect (arrow slides left, color transitions) - Responsive grid: 2 cols (mobile), 3 cols (tablet), 4 cols (desktop) - Consistent styling with existing navigation cards	2025-11-30 13:18:03 +01:00
mindesbunister	b77282b560	feat: Add EPYC cluster distributed sweep with web UI New Features: - Distributed coordinator orchestrates 2x AMD EPYC 16-core servers - 64 total cores processing 12M parameter combinations (70% CPU limit) - Worker1 (pve-nu-monitor01): Direct SSH access at 10.10.254.106 - Worker2 (bd-host01): 2-hop SSH through worker1 (10.20.254.100) - Web UI at /cluster shows real-time status and AI recommendations - API endpoint /api/cluster/status serves cluster metrics - Auto-refresh every 30s with top strategies and actionable insights Files Added: - cluster/distributed_coordinator.py (510 lines) - Main orchestrator - cluster/distributed_worker.py (271 lines) - Worker1 script - cluster/distributed_worker_bd_clean.py (275 lines) - Worker2 script - cluster/monitor_bd_host01.sh - Monitoring script - app/api/cluster/status/route.ts (274 lines) - API endpoint - app/cluster/page.tsx (258 lines) - Web UI - cluster/CLUSTER_SETUP.md - Complete setup and access documentation Technical Details: - SQLite database tracks chunk assignments - 10,000 combinations per chunk (1,195 total chunks) - Multiprocessing.Pool with 70% CPU limit (22 cores per EPYC) - SSH/SCP for deployment and result collection - Handles 2-hop SSH for bd-host01 access - Results in CSV format with top strategies ranked Access Documentation: - Worker1: ssh root@10.10.254.106 - Worker2: ssh root@10.10.254.106 "ssh root@10.20.254.100" - Web UI: http://localhost:3001/cluster - See CLUSTER_SETUP.md for complete guide Status: Deployed and operational	2025-11-30 13:02:18 +01:00
mindesbunister	2a8e04fe57	feat: Continuous optimization cluster for 2 EPYC servers - Master controller with job queue and result aggregation - Worker scripts for parallel backtesting (22 workers per server) - SQLite database for strategy ranking and performance tracking - File-based job queue (simple, robust, survives crashes) - Auto-setup script for both EPYC servers - Status dashboard for monitoring progress - Comprehensive deployment guide Architecture: - Master: Job generation, worker coordination, result collection - Worker 1 (pve-nu-monitor01): AMD EPYC 7282, 22 parallel jobs - Worker 2 (srv-bd-host01): AMD EPYC 7302, 22 parallel jobs - Total capacity: ~49,000 backtests/day (44 cores @ 70%) Initial focus: v9 parameter refinement (27 configurations) Target: Find strategies >00/1k P&L (current baseline 92/1k) Files: - cluster/master.py: Main controller (570 lines) - cluster/worker.py: Worker execution script (220 lines) - cluster/setup_cluster.sh: Automated deployment - cluster/status.py: Real-time status dashboard - cluster/README.md: Operational documentation - cluster/DEPLOYMENT.md: Step-by-step deployment guide	2025-11-29 22:34:52 +01:00
mindesbunister	2d14f2d5c5	docs: Complete v9 parameter optimization & backtesting documentation - v10 removal background (Nov 28, 2025) - v9 baseline performance (05.88, 569 trades, 60.98% WR) - Adaptive leverage implementation (5x high quality, 1x borderline) - Parameter sweep strategy (8 parameters, 65,536 combinations) - EPYC exhaustive sweep status (24 workers, ~17h remaining) - Backtesting infrastructure details - Expected outcomes and analysis plan - Key lessons learned from v10 failure	2025-11-29 00:04:48 +01:00
mindesbunister	5f7702469e	remove: V10 momentum system - backtest proved it adds no value - Removed v10 TradingView indicator (moneyline_v10_momentum_dots.pinescript) - Removed v10 penalty system from signal-quality.ts (-30/-25 point penalties) - Removed backtest result files (sweep_*.csv) - Updated copilot-instructions.md to remove v10 references - Simplified direction-specific quality thresholds (LONG 90+, SHORT 80+) Rationale: - 1,944 parameter combinations tested in backtest - All top results IDENTICAL (568 trades, $498 P&L, 61.09% WR) - Momentum parameters had ZERO impact on trade selection - Profit factor 1.027 too low (barely profitable after fees) - Max drawdown -$1,270 vs +$498 profit = terrible risk-reward - v10 penalties were blocking good trades (bug: applied to wrong positions) Keeping v9 as production system - simpler, proven, effective.	2025-11-28 22:35:32 +01:00
mindesbunister	4fb6a45fab	docs: Update SHORT threshold to 80 with v10 penalty system explanation - SHORT threshold now 80 (works WITH v10 penalties, not standalone) - v10 applies -30 to -55 point penalties for weak setups (ADX < 23, mid-range) - Documented penalty calculation examples (bad/trap/good setups) - Removed outdated Nov 23 data analysis (pre-v10 system) - Added RSI filter evolution context (removed because RSI 50+ = best 68.2% WR) - Updated quality threshold references throughout docs (95 → 80 for SHORTs)	2025-11-28 00:39:43 +01:00
mindesbunister	3cd292d90d	docs: Add Common Pitfall #62 - Missing quality threshold validation - Bug: Execute endpoint calculated quality but never validated it - Three trades executed at quality 30/50/50 (threshold: 90/95) - All three stopped out, confirming low quality = losing trades - Root cause: TradingView sent incomplete data (metrics=0, old v5) + missing validation after timeframe check - Fix: Added validation block lines 193-213 in execute/route.ts - Returns HTTP 400 if quality < minQualityScore - Deployed: Nov 27, 2025 23:16 UTC (commit `cefa3e6`) - Lesson: Calculate ≠ Validate - minQualityScore must be enforced at ALL execution pathways This documents the CRITICAL FIX from commit `cefa3e6`. Per Nov 27 mandatory documentation rules, work is INCOMPLETE without copilot-instructions.md updates.	2025-11-27 23:28:26 +01:00
mindesbunister	cefa3e646d	critical: MANDATORY quality score check in execute endpoint ROOT CAUSE: - Execute endpoint calculated quality score but NEVER checked it - After timeframe='5' validation, proceeded directly to execution - TradingView sent signal with all metrics=0 (ADX, ATR, RSI, etc.) - Quality scored as 30, but no threshold check existed - Position opened with 909.77 size at quality 30 (need 90+ for LONG) THE FIX: - Added MANDATORY quality check after timeframe validation - Blocks execution if score < minQualityScore (90 LONG, 95 SHORT) - Returns HTTP 400 with detailed error message - Logs Quality check passed OR ❌ QUALITY TOO LOW: AFFECTED TRADES: - cmihwkjmb0088m407lqd8mmbb: Quality 30 LONG (stopped out) - cmih6ghn20002ql07zxfvna1l: Quality 50 LONG (stopped out) - cmih5vrpu0001ql076mj3nm63: Quality 50 LONG (stopped out) This is a FINANCIAL SAFETY critical fix - prevents low-quality trades.	2025-11-27 23:17:29 +01:00
mindesbunister	2749c08d15	docs: MANDATORY copilot-instructions.md updates + 1-min data direction field CRITICAL: Added iron-clad rule that copilot-instructions.md MUST be updated for every significant change. User is 'sick and tired' of reminding. New mandatory section explains: - When to update this file (8 specific scenarios) - Why it's the primary knowledge base for future developers - Automatic workflow: Change → Code → Test → Update Docs → Commit 1-Minute Data Collection documented: - Direction field is meaningless (TradingView artifact) - Analysis should ignore direction for timeframe='1' - Focus on ADX/ATR/RSI/volume/price position metrics - Example correct vs wrong SQL queries This is NON-NEGOTIABLE going forward.	2025-11-27 19:32:22 +01:00
mindesbunister	8310a5c42b	docs: Clarify 1-minute signal direction field is meaningless - Direction field populated due to TradingView alert syntax requirement - NOT trading signals, pure market data collection - Analysis should ignore direction, focus on metrics	2025-11-27 19:30:22 +01:00
mindesbunister	b13a0f1b6b	docs: Update copilot-instructions.md with Phase 7.3 Adaptive Trailing Stop - Position Manager section: Complete Phase 7.3 documentation with real-time ADX queries - Documented adaptive multiplier logic: acceleration bonus, deceleration penalty, combined 3.16× max - Added example calculation showing 2.15× wider trail vs old static system - When Making Changes section: Added Phase 7.3 verification steps and log monitoring - Trailing stop changes: Updated with new adaptive system details and testing procedures - References: PHASE_7.3_ADAPTIVE_TRAILING_DEPLOYED.md and 1MIN_DATA_ENHANCEMENTS_ROADMAP.md	2025-11-27 17:02:59 +01:00
mindesbunister	e3d98a3f5b	docs: Phase 7.3 deployment summary and verification checklist	2025-11-27 16:55:42 +01:00
mindesbunister	130e9328d8	feat: Phase 7.3 - 1-Minute Adaptive TP/SL (DEPLOYED Nov 27, 2025) - Query fresh 1-minute ADX from market cache every monitoring loop - Dynamically adjust trailing stop based on trend strength changes - Acceleration bonus: ADX increased >5 points = 1.3× wider trail - Deceleration penalty: ADX decreased >3 points = 0.7× tighter trail - Combined with existing ADX strength tiers and profit acceleration - Expected impact: +,000-3,000 over 100 trades by capturing accelerating trends - Directly addresses MA crossover pattern (ADX 22.5→29.5 in 35 minutes) - Files: lib/trading/position-manager.ts (adaptive logic), 1MIN_DATA_ENHANCEMENTS_ROADMAP.md (Phase 7.3 complete)	2025-11-27 16:40:02 +01:00
mindesbunister	56e9522740	docs: Add MA crossover detection to copilot instructions	2025-11-27 16:31:05 +01:00
mindesbunister	ad3eb9841f	docs: Add MA crossover detection implementation summary	2025-11-27 16:25:49 +01:00
mindesbunister	633d204b66	feat: Add MA crossover detection to n8n workflow - Updated parse_signal_enhanced.json to detect 'crossing' keyword - Added three new flags: isMACrossover, isDeathCross, isGoldenCross - Death cross = MA50 crossing below MA200 (short direction) - Golden cross = MA50 crossing above MA200 (long direction) - Enables automated data collection for MA crossover pattern validation - Documented in INDICATOR_V9_MA_GAP_ROADMAP.md validation strategy - User configured TradingView alert to send crossover events - Goal: Collect 5-10 examples to validate ADX weak→strong pattern	2025-11-27 16:24:53 +01:00
mindesbunister	d1ffa077c9	docs: Correct indicator version v8→v9 in MA cross documentation USER CORRECTION: System currently running v9, not v8 Changes: - Updated MA cross ADX pattern finding to reference v9 - Noted v9 already includes MA Gap Analysis (deployed Nov 26) - Clarified v9 system status and current capabilities - Updated historical Nov 25 incident as "Pre-v9" context - This finding VALIDATES v9's early detection design Key Points: - ADX strengthens during cross (22.5 → 29.5) - Current v9 SHORT filter (ADX ≥23) would pass at crossover - 1-minute monitoring proves the approach works Status: v9 PRODUCTION (Nov 26+), MA Gap already deployed	2025-11-27 16:20:07 +01:00
mindesbunister	f318b5161c	docs: Add critical MA cross ADX pattern discovery (Nov 27, 2025) JACKPOT FINDING: v8 signals arrive 35 minutes BEFORE actual crossover! Timeline Evidence (Nov 27 Death Cross): - 10:30 Berlin: Signal fires with ADX 22.5 (weak, blocked) - ADX progression: 22.5 → 28.9 → 29.2 → 29.5 → 29.8 (35-minute climb) Pattern Discovered: - Early signals have weak ADX (below threshold) - ADX strengthens DURING the crossover event - 1-minute data captures this progression perfectly - Smart Entry Timer + Phase 7.2 validation can catch strengthening Validation Plan: - Collect 5-10 more MA cross examples - Verify pattern consistency (weak → strong ADX during cross) - If consistent: Adjust quality scoring or timing logic Impact: - Proves v8 indicator timing is CORRECT (early detection) - Explains why quality filter blocks these signals - 1-minute monitoring validates the approach works Files Updated: - INDICATOR_V9_MA_GAP_ROADMAP.md (added Critical Finding section)	2025-11-27 16:18:40 +01:00
mindesbunister	2cf3deb551	docs: CRITICAL - Fix roadmap Phase 7.1/7.2 status + add MANDATORY update requirements PROBLEM DISCOVERED (Nov 27, 2025): User: "whats next on our roadmap with the biggest impact?" Agent: "Phase 3 Smart Entry Timing - NOT STARTED" User: "i thought that was already implemented?" ← USER WAS RIGHT! Reality: Phase 3 = Phase 7.1 (smart-entry-timer.ts, 718 lines, DEPLOYED) ROOT CAUSE: - Roadmap said "Phase 3: NOT STARTED" - Code reality: Phase 7.1 Smart Entry Timer fully operational - Phase 7.2 Signal Quality Validation also deployed Nov 27 - Documentation completely out of sync with deployed code IMPACT: - User confusion justified (roadmap misleading) - Wasted time investigating "next feature" already deployed - Agent suggested implementing feature that exists - Phase numbering confusion (Phase 2/3/7.1/7.2 mixed) FIXES APPLIED: 1. 1MIN_DATA_ENHANCEMENTS_ROADMAP.md: - Phase 7.1: Smart Entry Timer → ✅ COMPLETE (DEPLOYED) - Added complete documentation of deployed system - Explained timeout protection (NO MISSED TRADES) - Showed ENV config (SMART_ENTRY_ENABLED=true) - Phase 7.2: Already marked complete (Nov 27) 2. .github/copilot-instructions.md: - NEW SECTION: "📋 MANDATORY: ROADMAP MAINTENANCE" - 6 iron-clad rules for roadmap updates - Update immediately after deployment (same session) - Verify roadmap accuracy BEFORE recommending features - Never suggest features based only on roadmap status - ALWAYS grep codebase to verify implementation - Checklist for roadmap updates - Before recommending "next feature" verification steps MANDATORY WORKFLOW (ALL FUTURE AGENTS): . Complete feature/phase . Update roadmap status → COMPLETE with date . Document actual impact (after data collection) . Consolidate phase numbering if inconsistencies . Commit roadmap changes SAME SESSION . VERIFY feature exists before recommending LESSON LEARNED: Roadmap documentation is CRITICAL in real money system. Wrong roadmap = wrong priorities = wasted time = lost profits. User time is valuable - outdated docs waste it. FILES CHANGED: - 1MIN_DATA_ENHANCEMENTS_ROADMAP.md (Phase 7.1 marked COMPLETE) - .github/copilot-instructions.md (MANDATORY section added) STATUS: Documentation now reflects deployed reality ✅	2025-11-27 14:26:15 +01:00
mindesbunister	a676eb4753	docs: Add zero-downtime changes guide - stop unnecessary rebuilds PROBLEM: Rebuilding container 4-6 times per session when most changes don't need it - Every rebuild: 40-70 seconds downtime - Recent session: 200 seconds downtime that could've been 50 seconds - Rebuilding for documentation (should be git only) - Rebuilding for n8n workflows (should be manual import) - Rebuilding for ENV changes (should be restart only) SOLUTION: Created comprehensive guide on what actually needs rebuilds ZERO DOWNTIME (just commit): - Documentation (.md files) - Workflows (.json, .pinescript) - Hot-reload endpoints (roadmap reload) RESTART ONLY (5-10 seconds): - ENV variable changes (.env) - Database schema (prisma migrate + generate) REBUILD REQUIRED (40-70 seconds): - Code changes (.ts, .tsx, .js) - Dependencies (package.json) - Dockerfile changes SMART BATCHING: - Group multiple code changes into ONE rebuild - Example: 6 fixes → 1 rebuild = 50s total (not 6× rebuilds = 300s) CREATED FILES: - docs/ZERO_DOWNTIME_CHANGES.md (comprehensive guide with examples) - Updated copilot-instructions.md (quick decision matrix) EXPECTED IMPACT: - 60-80% reduction in rebuild frequency - 60-80% reduction in downtime per session - Better workflow: batch changes, test together, deploy once User was right: We were rebuilding WAY too often unnecessarily ✅	2025-11-27 14:08:42 +01:00
mindesbunister	49f19b1a8c	feat: Phase 7.2 Real-Time Quality Validation COMPLETE + Hot-Reload Roadmap PHASE 7.2 COMPLETE (Nov 27, 2025): 4 validation checks before Smart Entry execution ADX degradation check (drops >2 points = cancel) Volume collapse check (drops >40% = cancel) RSI reversal detection (LONG RSI <30 or SHORT RSI >70 = cancel) MAGAP divergence check (wrong MA structure = cancel) Integrated with Smart Entry Timer (waits 2-4 min pullback) Detailed logging shows validation results EXPECTED IMPACT: - Block 5-10% of degraded signals during wait period - Save $300-800 in prevented losses over 100 trades - Prevent entries when ADX/volume/momentum weakens FILES CHANGED: - app/api/roadmap/route.ts (marked Phase 7.2 complete) - 1MIN_DATA_ENHANCEMENTS_ROADMAP.md (updated Phase 2 → Phase 7.2 complete) HOT-RELOAD SOLUTION (Zero Downtime Updates): Created /api/roadmap/reload endpoint POST to reload roadmap without container restart Roadmap page has Reload button with status messages No more unnecessary downtime for documentation updates! USAGE: - Web UI: Click Reload button on roadmap page - API: curl -X POST http://localhost:3001/api/roadmap/reload - Updates live instantly without rebuild/redeploy User request: "update the roadmap and documentation. also try to find a way to update the roadmap website without having to restart/rebuild/redeploy the whole container. thats unnessary downtime" All complete ✅	2025-11-27 14:00:46 +01:00
mindesbunister	53c8c59c25	feat: Phase 7.2 Real-Time Quality Validation FEATURE: Validate signal quality before Smart Entry execution - Re-checks market conditions after pullback wait (2-4 min) - Cancels trade if conditions degraded significantly VALIDATION CHECKS (4): 1. ADX degradation: Cancel if drops >2 points (enhanced existing) 2. Volume collapse: Cancel if drops >40% (NEW - momentum fading) 3. RSI reversal: Cancel if LONG RSI <30 or SHORT RSI >70 (NEW) 4. MAGAP divergence: Cancel if wrong MA structure (NEW) EXPECTED IMPACT: - Block 5-10% of signals that degrade during Smart Entry wait - Save $300-800 in prevented losses over 100 trades - Prevent entries when ADX/volume/momentum weakens FILES CHANGED: - lib/trading/smart-entry-timer.ts (115 lines validation logic) - lib/trading/market-data-cache.ts (added maGap to interface) INTEGRATION: Works with Phase 7.1 Smart Entry Timer - Smart Entry waits for pullback (2-4 min) - Phase 7.2 validates quality before execution - Cancels if conditions degraded, executes if maintained	2025-11-27 13:53:53 +01:00
mindesbunister	f420d98d55	critical: Make health monitor 3-4x more aggressive to prevent heap crashes PROBLEM (Nov 27, 2025 - 11:53 UTC): - accountUnsubscribe errors accumulated 200+ times in 2 seconds - JavaScript heap out of memory crash BEFORE health monitor could trigger - Old settings: 50 errors / 30s window / check every 10s = too slow - Container crashed from memory exhaustion, not clean restart SOLUTION - 3-4x FASTER RESPONSE: - Error window: 30s → 10s (3× faster detection) - Error threshold: 50 → 20 errors (2.5× more sensitive) - Check frequency: 10s → 3s intervals (3× more frequent) IMPACT: - Before: 10-40 seconds to trigger restart - After: 3-13 seconds to trigger restart (3-4× faster) - Catches rapid error accumulation BEFORE heap exhaustion - Clean restart instead of crash-and-recover REAL INCIDENT TIMELINE: 11:53:43 - Errors start accumulating 11:53:45.606 - FATAL: heap out of memory (2.2 seconds) 11:53:47.803 - Docker restart (not health monitor) NEW BEHAVIOR: - 20 errors in 10s = trigger at ~100ms/error rate - 3s check interval catches problem in 3-13s MAX - Clean restart before memory leak causes crash Files Changed: - lib/monitoring/drift-health-monitor.ts (lines 13-14, 32)	2025-11-27 13:04:14 +01:00
mindesbunister	d602744938	critical: Fix 1-minute signal price logging - query Drift oracle directly Problem: - 1-minute signals logged wrong prices (4-55 vs actual 41-144 SOL) - Pyth price cache empty for 1-minute signals - Fallback to body.signalPrice (from n8n) contained pricePosition value - signalPrice === pricePosition in every request (39.29 = 39.29) Failed Attempts (5): 1. Added @ close to TradingView indicator (commit `212a36f`) 2. Added signalPrice regex to n8n (commit `99a5223`) 3. Fixed regex to avoid POS collision (commit `ff402ed`) 4. Added DEBUG logging to n8n (no output visible) 5. FINAL FIX: Query Drift oracle directly Solution: - Bypass Pyth cache and n8n signalPrice entirely - Query Drift Protocol oracle directly for real-time price - Lines 145-149 in execute/route.ts: * OLD: latestPrice?.price \|\| body.signalPrice \|\| 0 * NEW: await driftService.getOraclePrice(marketConfig.driftMarketIndex) - Drift oracle is source of truth (on-chain), no caching issues - Added getMarketConfig import to resolve marketConfig Impact: - Logs will show actual SOL price ($141.XX) instead of indicator values ($24-55) - Accurate price logging critical for REAL MONEY system P&L tracking - BlockedSignal database records will have correct signalPrice values Files Changed: - app/api/trading/execute/route.ts (lines 11, 145-149) * Added getMarketConfig to imports * Replaced Pyth cache fallback with direct Drift oracle query	2025-11-27 12:48:37 +01:00
mindesbunister	ff402ed4d2	critical: Fix n8n signalPrice regex to avoid pricePosition collision PROBLEM: n8n extracting pricePosition (25.19) as signalPrice instead of close price (142.08) - Request body showed: signalPrice: 25.1908396947 (IDENTICAL to pricePosition) - Pyth oracle confirmed actual SOL price: $141.796 - TradingView sending correct format: "buy 1 @ 142.08 \| ATR:... \| POS:25.19" ROOT CAUSE: Old regex /@\s([\d.]+)/ too loose, matched first number after @ - Could match POS:25.19 if @ somehow associated with it FIX: Changed to /@\s([\d.]+)\s*\\|/ - Now REQUIRES pipe after price: "@ 142.08 \|" - Cannot match POS:25.19 (no @ before POS) - More specific pattern prevents collision VERIFICATION: - User must re-import updated parse_signal_enhanced.json into n8n - Next signal should show $141.XX not $25.XX in logs - Request body signalPrice should match Pyth price, not pricePosition	2025-11-27 12:27:52 +01:00
mindesbunister	99a5223ec6	fix: Add signal price parsing to n8n workflow PROBLEM: - Bot logs showing wrong prices ($30-43 vs actual $141-144) - TradingView sending correct format: 'buy 1 @ 142.08' - n8n Parse Signal Enhanced wasn't extracting @ price field ROOT CAUSE: - n8n workflow parsed ATR, ADX, RSI, VOL, POS, MAGAP, IND - But @ price field was never extracted - Bot fell back to undefined → used RSI value instead SOLUTION: - Added signalPrice extraction: /@\s*([\d.]+)/ - Returns signalPrice field in n8n output - Bot receives correct price in body.signalPrice IMPACT: - Logs will show correct SOL price ($141-144) - Database signalPrice field accurate - BlockedSignalTracker can calculate correct P&L FILES CHANGED: - workflows/trading/parse_signal_enhanced.json NEXT STEP: User must import updated workflow into n8n Then 1-minute signals will log correct prices ✅	2025-11-27 12:18:53 +01:00
mindesbunister	212a36fef3	fix: Add close price to 1-minute data feed webhook PROBLEM: - Logs showing wrong prices: $30-43 when SOL actually at $141-144 - Webhook message missing close price field - Bot falling back to RSI/ATR values (30-40 range) ROOT CAUSE: - TradingView indicator sending: 'SOLUSDT buy 1 \| ATR:X \| ADX:Y...' - No @ price field in message - n8n couldn't extract signalPrice, bot used wrong fallback SOLUTION: - Added close price to webhook format - New format: 'SOLUSDT buy 1 @ 143.50 \| ATR:X \| ADX:Y...' - Matches main trading signal format (v9 uses same pattern) IMPACT: - Logs will now show correct SOL price ($141-144) - Database signalPrice field accurate - BlockedSignalTracker can calculate correct P&L movements FILES CHANGED: - workflows/trading/moneyline_1min_data_feed.pinescript User deployed updated indicator to TradingView ✅ Next 1-minute alert will show correct price	2025-11-27 12:14:38 +01:00
mindesbunister	85581a670a	fix: Skip frequency checks for data collection signals PROBLEM: - 1-minute data collection signals were getting blocked - Overtrading penalty: '30 signals in 30min (-20 pts)' - Flip-flop penalty: 'opposite direction 1min ago (-25 pts)' - These penalties don't make sense for data collection ROOT CAUSE: - Quality scoring runs for ALL timeframes (needed for analysis) - But frequency checks (overtrading/flip-flop) only apply to production (5min) - Data collection signals (1min, 15min, 1H, etc.) shouldn't be penalized SOLUTION: - Added skipFrequencyCheck parameter to scoreSignalQuality() - Set to true for all non-5min timeframes: skipFrequencyCheck: timeframe !== '5' - Moved timeframe variable declaration earlier for reuse - 1-minute signals now score purely on technical merit (ADX/ATR/RSI/etc.) IMPACT: - 1-minute data collection works correctly - No false 'overtrading' blocks every minute - Quality scores still calculated for cross-timeframe analysis - Production 5min signals still have full frequency validation FILES CHANGED: - app/api/trading/execute/route.ts (quality scoring call) DEPLOYED: Nov 27, 2025 (71.8s build time)	2025-11-27 12:07:37 +01:00

... 3 4 5 6 7 ...

675 Commits