trading_bot_v4

Author	SHA1	Message	Date
copilot-swe-agent[bot]	4f793ec22a	feat: Add integration test suite for Position Manager - Added Jest + ts-jest configuration (jest.config.js) - Added global test setup with mocks (tests/setup.ts) - Added trade factory helpers (tests/helpers/trade-factory.ts) - Added 7 test suites covering Position Manager logic: - tp1-detection.test.ts (13 tests) - breakeven-sl.test.ts (9 tests) - adx-runner-sl.test.ts (18 tests) - trailing-stop.test.ts (14 tests) - edge-cases.test.ts (18 tests) - price-verification.test.ts (13 tests) - decision-helpers.test.ts (28 tests) - Added test documentation (tests/README.md) - Updated package.json with Jest dependencies and scripts - All 113 tests pass Co-authored-by: mindesbunister <32161838+mindesbunister@users.noreply.github.com>	2025-12-05 00:16:12 +00:00
copilot-swe-agent[bot]	52ff787352	Initial plan	2025-12-05 00:03:51 +00:00
mindesbunister	302511293c	feat: Add production logging gating (Phase 1, Task 1.1) - Created logger utility with environment-based gating (lib/utils/logger.ts) - Replaced 517 console.log statements with logger.log (71% reduction) - Fixed import paths in 15 files (resolved comment-trapped imports) - Added DEBUG_LOGS=false to .env - Achieves 71% immediate log reduction (517/731 statements) - Expected 90% reduction in production when deployed Impact: Reduced I/O blocking, lower log volume in production Risk: LOW (easy rollback, non-invasive) Phase: Phase 1, Task 1.1 (Quick Wins - Console.log Production Gating) Files changed: - NEW: lib/utils/logger.ts (production-safe logging) - NEW: scripts/replace-console-logs.js (automation tool) - Modified: 15 lib/*.ts files (console.log → logger.log) - Modified: .env (DEBUG_LOGS=false) Next: Task 1.2 (Image Size Optimization)	2025-12-05 00:32:41 +01:00
mindesbunister	cc3a0a85a0	docs: Document manual trade quality bypass requirement User mandate: Manual Telegram trades bypass quality scoring entirely. Documentation updates: - Added 'Manual Trade Quality Bypass' section - Explains user requirement for instant execution - Documents implementation details (timeframe='manual' detection) - Clarifies that analytics check is now advisory only - Notes --force flag no longer needed for manual trades Context: This is part of the mandatory documentation workflow - every code change requires corresponding documentation update. Related commit: `0982578` (quality bypass implementation) Date: Dec 4, 2025	2025-12-04 19:56:54 +01:00
mindesbunister	09825782bb	feat: Bypass quality scoring for manual Telegram trades User requirement: Manual long/short commands via Telegram shall execute immediately without quality checks. Changes: - Execute endpoint now checks for timeframe='manual' flag - Added isManualTrade bypass alongside isValidatedEntry bypass - Manual trades skip quality threshold validation completely - Logs show 'MANUAL TRADE BYPASS' for transparency Impact: Telegram commands (long sol, short eth) now execute instantly without being blocked by low quality scores. Commit: Dec 4, 2025	2025-12-04 19:56:17 +01:00
mindesbunister	5feb6ba61b	optimize: Reduce 1-min webhook payload from 8 metrics to 2 (price + ADX only) - Removed 6 unused metrics: ATR, RSI, volumeRatio, pricePosition, maGap, volume - Systems only use: currentPrice (Smart Validation Queue) + ADX (adaptive trailing + revenge) - Result: 75% smaller webhook payload (may fix 5-min signal skipping issue) - Backward compatible: n8n parser handles missing fields gracefully - Testing: Upload to TradingView and monitor if 5-min signals process normally	2025-12-04 19:53:12 +01:00
mindesbunister	31ef8b01f2	docs: Add Common Pitfall #54 - Telegram webhook vs polling conflict	2025-12-04 17:19:08 +01:00
mindesbunister	14f28bf464	docs: Add mandatory rule #5 - CHECK DOCUMENTATION FIRST before suggestions - New IRON-CLAD RULE: Always search docs before making suggestions or asking questions - Purpose: Prevent wasting user time with already-answered questions - Examples: TradingView rate limits, roadmap features, known bugs, configuration - Workflow: Read request → Search docs → Check if answered → THEN respond - Applies to: Features, bugs, config, architecture, deployment, troubleshooting - Red flags: User says 'we already documented this' or 'check docs first' - Why: User spent months documenting comprehensively, 'NOTHING gets lost' principle - Impact: Respect user's documentation effort, save time = save money in financial system Files modified: - .github/copilot-instructions.md (line ~103-150, added Rule #5 with examples and workflow)	2025-12-04 17:05:32 +01:00
mindesbunister	c4cc16ede2	docs: EPYC cluster status report Dec 4, 2025 - Worker2 time restriction implementation complete - Stuck chunk 14 resolved - Performance impact analysis - Monitoring commands and verification tests - Expected behavior documentation	2025-12-04 15:19:21 +01:00
mindesbunister	f2f2992a98	fix: Add is_worker_allowed_to_run function definition Function was referenced but not defined - added implementation	2025-12-04 15:16:18 +01:00
mindesbunister	0babd1ea1a	docs: Add worker2 time restriction documentation - Complete guide for noise constraint management - Time-based scheduling logic explained - Performance impact analysis (27% reduction) - Monitoring commands and troubleshooting - Fixed stuck chunk 14 documentation	2025-12-04 14:12:09 +01:00
mindesbunister	f40fd66486	feat: Add time-restricted scheduling for worker2 (noise constraint) - Worker2 (bd-host01) now only runs 19:00-06:00 due to noise - Added is_worker_allowed_to_run() function for time-based control - Worker1 continues 24/7 operation - Reset stuck chunk 14 that was blocking progress since Dec 2	2025-12-04 14:12:00 +01:00
mindesbunister	76040fa82b	docs: Add New Agent Quick Start section to copilot instructions - Added explicit onboarding workflow at top of file - 4-step sequence: copilot-instructions → docs/README → README → explore - Lists all 8 documentation subdirectories with descriptions - Emphasizes 'NOTHING gets lost' principle - Ensures new agents have clear entry point without manual explanation	2025-12-04 14:06:05 +01:00
mindesbunister	0798229582	docs: Add comprehensive reorganization summary - Complete checklist of all 8 tasks accomplished - Before/after statistics (68 files → 8 organized subdirectories) - Documentation metrics (9 READMEs, 2000+ lines, 100+ ENV vars) - New structure diagram showing all subdirectories - Key improvements (discoverability, organization, maintainability) - Next steps and future maintenance guidelines - Lessons learned and best practices established	2025-12-04 13:29:40 +01:00
mindesbunister	6bfb02aa81	docs: Update main README to reference new documentation structure - Add comprehensive documentation navigation section - Link to all 8 subdirectory READMEs (setup, architecture, roadmaps, etc.) - Organize by topic (performance, design, bugs, infrastructure) - Add status-based finding (completed, in progress, analysis) - Replace old flat list with category-based structure - Essential reading section highlights key documents	2025-12-04 12:50:01 +01:00
mindesbunister	dc674ec6d5	docs: Add 1-minute simplified price feed to reduce TradingView alert queue pressure - Create moneyline_1min_price_feed.pinescript (70% smaller payload) - Remove ATR/ADX/RSI/VOL/POS from 1-minute alerts (not used for decisions) - Keep only price + symbol + timeframe for market data cache - Document rationale in docs/1MIN_SIMPLIFIED_FEED.md - Fix: 5-minute trading signals being dropped due to 1-minute flood (60/hour) - Impact: Preserve priority for actual trading signals	2025-12-04 11:19:04 +01:00
mindesbunister	4c36fa2bc3	docs: Major documentation reorganization + ENV variable reference Documentation Structure: - Created docs/ subdirectory organization (analysis/, architecture/, bugs/, cluster/, deployments/, roadmaps/, setup/, archived/) - Moved 68 root markdown files to appropriate categories - Root directory now clean (only README.md remains) - Total: 83 markdown files now organized by purpose New Content: - Added comprehensive Environment Variable Reference to copilot-instructions.md - 100+ ENV variables documented with types, defaults, purpose, notes - Organized by category: Required (Drift/RPC/Pyth), Trading Config (quality/ leverage/sizing), ATR System, Runner System, Risk Limits, Notifications, etc. - Includes usage examples (correct vs wrong patterns) File Distribution: - docs/analysis/ - Performance analyses, blocked signals, profit projections - docs/architecture/ - Adaptive leverage, ATR trailing, indicator tracking - docs/bugs/ - CRITICAL_.md, FIXES_.md bug reports (7 files) - docs/cluster/ - EPYC setup, distributed computing docs (3 files) - docs/deployments/ - _COMPLETE.md, DEPLOYMENT_.md status (12 files) - docs/roadmaps/ - All ROADMAP.md strategic planning files (7 files) - docs/setup/ - TradingView guides, signal quality, n8n setup (8 files) - docs/archived/2025_pre_nov/ - Obsolete verification checklist (1 file) Key Improvements: - ENV variable reference: Single source of truth for all configuration - Common Pitfalls #68-71: Already complete, verified during audit - Better findability: Category-based navigation vs 68 files in root - Preserves history: All files git mv (rename), not copy/delete - Zero broken functionality: Only documentation moved, no code changes Verification: - 83 markdown files now in docs/ subdirectories - Root directory cleaned: 68 files → 0 files (except README.md) - Git history preserved for all moved files - Container running: trading-bot-v4 (no restart needed) Next Steps: - Create README.md files in each docs subdirectory - Add navigation index - Update main README.md with new structure - Consolidate duplicate deployment docs - Archive truly obsolete files (old SQL backups) See: docs/analysis/CLEANUP_PLAN.md for complete reorganization strategy	2025-12-04 08:29:59 +01:00
mindesbunister	e48332e347	docs: Add verification status for Common Pitfall #53 fixes (Dec 3, 2025)	2025-12-03 23:03:40 +01:00
mindesbunister	aa61194aa6	fix: Add TypeScript interface for Smart Validation Queue properties (Bug 5) - Added validatedEntry?: boolean to ExecuteTradeRequest interface - Added originalQualityScore?: number to interface - Added validationDelayMinutes?: number to interface - Fixes TypeScript compilation error at line 231 - Required for Smart Validation Queue integration to work	2025-12-03 20:34:43 +01:00
mindesbunister	835fe176da	docs: Add Common Pitfalls #70 & #71 - Bug 5 & Bug 1 fixes Pitfall #70: Smart Validation Queue rejected by execute endpoint - Fixed execute endpoint to accept validatedEntry=true bypass flag - Allows quality 50-89 signals validated by price action - Smart Validation Queue now works end-to-end Pitfall #71: Revenge system missing external closure integration - Fixed external closure handler to trigger revenge for quality 85+ SL - 30-minute revenge window activates for external stop-outs - Completes revenge system coverage for all exit scenarios Both fixes deployed in commit `785b09e` (Dec 3, 2025) Container restart required to activate fixes	2025-12-03 20:23:21 +01:00
mindesbunister	785b09eeed	critical: Fix Bug 1 (revenge external closures) & Bug 5 (validated entry bypass) Bug 1 Fix - Revenge System External Closures: - External closure handler now checks if SL stop-out with quality 85+ - Calls stopHuntTracker.recordStopHunt() after database save - Enables revenge trading for on-chain order fills (not just Position Manager closes) - Added null safety for trade.signalQualityScore (defaults to 0) - Location: lib/trading/position-manager.ts line ~999 Bug 5 Fix - Execute Endpoint Validated Entry Bypass: - Added isValidatedEntry check before quality threshold rejection - Smart Validation Queue signals (quality 50-89) now execute successfully - Logs show bypass reason and validation details (delay, original quality) - Only affects signals with validatedEntry=true flag from queue - Location: app/api/trading/execute/route.ts line ~228 User Clarification: - TradingView price issue (4.47) was temporary glitch, not a bug - Only Bug 1 (revenge) and Bug 5 (execute rejection) needed fixing - Both fixes implemented and TypeScript errors resolved	2025-12-03 20:08:46 +01:00
mindesbunister	0f88d88dd3	docs: Add Common Pitfalls #68-69 (Dec 3, 2025 bug fixes) - Pitfall #68: Smart Entry using webhook percentage as signal price * Root cause: TradingView webhook price field contained percentage (70.80) instead of market price (42.50) * Impact: 97% pullback calculations made Smart Entry impossible to trigger * Fix: Use Pyth current price instead of webhook price * Commit: `7d0d38a` - Pitfall #69: Direction-specific leverage thresholds not explicit * Made LONG/SHORT leverage assignment explicit even though values same * Improves code clarity and maintainability * Commit: `58f812f` Both fixes deployed Dec 3, 2025, 09:02:45 CET (timestamp verified)	2025-12-03 10:27:07 +01:00
mindesbunister	7d0d38a8b0	critical: Fix Bug #1 - Smart Entry using wrong signal price PROBLEM: Smart Entry showed 'Signal Price: $70.80' when actual SOL price was ~$139.70 Calculated 'Pullback: -97.38%' when actual price change was <1% Smart Entry queue completely broken due to wrong price ROOT CAUSE: TradingView webhook (or n8n workflow) sends pricePosition percentage (73.77) as signalPrice instead of actual dollar price ($139.70) Code used body.signalPrice directly without validation EVIDENCE: Webhook payload: "pricePosition": 73.7704918033, "signalPrice": 73.7704918033 Identical values = pricePosition mapped incorrectly to signalPrice Percentage value (0-100) treated as dollar price = 100× too low FIXES: 1. Added detection: If signalPrice < $10, log warning (likely percentage) 2. Changed signalPrice source: Use currentPrice from Pyth (NOT body.signalPrice) 3. At signal time: priceChange = 0, pullbackMagnitude = 0 (no pullback yet) 4. Queue with correct price: Smart Entry timer gets current market price 5. Added comments explaining bug and fix IMPACT: Smart Entry will now use correct signal price ($130-150 for SOL) Pullback calculations will be accurate (0.15-0.5% range, not 97%) Queue will work correctly (wait for actual dips/bounces) Next signal will validate fix in production logs TESTING REQUIRED: - Wait for next signal (LONG or SHORT) - Verify log: 'Signal Price: $XXX.XX (using current market price)' - Verify log: 'Current Price: $XXX.XX (same as signal)' - Verify: No more -97% pullback calculations - Verify: Smart Entry queues correctly if no pullback yet FILES CHANGED: - app/api/trading/execute/route.ts lines 485-555 (rewritten Smart Entry logic) LOCATION: - Line 495: Added currentPrice null check - Line 502: Added percentage detection warning - Line 507: Changed to use currentPrice as signalPrice - Line 509-511: Set priceChange/pullback to 0 at signal time - Line 517: Queue with corrected signalPrice RELATED: - Bug #2: Leverage thresholds (FIXED separately, commit `58f812f`) - Bug #3: Missing Telegram entry notifications (pending investigation)	2025-12-03 08:16:27 +01:00
mindesbunister	58f812f0a7	critical: Fix Bug #2 - Direction-specific leverage thresholds not loaded PROBLEM: Quality 90 LONGs getting 5x instead of expected 10x leverage ROOT CAUSE: ENV vars QUALITY_LEVERAGE_THRESHOLD_LONG/SHORT existed but never loaded in code IMPACT: 50% smaller position sizes on quality 90-94 signals FIXES: 1. Added qualityLeverageThresholdLong and qualityLeverageThresholdShort to TradingConfig interface 2. Added ENV loading for both direction-specific thresholds 3. Updated getLeverageForQualityScore() to use direction-specific thresholds 4. Added proper fallback hierarchy: direction-specific → backward compat → hardcoded default 5. Added console logs showing which threshold and leverage tier is applied RESULT: Quality 90 LONGs will now get 10x leverage (highQualityLeverage) Position sizes will double from ~$89 to ~$178 User reported: 'last trades were very small positions. no way near a 10 or 15x leverage' This fix addresses that complaint - user expectation was correct, code was wrong. Files: config/trading.ts (interface lines 20-27, ENV loading lines 520-532, function lines 673-730)	2025-12-03 08:11:24 +01:00
mindesbunister	1a5205c289	critical: Fix SL/TP exit P&L compounding with atomic deduplication CRITICAL BUG FIX: Stop loss and take profit exits were sending duplicate Telegram notifications with compounding P&L (16 duplicates, 796x inflation). Real Incident (Dec 2, 2025): - Manual SOL-PERP SHORT position stopped out - 16 duplicate Telegram notifications received - P&L compounding: $0.23 → $12.10 → $24.21 → $183.12 (796× multiplication) - All showed identical: entry $139.64, hold 4h 5-6m, exit reason SL - First notification: Ghost detected (handled correctly) - Next 15 notifications: SL exit (all duplicates with compounding P&L) Root Cause: - Multiple monitoring loops detect SL condition simultaneously - All call executeExit() before any can remove position from tracking - Race condition: check closingInProgress → both true → both proceed - Database update happens BEFORE activeTrades.delete() - Each execution sends Telegram notification - P&L values compound across notifications Solution: Applied same atomic delete pattern as ghost detection fix (commit `93dd950`): - Move activeTrades.delete() to START of executeExit() (before any async operations) - Check wasInMap return value (only true for first caller, false for duplicates) - Early return if already deleted (atomic deduplication guard) - Only first loop proceeds to close, save DB, send notification - Removed redundant removeTrade() call (already deleted at start) Impact: - Prevents duplicate notifications for SL, TP1, TP2, emergency stops - Ensures accurate P&L reporting (no compounding) - Database receives correct single exit record - User receives ONE notification per exit (as intended) Code Changes: - Line ~1520: Added atomic delete guard for full closes (percentToClose >= 100) - Line ~1651: Removed redundant removeTrade() call - Both changes prevent race condition at function entry Scope: - ✅ Stop loss exits: Fixed - ✅ Take profit 2 exits: Fixed - ✅ Emergency stops: Fixed - ✅ Trailing stops: Fixed - ℹ️ Take profit 1: Not affected (partial close keeps position in monitoring) Related: - Ghost detection fix: commit `93dd950` (Dec 2, 2025) - same pattern, different function - Manual trade enhancement: commit `23277b7` (Dec 2, 2025) - unrelated feature - P&L compounding series: Common Pitfalls #48-49, #59-61, #67 in docs	2025-12-02 23:32:09 +01:00
mindesbunister	23277b7c87	feat: Manual trades wait for fresh 1-minute ATR datapoint PHASE 2 ENHANCED: Manual trades now wait for next 1-minute datapoint instead of using cached/stale data. Guarantees fresh ATR (<60s old). User requirement: 'when i send a telegram message to enter the market, the bot will simply wait for the next 1 minute datapoint' Implementation: - Add wait_for_fresh_market_data() async helper function - Polls market data cache every 5 seconds (max 60s) - Detects fresh data by timestamp change - Extracts real ATR/ADX/RSI from 1-minute TradingView data - User sees waiting message + confirmation when fresh data arrives - Falls back to preset ATR 0.43 on timeout (fail-safe) Benefits: - Adaptive targets match CURRENT volatility (not historical) - No stale data risk (guaranteed <60s old) - Better than Phase 2 v1 (5-minute tolerance) - Consistent with automated trades (same 1-min data source) User Experience: 1. User: /long sol 2. Bot: ⏳ Waiting for next 1-minute datapoint... 3. [Wait 15-45 seconds typically] 4. Bot: ✅ Fresh ATR: 0.4523 \| ADX: 34.2 \| RSI: 56.8 5. Bot: ✅ Position opened with adaptive targets Changes: - Add asyncio import for async sleep - Add wait_for_fresh_market_data() before manual_trade_handler - Replace Phase 2 v1 (5min tolerance) with polling logic - Add 3 user messages (waiting, confirmation, timeout) - Extract ATR/ADX/RSI from fresh data or fallback Files: - telegram_command_bot.py: +70 lines polling logic	2025-12-02 19:35:24 +01:00
mindesbunister	702ef7953b	docs: Add Common Pitfall #67 - Ghost detection race condition Bug: 23 duplicate Telegram notifications with P&L compounding (-7.96 to -,129.24) Cause: Multiple monitoring loops passed has() check before any deleted from Map Fix: Use Map.delete() atomic return value as deduplication lock Result: First caller deletes and proceeds, subsequent callers return immediately Related: #48-49 (TP1 P&L compound), #59-61 (external closure duplicates) Deployed: Dec 2, 2025 17:32:52 UTC (commit `93dd950`)	2025-12-02 18:43:24 +01:00
mindesbunister	93dd950821	critical: Fix ghost detection P&L compounding - delete from Map BEFORE check Bug: Multiple monitoring loops detect ghost simultaneously - Loop 1: has(tradeId) → true → proceeds - Loop 2: has(tradeId) → true → ALSO proceeds (race condition) - Both send Telegram notifications with compounding P&L Real incident (Dec 2, 2025): - Manual SHORT at $138.84 - 23 duplicate notifications - P&L compounded: -$47.96 → -$1,129.24 (23× accumulation) - Database shows single trade with final compounded value Fix: Map.delete() returns true if key existed, false if already removed - Call delete() FIRST - Check return value proceeds - All other loops get false → skip immediately - Atomic operation prevents race condition Pattern: This is variant of Common Pitfalls #48, #49, #59, #60, #61 - All had "check then delete" pattern - All vulnerable to async timing issues - Solution: "delete then check" pattern - Map.delete() is synchronous and atomic Files changed: - lib/trading/position-manager.ts lines 390-410 Related: DUPLICATE PREVENTED message was working but too late	2025-12-02 18:25:56 +01:00
mindesbunister	d156abc976	docs: Add mandatory git workflow and critical feedback requirements CRITICAL UPDATES to AI assistant instructions: 1. MANDATORY GIT WORKFLOW (DO NOT SKIP): - Added explicit requirement: implement → test → verify → document → commit → push - Made git commits NON-OPTIONAL for all significant changes - Added to both general prompt and copilot-instructions.md - Rationale: Agent has pattern of skipping documentation/commits 2. CHALLENGE USER IDEAS: - Added requirement to think critically about user requests - Instruction: "Think freely and don't hold back" - Goal: Find BEST solution, not just A solution - Push back on ideas that don't make sense - Ask "is there a simpler/faster/safer way?" 3. COMPREHENSIVE DOCUMENTATION SECTION: - Replaced brief documentation note with full workflow guide - Added 80+ lines of detailed documentation requirements - Includes examples, red flags, mindset principles - Emphasizes: "Git commit + Documentation = Complete work" Files modified: - .github/prompts/general prompt.prompt.md (added sections 5a, 6, updated 7-8) - .github/copilot-instructions.md (comprehensive documentation workflow) User mandate: "I am sick and tired of reminding you" - this makes it automatic. Impact: Future implementations will ALWAYS include documentation and git commits as part of standard workflow, not as afterthoughts.	2025-12-02 15:23:20 +01:00
mindesbunister	c581c62c83	docs: Add comprehensive documentation of MarketData execute endpoint fix	2025-12-02 12:54:19 +01:00
mindesbunister	79ab30782c	fix: MarketData storage now working in execute endpoint - Added debug logging to trace execution - Confirmed 1-minute signals being stored continuously - Database accumulating rows every 1-3 minutes - All indicators (ATR, ADX, RSI, volume, price position) storing correctly - 1-year retention active (365 days) - Foundation ready for 8-hour blocked signal tracking	2025-12-02 12:43:35 +01:00
mindesbunister	ea591d2c29	docs: Add comprehensive 1-year retention deployment documentation	2025-12-02 12:07:45 +01:00
mindesbunister	5773d7d36d	feat: Extend 1-minute data retention from 4 weeks to 1 year - Updated lib/maintenance/data-cleanup.ts retention period: 28 days → 365 days - Storage requirements validated: 251 MB/year (negligible) - Rationale: 13× more historical data for better pattern analysis - Benefits: 260-390 blocked signals/year vs 20-30/month - Cleanup cutoff: Now Dec 2, 2024 (vs Nov 4, 2025 previously) - Deployment verified: Container restarted, cleanup scheduled for 3 AM daily	2025-12-02 11:55:36 +01:00
mindesbunister	4239c99057	docs: Add Common Pitfall #66 - Smart Entry Validation Queue symbol normalization bug - Symptom: Abandonment notifications showing impossible prices (26 → 8.18 in 30s) - Root cause: Symbol format mismatch (TradingView 'SOLUSDT' vs cache 'SOL-PERP') - Fix: Added normalizeTradingViewSymbol() in check-risk endpoint before validation queue - Impact: Cache lookup now succeeds, Telegram shows correct abandonment prices - Files: check-risk/route.ts line 9 (import), lines 432-444 (normalization) - Commit: `6cec2e8` deployed Dec 1, 2025 - Lesson: Always normalize symbols at integration boundaries, cache key mismatches fail silently	2025-12-01 23:51:40 +01:00
mindesbunister	6cec2e8e71	critical: Fix Smart Entry Validation Queue wrong price display - Bug: Validation queue used TradingView symbol format (SOLUSDT) to lookup market data cache - Cache uses normalized Drift format (SOL-PERP) - Result: Cache lookup failed, wrong/stale price shown in Telegram abandonment notifications - Real incident: Signal at $126.00 showed $98.18 abandonment price (-22.08% impossible drop) - Fix: Added normalizeTradingViewSymbol() call in check-risk endpoint before passing to validation queue - Files changed: app/api/trading/check-risk/route.ts (import + symbol normalization) - Impact: Validation queue now correctly retrieves current price from market data cache - Deployed: Dec 1, 2025	2025-12-01 23:45:21 +01:00
mindesbunister	4fb301328d	docs: Document 70% CPU deployment and Python buffering fix - CRITICAL FIX: Python output buffering caused silent failure - Solution: python3 -u flag for unbuffered output - 70% CPU optimization: int(cpu_count() * 0.7) = 22-24 cores per server - Current state: 47 workers, load ~22 per server, 16.3 hour timeline - System operational since Dec 1 22:50:32 - Expected completion: Dec 2 15:15	2025-12-01 23:27:17 +01:00
mindesbunister	e748cf709d	fix: Correct SSH hop for EPYC worker2 connectivity - ProxyJump (-J) doesn't work from Docker container - Changed to nested SSH: hop -> target - Proper command escaping for nested SSH - Worker2 (srv-bd-host01) only accessible via worker1 (pve-nu-monitor01)	2025-12-01 19:42:08 +01:00
mindesbunister	7e1fe1cc30	feat: V9 advanced parameter sweep with MA gap filter (810K configs) Parameter space expansion: - Original 15 params: 101K configurations - NEW: MA gap filter (3 dimensions) = 18× expansion - Total: ~810,000 configurations across 4 time profiles - Chunk size: 1,000 configs/chunk = ~810 chunks MA Gap Filter parameters: - use_ma_gap: True/False (2 values) - ma_gap_min_long: -5.0%, 0%, +5.0% (3 values) - ma_gap_min_short: -5.0%, 0%, +5.0% (3 values) Implementation: - money_line_v9.py: Full v9 indicator with MA gap logic - v9_advanced_worker.py: Chunk processor (1,000 configs) - v9_advanced_coordinator.py: Work distributor (2 EPYC workers) - run_v9_advanced_sweep.sh: Startup script (generates + launches) Infrastructure: - Uses existing EPYC cluster (64 cores total) - Worker1: bd-epyc-02 (32 threads) - Worker2: bd-host01 (32 threads via SSH hop) - Expected runtime: 70-80 hours - Database: SQLite (chunk tracking + results) Goal: Find optimal MA gap thresholds for filtering false breakouts during MA whipsaw zones while preserving trend entries.	2025-12-01 18:11:47 +01:00
mindesbunister	2993bc8895	feat: Update v9 with optimal parameters from exhaustive sweep + consolidate files Parameter updates (from 4,096 config sweep analysis): - flipThreshold: 0.6 → 0.5 (optimal for reversal confirmation) - adxMin: 18 → 21 (stronger trend filter) - longPosMax: 85 → 75 (prevent chasing tops) - shortPosMin: 15 → 20 (catch momentum shorts) - volMin: 0.7 → 1.0 (stronger conviction requirement) File consolidation: - Archived moneyline_v9_ma_gap_clean.pinescript (suboptimal defaults) - Archived moneyline_v9_test.pinescript (suboptimal defaults, missing MA gap) - Kept moneyline_v9_ma_gap.pinescript as canonical v9 (optimal + MA gap analysis) Result: Single v9 file with optimal defaults producing 19.44% returns over 4 months (194.4% annualized) from sweep validation.	2025-12-01 16:04:42 +01:00
mindesbunister	f050372d7a	docs: Add Common Pitfall #65 - distributed worker quality_filter bug	2025-12-01 15:21:27 +01:00
mindesbunister	11a0ea324b	critical: Fix distributed worker quality_filter - dict to lambda function Root cause: Passing dict {'min_adx': 15, 'min_volume_ratio': vol_min} when simulate_money_line() expects callable function. Bug caused ALL 2,096 backtests to fail with 'dict' object is not callable. Fix: Changed to lambda function matching comprehensive_sweep.py pattern: quality_filter = lambda s: s.adx >= 15 and s.volume_ratio >= vol_min Verified fix working: Workers running at 100% CPU, no errors after 2+ minutes.	2025-12-01 14:59:08 +01:00
mindesbunister	a886555d44	docs: Complete SSH timeout + resumption logic fix documentation Comprehensive documentation including: - Root cause analysis for both bugs - Manual test procedures that validated fixes - Code changes with before/after comparisons - Verification results (24 worker processes running) - Lessons learned for future debugging - Current cluster state and next steps Files: cluster/SSH_TIMEOUT_FIX_COMPLETE.md (288 lines)	2025-12-01 12:58:03 +01:00
mindesbunister	323ef03f5f	critical: Fix SSH timeout + resumption logic bugs SSH Command Fix: - CRITICAL: Removed && after background command (&) - Pattern: 'cmd & echo Started' works, 'cmd && echo' waits forever - Manually tested: Works perfectly on direct SSH - Result: Chunk 0 now starts successfully on worker1 (24 processes running) Resumption Logic Fix: - CRITICAL: Only count completed/running chunks, not pending - Query: Added 'AND status IN (completed, running)' filter - Result: Starts from chunk 0 when no chunks complete (was skipping to chunk 3) Database Cleanup: - CRITICAL: Delete pending/failed chunks on coordinator start - Prevents UNIQUE constraint errors on retry - Result: Clean slate allows coordinator to assign chunks fresh Verification: - ✅ Chunk v9_chunk_000000: status='running', assigned_worker='worker1' - ✅ Worker1: 24 Python processes running backtester - ✅ Database: Cleaned 3 pending chunks, created 1 running chunk - ⚠️ Worker2: SSH hop still timing out (separate infrastructure issue) Files changed: - cluster/distributed_coordinator.py (3 critical fixes: line 388-401, 514-533, 507-514)	2025-12-01 12:56:35 +01:00
mindesbunister	1f83a7d7c4	feat: Add coordinator log viewer to cluster UI - Created /api/cluster/logs endpoint to read coordinator.log - Added real-time log display in cluster UI (updates every 3s) - Shows last 100 lines of coordinator.log in terminal-style display - Includes manual refresh button - Improves debugging experience - no need to SSH for logs User feedback: 'why dont we add the output of the log at the bottom of the page so i know whats going on' This addresses poor visibility into coordinator errors and failures. Next step: Fix SSH timeout issue blocking worker execution.	2025-12-01 11:49:23 +01:00
mindesbunister	db33af9f17	fix: Stop button database reset + UI state display (DATABASE-FIRST ARCHITECTURE) CRITICAL FIXES: 1. Stop button now resets database FIRST (before pkill) - Database cleanup happens even if coordinator crashed - Prevents stale 'running' chunks blocking restart - Uses Node.js sqlite library (not CLI - Docker compatible) 2. UI enhancement - 4-state display - ⚡ Processing (running > 0) - ⏳ Pending (pending > 0, running = 0) - ✅ Complete (all completed) - ⏸️ Idle (no work queued) [NEW] - Shows pending chunk count when present TECHNICAL DETAILS: - Replaced sqlite3 CLI calls with proper Node.js API - Fixed permissions: chown 1001:1001 cluster/ for container write - Database-first logic: reset → pkill → verify - Detailed logging for each operation step FILES CHANGED: - app/api/cluster/control/route.ts (database operations refactored) - app/cluster/page.tsx (4-state UI display) VERIFIED: - Stop button successfully reset 3 'running' chunks → 'pending' - UI correctly shows Idle state after Stop - Container logs show detailed operation flow - Database operations work in Docker environment DEPLOYMENT: - Container rebuilt with fixed code - Tested with real stale database (3 running chunks) - All operations working correctly	2025-12-01 11:34:47 +01:00
mindesbunister	c343daeb44	docs: Document EPYC cluster SSH timeout fix in Common Pitfalls - Added Common Pitfall #64: SSH timeout for nested hop scenarios - Documented 30s→60s timeout increase rationale - Explained SSH options: StrictHostKeyChecking, ConnectTimeout, ServerAliveInterval - Included verification data: 23-24 processes per worker at 99% CPU - Provided formula for calculating minimum timeouts for multi-hop SSH - Cross-referenced commit `ef371a1` (the actual code fix) - Added future prevention guidance (timeout formulas, SSH multiplexing) This documentation update accompanies the cluster fix deployed earlier.	2025-12-01 09:46:17 +01:00
mindesbunister	ef371a19b9	fix: EPYC cluster SSH timeout - increase timeout 30s→60s + add SSH options CRITICAL FIX (Dec 1, 2025): Cluster start was failing with 'operation failed' Problem: - SSH commands timing out after 30s (too short for 2-hop SSH to worker2) - Missing SSH options caused prompts/delays - Result: Coordinator failed to start worker processes Solution: - Increased timeout from 30s to 60s for nested SSH hops - Added SSH options: -o StrictHostKeyChecking=no -o ConnectTimeout=10 - Applied options to both ssh_command() and worker startup commands Verification (Dec 1, 09:40): - Worker1: 23 processes running (chunk 0-2000) - Worker2: 24 processes running (chunk 2000-4000) - Cluster status: ACTIVE with 2 workers - Both chunks processing successfully Files changed: - cluster/distributed_coordinator.py (lines 302-314, 388-414)	2025-12-01 09:41:42 +01:00
mindesbunister	549fe8e077	docs: CRITICAL - Make documentation + git commit hand-in-hand #1 PRIORITY USER MANDATE (Dec 1, 2025): Documentation MUST go hand-in-hand with EVERY git commit. This is NOT optional. This is NOT a suggestion. This is MANDATORY. Changes: - Elevated documentation section to #1 PRIORITY status - Added user's direct quote: 'this HAS to go hand in hand' - Expanded from 15 lines to 100+ lines with comprehensive guidelines - Added 'Why This is #1 Priority' section with user's frustration quote - Added explicit 'When Documentation is MANDATORY' checklist - Added 'The Correct Mindset' section emphasizing it's part of the work - Added 4 scenario examples showing what MUST be documented - Added 'Red Flags' section to catch missing documentation - Added 'Integration with Existing Sections' guide - Made it crystal clear: Code without documentation = INCOMPLETE WORK This addresses user's repeated reminders about documentation being mandatory. Future AI agents will now see this as the #1 priority it is. NO MORE PUSHING CODE WITHOUT DOCUMENTATION UPDATES.	2025-12-01 09:17:51 +01:00
mindesbunister	b1a41733b8	docs: Document Dec 1 adaptive leverage UI enhancements - Updated adaptive leverage configuration section with current values (10x/5x) - Added Settings UI documentation with 5 configurable fields - Documented direction-specific thresholds (LONG/SHORT split) - Added dynamic collateral display implementation details - Documented new /api/drift/account-health endpoint - Added commit history for Dec 1 changes (`2e511ce`, `21c13b9`, `a294f44`, `67ef5b1`) - Updated API endpoints section with account-health route Changes reflect full UI implementation completed Dec 1, 2025: - Independent LONG (95) and SHORT (90) quality threshold controls - Real-time collateral fetching from Drift Protocol - Position size calculator with dynamic balance updates - Complete production-ready adaptive leverage system	2025-12-01 09:15:03 +01:00
mindesbunister	67ef5b1ac6	feat: Add direction-specific quality thresholds and dynamic collateral display - Split QUALITY_LEVERAGE_THRESHOLD into separate LONG and SHORT variants - Added /api/drift/account-health endpoint for real-time collateral data - Updated settings UI to show separate controls for LONG/SHORT thresholds - Position size calculations now use dynamic collateral from Drift account - Updated .env and docker-compose.yml with new environment variables - LONG threshold: 95, SHORT threshold: 90 (configurable independently) Files changed: - app/api/drift/account-health/route.ts (NEW) - Account health API endpoint - app/settings/page.tsx - Added collateral state, separate threshold inputs - app/api/settings/route.ts - GET/POST handlers for LONG/SHORT thresholds - .env - Added QUALITY_LEVERAGE_THRESHOLD_LONG/SHORT variables - docker-compose.yml - Added new env vars with fallback defaults Impact: - Users can now configure quality thresholds independently for LONG vs SHORT signals - Position size display dynamically updates based on actual Drift account collateral - More flexible risk management with direction-specific leverage tiers	2025-12-01 09:09:30 +01:00

1 2 3 4 5 ...

518 Commits