Files
trading_bot_v4/SIGNAL_QUALITY_OPTIMIZATION_ROADMAP.md
mindesbunister ba13c20c60 feat: implement blocked signals tracking system
- Add BlockedSignal table with 25 fields for comprehensive signal analysis
- Track all blocked signals with metrics (ATR, ADX, RSI, volume, price position)
- Store quality scores, block reasons, and detailed breakdowns
- Include future fields for automated price analysis (priceAfter1/5/15/30Min)
- Restore signalQualityVersion field to Trade table

Database changes:
- New table: BlockedSignal with indexes on symbol, createdAt, score, blockReason
- Fixed schema drift from manual changes

API changes:
- Modified check-risk endpoint to save blocked signals automatically
- Fixed hasContextMetrics variable scope (moved to line 209)
- Save blocks for: quality score too low, cooldown period, hourly limit
- Use config.minSignalQualityScore instead of hardcoded 60

Database helpers:
- Added createBlockedSignal() function with try/catch safety
- Added getRecentBlockedSignals(limit) for queries
- Added getBlockedSignalsForAnalysis(olderThanMinutes) for automation

Documentation:
- Created BLOCKED_SIGNALS_TRACKING.md with SQL queries and analysis workflow
- Created SIGNAL_QUALITY_OPTIMIZATION_ROADMAP.md with 5-phase plan
- Documented data-first approach: collect 10-20 signals before optimization

Rationale:
Only 2 historical trades scored 60-64 (insufficient sample size for threshold decision).
Building data collection infrastructure before making premature optimizations.

Phase 1 (current): Collect blocked signals for 1-2 weeks
Phase 2 (next): Analyze patterns and make data-driven threshold decision
Phase 3-5 (future): Automation and ML optimization
2025-11-11 11:49:21 +01:00

370 lines
11 KiB
Markdown

# Signal Quality Optimization Roadmap
**Goal:** Optimize signal quality thresholds and scoring logic using data-driven analysis
**Current Status:** Phase 1 - Data Collection (Active)
**Last Updated:** November 11, 2025
---
## Overview
This roadmap guides the systematic improvement of signal quality filtering. We follow a **data-first approach**: collect evidence, analyze patterns, then make changes. No premature optimization.
### Current System
- **Quality Score Threshold:** 65 points (recently raised from 60)
- **Executed Trades:** 157 total (155 closed, 2 open)
- **Performance:** +$3.43 total P&L, 44.5% win rate
- **Score Distribution:**
- 80-100 (Excellent): 49 trades, +$46.48, 46.9% WR
- 70-79 (Good): 15 trades, -$2.20, 40.0% WR ⚠️
- 65-69 (Pass): 13 trades, +$28.28, 53.8% WR ✅
- 60-64 (Just Below): 2 trades, +$45.78, **100% WR** 🔥
- 0-49 (Very Weak): 13 trades, -$127.89, 30.8% WR 💀
---
## Phase 1: Data Collection (CURRENT) ✅ IN PROGRESS
**Status:** Infrastructure complete, collecting data
**Started:** November 11, 2025
**Target:** Collect 10-20 blocked signals (1-2 weeks)
### Completed (Nov 11, 2025)
- [x] Created `BlockedSignal` database table
- [x] Implemented automatic saving in check-risk endpoint
- [x] Deployed to production (trading-bot-v4 container)
- [x] Created tracking documentation (BLOCKED_SIGNALS_TRACKING.md)
### What's Being Tracked
Every blocked signal captures:
- **Metrics:** ATR, ADX, RSI, volume ratio, price position, timeframe
- **Score:** Quality score (0-100), version, detailed breakdown
- **Block Reason:** Quality score, cooldown, hourly limit, daily drawdown
- **Context:** Symbol, direction, price at signal time, timestamp
### What We're Looking For
1. How many signals score 60-64 (just below threshold)?
2. What are their characteristics (ADX, ATR, price position)?
3. Are there patterns (extreme positions, specific timeframes)?
4. Do they cluster around specific block reasons?
### Phase 1 Completion Criteria
- [ ] Minimum 10 blocked signals with quality scores 55-64
- [ ] At least 2 signals in 60-64 range (close calls)
- [ ] Mix of block reasons (not all quality score)
- [ ] Data spans multiple market conditions (trending, choppy, volatile)
### SQL Queries for Phase 1
```sql
-- Check progress
SELECT COUNT(*) as total_blocked
FROM "BlockedSignal";
-- Score distribution
SELECT
CASE
WHEN signalQualityScore >= 60 THEN '60-64 (Close)'
WHEN signalQualityScore >= 55 THEN '55-59 (Marginal)'
WHEN signalQualityScore >= 50 THEN '50-54 (Weak)'
ELSE '0-49 (Very Weak)'
END as tier,
COUNT(*) as count
FROM "BlockedSignal"
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
GROUP BY tier
ORDER BY MIN(signalQualityScore) DESC;
```
---
## Phase 2: Pattern Analysis 🔜 NEXT
**Prerequisites:** 10-20 blocked signals collected
**Estimated Duration:** 2-3 days
**Owner:** Manual analysis + SQL queries
### Analysis Tasks
#### 2.1: Score Distribution Analysis
```sql
-- Analyze blocked signals by score range
SELECT
CASE
WHEN signalQualityScore >= 60 THEN '60-64'
WHEN signalQualityScore >= 55 THEN '55-59'
ELSE '50-54'
END as score_range,
COUNT(*) as count,
ROUND(AVG(atr)::numeric, 2) as avg_atr,
ROUND(AVG(adx)::numeric, 1) as avg_adx,
ROUND(AVG(pricePosition)::numeric, 1) as avg_price_pos,
ROUND(AVG(volumeRatio)::numeric, 2) as avg_volume
FROM "BlockedSignal"
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
GROUP BY score_range
ORDER BY MIN(signalQualityScore) DESC;
```
#### 2.2: Compare with Executed Trades
```sql
-- Find executed trades with similar scores to blocked signals
SELECT
'Executed' as type,
signalQualityScore,
COUNT(*) as trades,
ROUND(AVG(realizedPnL)::numeric, 2) as avg_pnl,
ROUND(100.0 * SUM(CASE WHEN realizedPnL > 0 THEN 1 ELSE 0 END) / COUNT(*)::numeric, 1) as win_rate
FROM "Trade"
WHERE exitReason IS NOT NULL
AND signalQualityScore BETWEEN 60 AND 69
GROUP BY signalQualityScore
ORDER BY signalQualityScore;
```
#### 2.3: ADX Pattern Analysis
Key finding from existing data: ADX 20-25 is a trap zone!
```sql
-- ADX distribution in blocked signals
SELECT
CASE
WHEN adx >= 25 THEN 'Strong (25+)'
WHEN adx >= 20 THEN 'Moderate (20-25)'
WHEN adx >= 15 THEN 'Weak (15-20)'
ELSE 'Very Weak (<15)'
END as adx_tier,
COUNT(*) as count,
ROUND(AVG(signalQualityScore)::numeric, 1) as avg_score
FROM "BlockedSignal"
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
AND adx IS NOT NULL
GROUP BY adx_tier
ORDER BY MIN(adx) DESC;
```
#### 2.4: Extreme Position Analysis
Test hypothesis: Extremes (<10% or >90%) need different thresholds
```sql
-- Blocked signals at range extremes
SELECT
direction,
signalQualityScore,
ROUND(pricePosition::numeric, 1) as pos,
ROUND(adx::numeric, 1) as adx,
ROUND(volumeRatio::numeric, 2) as vol
FROM "BlockedSignal"
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
AND (pricePosition < 10 OR pricePosition > 90)
ORDER BY signalQualityScore DESC;
```
### Phase 2 Deliverables
- [ ] Score distribution report
- [ ] ADX pattern analysis
- [ ] Extreme position analysis
- [ ] Comparison with executed trades
- [ ] **DECISION:** Keep threshold at 65, lower to 60, or implement dual-threshold system
---
## Phase 3: Implementation (Conditional) 🎯 FUTURE
**Trigger:** Analysis shows clear pattern worth exploiting
**Prerequisites:** Phase 2 complete + statistical significance (15+ blocked signals)
### Option A: Dual-Threshold System (Recommended)
**IF** data shows extreme positions (price <10% or >90%) with scores 60-64 are profitable:
**Implementation:**
```typescript
// In check-risk endpoint
const isExtremePosition = pricePosition < 10 || pricePosition > 90
const requiredScore = isExtremePosition ? 60 : 65
if (qualityScore.score < requiredScore) {
// Block signal
}
```
**Changes Required:**
- `app/api/trading/check-risk/route.ts` - Add dual threshold logic
- `lib/trading/signal-quality.ts` - Add `isExtremePosition` helper
- `config/trading.ts` - Add `minScoreForExtremes` config option
- Update AI instructions with new logic
### Option B: ADX-Based Gates (Alternative)
**IF** data shows strong ADX trends (25+) with lower scores are profitable:
**Implementation:**
```typescript
const requiredScore = adx >= 25 ? 60 : 65
```
**Changes Required:**
- Similar to Option A but based on ADX threshold
### Option C: Keep Current (If No Clear Pattern)
**IF** data shows no consistent profit opportunity in blocked signals:
- No changes needed
- Continue monitoring
- Revisit in 20 more trades
### Phase 3 Checklist
- [ ] Decision made based on Phase 2 analysis
- [ ] Code changes implemented
- [ ] Updated signalQualityVersion to 'v5' in database
- [ ] AI instructions updated
- [ ] Tested with historical blocked signals
- [ ] Deployed to production
- [ ] Monitoring for 10 trades to validate improvement
---
## Phase 4: Price Analysis Automation 🤖 FUTURE
**Goal:** Automatically track if blocked signals would have been profitable
**Complexity:** Medium - requires price monitoring job
**Prerequisites:** Phase 3 complete OR 50+ blocked signals collected
### Architecture
```
Monitoring Job (runs every 30 min)
Fetch BlockedSignal records where:
- analysisComplete = false
- createdAt > 30 minutes ago
For each signal:
- Get price history from Pyth/Drift
- Calculate if TP1/TP2/SL would have been hit
- Update priceAfter1Min/5Min/15Min/30Min
- Set wouldHitTP1/TP2/SL flags
- Mark analysisComplete = true
Save results back to database
```
### Implementation Tasks
- [ ] Create price history fetching service
- [ ] Implement TP/SL hit calculation logic
- [ ] Create cron job or Next.js API route with scheduler
- [ ] Add monitoring dashboard for blocked signal outcomes
- [ ] Generate weekly reports on missed opportunities
### Success Metrics
- X% of blocked signals would have hit SL (blocks were correct)
- Y% would have hit TP1/TP2 (missed opportunities)
- Overall P&L of hypothetical blocked trades
---
## Phase 5: ML-Based Optimization 🧠 DISTANT FUTURE
**Goal:** Use machine learning to optimize scoring weights
**Prerequisites:** 200+ trades with quality scores, 100+ blocked signals
**Complexity:** High
### Approach
1. Extract features: ATR, ADX, RSI, volume, price position, timeframe
2. Train model on: executed trades (outcome = P&L)
3. Validate on: blocked signals (if price analysis complete)
4. Generate: Optimal scoring weights for each feature
5. Implement: Dynamic threshold adjustment based on market conditions
### Not Implemented Yet
This is a future consideration only. Current data-driven approach is sufficient.
---
## Key Principles
### 1. Data Before Action
- Minimum 10 samples before any decision
- Prefer 20+ for statistical confidence
- No changes based on 1-2 outliers
### 2. Incremental Changes
- Change one variable at a time
- Test for 10-20 trades after each change
- Revert if performance degrades
### 3. Version Tracking
- Every scoring logic change gets new version (v4 → v5)
- Store version with each trade/blocked signal
- Enables A/B testing and rollback
### 4. Document Everything
- Update this roadmap after each phase
- Record decisions and rationale
- Link to SQL queries and analysis
---
## Progress Tracking
### Milestones
- [x] Nov 11, 2025: Phase 1 infrastructure complete
- [ ] Target: ~Nov 20-25, 2025: Phase 1 complete (10-20 blocked signals)
- [ ] Target: ~Nov 25-30, 2025: Phase 2 analysis complete
- [ ] TBD: Phase 3 implementation (conditional)
### Metrics to Watch
- **Blocked signals collected:** 0/10 minimum
- **Close calls (60-64 score):** 0/2 minimum
- **Days of data collection:** 0/7 minimum
- **Market conditions covered:** 0/3 (trending, choppy, volatile)
### Review Schedule
- **Weekly:** Check blocked signal count
- **After 10 blocked:** Run Phase 2 analysis
- **After Phase 2:** Decide on Phase 3 implementation
- **Monthly:** Review overall system performance
---
## Questions to Answer
### Phase 1 Questions
- [ ] How many signals get blocked per day?
- [ ] What's the score distribution of blocked signals?
- [ ] Are most blocks from quality score or other reasons?
### Phase 2 Questions
- [ ] Do blocked signals at 60-64 have common characteristics?
- [ ] Would lowering threshold to 60 improve performance?
- [ ] Do extreme positions need different treatment?
- [ ] Is ADX pattern valid in blocked signals?
### Phase 3 Questions
- [ ] Did the change improve win rate?
- [ ] Did it increase profitability?
- [ ] Any unintended side effects?
---
## Appendix: Historical Context
### Why This Roadmap Exists
**Date:** November 11, 2025
**Situation:** Three TradingView signals fired:
1. SHORT at 05:15 - Executed (score likely 65+) → Losing trade
2. LONG at 05:20 - Executed (score likely 65+) → Losing trade
3. SHORT at 05:30 - **BLOCKED** (score 45) → Would have been profitable
**User Question:** "What can we do about this?"
**Analysis Findings:**
- Only 2 historical trades scored 60-64 (both winners +$45.78)
- Sample size too small for confident decision
- ADX 20-25 is a trap zone (-$23.41 in 23 trades)
- Low volume (<0.8x) outperforms high volume (counterintuitive!)
**Decision:** Build data collection system instead of changing thresholds prematurely
**This Roadmap:** Systematic approach to optimization with proper data backing
---
**Remember:** The goal isn't to catch every winning trade. The goal is to optimize the **risk-adjusted return** by catching more winners than losers at each threshold level. Sometimes blocking a potential winner is correct if it also blocks 3 losers.