- Added signalSource field documentation - Emphasized CRITICAL exclusion from TradingView indicator analysis - Reference to MANUAL_TRADE_FILTERING.md for SQL queries - Manual Trading via Telegram section updated with contamination warning
1563 lines
76 KiB
Markdown
1563 lines
76 KiB
Markdown
# AI Agent Instructions for Trading Bot v4
|
||
|
||
## Mission & Financial Goals
|
||
|
||
**Primary Objective:** Build wealth systematically from $106 → $100,000+ through algorithmic trading
|
||
|
||
**Current Phase:** Phase 1 - Survival & Proof (Nov 2025 - Jan 2026)
|
||
- **Current Capital:** $97.55 USDC (zero debt, 100% health)
|
||
- **Starting Capital:** $106 (Nov 2025)
|
||
- **Target:** $2,500 by end of Phase 1 (Month 2.5)
|
||
- **Strategy:** Aggressive compounding, 0 withdrawals
|
||
- **Position Sizing:** 100% of free collateral (~$97 at 15x leverage = ~$1,463 notional)
|
||
- **Risk Tolerance:** EXTREME - This is recovery/proof-of-concept mode
|
||
- **Win Target:** 20-30% monthly returns to reach $2,500
|
||
- **Trades Executed:** 161 (as of Nov 12, 2025)
|
||
|
||
**Why This Matters for AI Agents:**
|
||
- Every dollar counts at this stage - optimize for profitability, not just safety
|
||
- User needs this system to work for long-term financial goals ($300-500/month withdrawals starting Month 3)
|
||
- No changes that reduce win rate unless they improve profit factor
|
||
- System must prove itself before scaling (see `TRADING_GOALS.md` for full 8-phase roadmap)
|
||
|
||
**Key Constraints:**
|
||
- Can't afford extended drawdowns (limited capital)
|
||
- Must maintain 60%+ win rate to compound effectively
|
||
- Quality over quantity - only trade 60+ signal quality scores (lowered from 65 on Nov 12, 2025)
|
||
- After 3 consecutive losses, STOP and review system
|
||
|
||
## Architecture Overview
|
||
|
||
**Type:** Autonomous cryptocurrency trading bot with Next.js 15 frontend + Solana/Drift Protocol backend
|
||
|
||
**Data Flow:** TradingView → n8n webhook → Next.js API → Drift Protocol (Solana DEX) → Real-time monitoring → Auto-exit
|
||
|
||
**CRITICAL: RPC Provider Choice**
|
||
- **MUST use Alchemy RPC** (https://solana-mainnet.g.alchemy.com/v2/YOUR_API_KEY)
|
||
- **DO NOT use Helius free tier** - causes catastrophic rate limiting (239 errors in 10 minutes)
|
||
- Helius free: 10 req/sec sustained = TOO LOW for trade execution + Position Manager monitoring
|
||
- Alchemy free: 300M compute units/month = adequate for bot operations
|
||
- **Symptom if wrong RPC:** Trades hit SL immediately, duplicate closes, Position Manager loses tracking, database save failures
|
||
- **Fixed Nov 14, 2025:** Switched to Alchemy, system now works perfectly (TP1/TP2/runner all functioning)
|
||
|
||
**Key Design Principle:** Dual-layer redundancy - every trade has both on-chain orders (Drift) AND software monitoring (Position Manager) as backup.
|
||
|
||
**Exit Strategy:** TP2-as-Runner system (CURRENT):
|
||
- TP1 at +0.4%: Close configurable % (default 75%, adjustable via `TAKE_PROFIT_1_SIZE_PERCENT`)
|
||
- TP2 at +0.7%: **Activates trailing stop** on full remaining % (no position close)
|
||
- Runner: Remaining % after TP1 with ATR-based trailing stop (default 25%, configurable)
|
||
- **Note:** All UI displays dynamically calculate runner% as `100 - TAKE_PROFIT_1_SIZE_PERCENT`
|
||
|
||
**Per-Symbol Configuration:** SOL and ETH have independent enable/disable toggles and position sizing:
|
||
- `SOLANA_ENABLED`, `SOLANA_POSITION_SIZE`, `SOLANA_LEVERAGE` (defaults: true, 100%, 15x)
|
||
- `ETHEREUM_ENABLED`, `ETHEREUM_POSITION_SIZE`, `ETHEREUM_LEVERAGE` (defaults: true, 100%, 1x)
|
||
- BTC and other symbols fall back to global settings (`MAX_POSITION_SIZE_USD`, `LEVERAGE`)
|
||
- **Priority:** Per-symbol ENV → Market config → Global ENV → Defaults
|
||
|
||
**Signal Quality System:** Filters trades based on 5 metrics (ATR, ADX, RSI, volumeRatio, pricePosition) scored 0-100. Only trades scoring 60+ are executed (lowered from 65 after data analysis showed 60-64 tier outperformed higher scores). Scores stored in database for future optimization.
|
||
|
||
**Timeframe-Aware Scoring:** Signal quality thresholds adjust based on timeframe (5min vs daily):
|
||
- 5min: ADX 12+ trending (vs 18+ for daily), ATR 0.2-0.7% healthy (vs 0.4%+ for daily)
|
||
- Anti-chop filter: -20 points for extreme sideways (ADX <10, ATR <0.25%, Vol <0.9x)
|
||
- Pass `timeframe` param to `scoreSignalQuality()` from TradingView alerts (e.g., `timeframe: "5"`)
|
||
|
||
**MAE/MFE Tracking:** Every trade tracks Maximum Favorable Excursion (best profit %) and Maximum Adverse Excursion (worst loss %) updated every 2s. Used for data-driven optimization of TP/SL levels.
|
||
|
||
**Manual Trading via Telegram:** Send plain-text messages like `long sol`, `short eth`, `long btc` to open positions instantly (bypasses n8n, calls `/api/trading/execute` directly with preset healthy metrics). **CRITICAL:** Manual trades are marked with `signalSource='manual'` and excluded from TradingView indicator analysis (prevents data contamination).
|
||
|
||
**Re-Entry Analytics System:** Manual trades are validated before execution using fresh TradingView data:
|
||
- Market data cached from TradingView signals (5min expiry)
|
||
- `/api/analytics/reentry-check` scores re-entry based on fresh metrics + recent performance
|
||
- Telegram bot blocks low-quality re-entries unless `--force` flag used
|
||
- Uses real TradingView ADX/ATR/RSI when available, falls back to historical data
|
||
- Penalty for recent losing trades, bonus for winning streaks
|
||
|
||
## VERIFICATION MANDATE: Financial Code Requires Proof
|
||
|
||
**CRITICAL: THIS IS A REAL MONEY TRADING SYSTEM - NOT A TOY PROJECT**
|
||
|
||
**Core Principle:** In trading systems, "working" means "verified with real data", NOT "code looks correct".
|
||
|
||
**NEVER declare something working without:**
|
||
1. Observing actual logs showing expected behavior
|
||
2. Verifying database state matches expectations
|
||
3. Comparing calculated values to source data
|
||
4. Testing with real trades when applicable
|
||
5. **CONFIRMING CODE IS DEPLOYED** - Check container start time vs commit time
|
||
|
||
**CODE COMMITTED ≠ CODE DEPLOYED**
|
||
- Git commit at 15:56 means NOTHING if container started at 15:06
|
||
- ALWAYS verify: `docker logs trading-bot-v4 | grep "Server starting" | head -1`
|
||
- Compare container start time to commit timestamp
|
||
- If container older than commit: **CODE NOT DEPLOYED, FIX NOT ACTIVE**
|
||
- Never say "fixed" or "protected" until deployment verified
|
||
|
||
### Critical Path Verification Requirements
|
||
|
||
**Position Manager Changes:**
|
||
- [ ] Execute test trade with DRY_RUN=false (small size)
|
||
- [ ] Watch docker logs for full TP1 → TP2 → exit cycle
|
||
- [ ] SQL query: verify `tp1Hit`, `slMovedToBreakeven`, `currentSize` match Position Manager logs
|
||
- [ ] Compare Position Manager tracked size to actual Drift position size
|
||
- [ ] Check exit reason matches actual trigger (TP1/TP2/SL/trailing)
|
||
|
||
**Exit Logic Changes (TP/SL/Trailing):**
|
||
- [ ] Log EXPECTED values (TP1 price, SL price after breakeven, trailing stop distance)
|
||
- [ ] Log ACTUAL values from Drift position and Position Manager state
|
||
- [ ] Verify: Does TP1 hit when price crosses TP1? Does SL move to breakeven?
|
||
- [ ] Test: Open position, let it hit TP1, verify 75% closed + SL moved
|
||
- [ ] Document: What SHOULD happen vs what ACTUALLY happened
|
||
|
||
**API Endpoint Changes:**
|
||
- [ ] curl test with real payload from TradingView/n8n
|
||
- [ ] Check response JSON matches expectations
|
||
- [ ] Verify database record created with correct fields
|
||
- [ ] Check Telegram notification shows correct values (leverage, size, etc.)
|
||
- [ ] SQL query: confirm all fields populated correctly
|
||
|
||
**Calculation Changes (P&L, Position Sizing, Percentages):**
|
||
- [ ] Add console.log for EVERY step of calculation
|
||
- [ ] Verify units match (tokens vs USD, percent vs decimal, etc.)
|
||
- [ ] SQL query with manual calculation: does code result match hand calculation?
|
||
- [ ] Test edge cases: 0%, 100%, negative values, very small/large numbers
|
||
|
||
**SDK/External Data Integration:**
|
||
- [ ] Log raw SDK response to verify assumptions about data format
|
||
- [ ] NEVER trust documentation - verify with console.log
|
||
- [ ] Example: position.size doc said "USD" but logs showed "tokens"
|
||
- [ ] Document actual behavior in Common Pitfalls section
|
||
|
||
### Red Flags Requiring Extra Verification
|
||
|
||
**High-Risk Changes:**
|
||
- Unit conversions (tokens ↔ USD, percent ↔ decimal)
|
||
- State transitions (TP1 hit → move SL to breakeven)
|
||
- Configuration precedence (per-symbol vs global vs defaults)
|
||
- Display values from complex calculations (leverage, size, P&L)
|
||
- Timing-dependent logic (grace periods, cooldowns, race conditions)
|
||
|
||
**Verification Steps for Each:**
|
||
1. **Before declaring working**: Show proof (logs, SQL results, test output)
|
||
2. **After deployment**: Monitor first real trade closely, verify behavior
|
||
3. **Edge cases**: Test boundary conditions (0, 100%, max leverage, min size)
|
||
4. **Regression**: Check that fix didn't break other functionality
|
||
|
||
### SQL Verification Queries
|
||
|
||
**After Position Manager changes:**
|
||
```sql
|
||
-- Verify TP1 detection worked correctly
|
||
SELECT
|
||
symbol, entryPrice, currentSize, realizedPnL,
|
||
tp1Hit, slMovedToBreakeven, exitReason,
|
||
TO_CHAR(createdAt, 'MM-DD HH24:MI') as time
|
||
FROM "Trade"
|
||
WHERE exitReason IS NULL -- Open positions
|
||
OR createdAt > NOW() - INTERVAL '1 hour' -- Recent closes
|
||
ORDER BY createdAt DESC
|
||
LIMIT 5;
|
||
|
||
-- Compare Position Manager state to expectations
|
||
SELECT configSnapshot->'positionManagerState' as pm_state
|
||
FROM "Trade"
|
||
WHERE symbol = 'SOL-PERP' AND exitReason IS NULL;
|
||
```
|
||
|
||
**After calculation changes:**
|
||
```sql
|
||
-- Verify P&L calculations
|
||
SELECT
|
||
symbol, direction, entryPrice, exitPrice,
|
||
positionSize, realizedPnL,
|
||
-- Manual calculation:
|
||
CASE
|
||
WHEN direction = 'long' THEN
|
||
positionSize * ((exitPrice - entryPrice) / entryPrice)
|
||
ELSE
|
||
positionSize * ((entryPrice - exitPrice) / entryPrice)
|
||
END as expected_pnl,
|
||
-- Difference:
|
||
realizedPnL - CASE
|
||
WHEN direction = 'long' THEN
|
||
positionSize * ((exitPrice - entryPrice) / entryPrice)
|
||
ELSE
|
||
positionSize * ((entryPrice - exitPrice) / entryPrice)
|
||
END as pnl_difference
|
||
FROM "Trade"
|
||
WHERE exitReason IS NOT NULL
|
||
AND createdAt > NOW() - INTERVAL '24 hours'
|
||
ORDER BY createdAt DESC
|
||
LIMIT 10;
|
||
```
|
||
|
||
### Example: How Position.size Bug Should Have Been Caught
|
||
|
||
**What went wrong:**
|
||
- Read code: "Looks like it's comparing sizes correctly"
|
||
- Declared: "Position Manager is working!"
|
||
- Didn't verify with actual trade
|
||
|
||
**What should have been done:**
|
||
```typescript
|
||
// In Position Manager monitoring loop - ADD THIS LOGGING:
|
||
console.log('🔍 VERIFICATION:', {
|
||
positionSizeRaw: position.size, // What SDK returns
|
||
positionSizeUSD: position.size * currentPrice, // Converted to USD
|
||
trackedSizeUSD: trade.currentSize, // What we're tracking
|
||
ratio: (position.size * currentPrice) / trade.currentSize,
|
||
tp1ShouldTrigger: (position.size * currentPrice) < trade.currentSize * 0.95
|
||
})
|
||
```
|
||
|
||
Then observe logs on actual trade:
|
||
```
|
||
🔍 VERIFICATION: {
|
||
positionSizeRaw: 12.28, // ← AH! This is SOL tokens, not USD!
|
||
positionSizeUSD: 1950.84, // ← Correct USD value
|
||
trackedSizeUSD: 1950.00,
|
||
ratio: 1.0004, // ← Should be near 1.0 when position full
|
||
tp1ShouldTrigger: false // ← Correct
|
||
}
|
||
```
|
||
|
||
**Lesson:** One console.log would have exposed the bug immediately.
|
||
|
||
### Deployment Checklist
|
||
|
||
**MANDATORY PRE-DEPLOYMENT VERIFICATION:**
|
||
- [ ] Check container start time: `docker logs trading-bot-v4 | grep "Server starting" | head -1`
|
||
- [ ] Compare to commit timestamp: Container MUST be newer than code changes
|
||
- [ ] If container older: **STOP - Code not deployed, fix not active**
|
||
- [ ] Never declare "fixed" or "working" until container restarted with new code
|
||
|
||
Before marking feature complete:
|
||
- [ ] Code review completed
|
||
- [ ] Unit tests pass (if applicable)
|
||
- [ ] Integration test with real API calls
|
||
- [ ] Logs show expected behavior
|
||
- [ ] Database state verified with SQL
|
||
- [ ] Edge cases tested
|
||
- [ ] **Container restarted and verified running new code**
|
||
- [ ] Documentation updated (including Common Pitfalls if applicable)
|
||
- [ ] User notified of what to verify during first real trade
|
||
|
||
### When to Escalate to User
|
||
|
||
**Don't say "it's working" if:**
|
||
- You haven't observed actual logs showing the expected behavior
|
||
- SQL query shows unexpected values
|
||
- Test trade behaved differently than expected
|
||
- You're unsure about unit conversions or SDK behavior
|
||
- Change affects money (position sizing, P&L, exits)
|
||
- **Container hasn't been restarted since code commit**
|
||
|
||
**Instead say:**
|
||
- "Code is updated. Need to verify with test trade - watch for [specific log message]"
|
||
- "Fixed, but requires verification: check database shows [expected value]"
|
||
- "Deployed. First real trade should show [behavior]. If not, there's still a bug."
|
||
- **"Code committed but NOT deployed - container running old version, fix not active yet"**
|
||
|
||
### Docker Build Best Practices
|
||
|
||
**CRITICAL: Prevent build interruptions with background execution + live monitoring**
|
||
|
||
Docker builds take 40-70 seconds and are easily interrupted by terminal issues. Use this pattern:
|
||
|
||
```bash
|
||
# Start build in background with live log tail
|
||
cd /home/icke/traderv4 && docker compose build trading-bot > /tmp/docker-build-live.log 2>&1 & BUILD_PID=$!; echo "Build started, PID: $BUILD_PID"; tail -f /tmp/docker-build-live.log
|
||
```
|
||
|
||
**Why this works:**
|
||
- Build runs in background (`&`) - immune to terminal disconnects/Ctrl+C
|
||
- Output redirected to log file - can review later if needed
|
||
- `tail -f` shows real-time progress - see compilation, linting, errors
|
||
- Can Ctrl+C the `tail -f` without killing build - build continues
|
||
- Verification after: `tail -50 /tmp/docker-build-live.log` to check success
|
||
|
||
**Success indicators:**
|
||
- `✓ Compiled successfully in 27s`
|
||
- `✓ Generating static pages (30/30)`
|
||
- `#22 naming to docker.io/library/traderv4-trading-bot done`
|
||
- `DONE X.Xs` on final step
|
||
|
||
**Failure indicators:**
|
||
- `Failed to compile.`
|
||
- `Type error:`
|
||
- `ERROR: process "/bin/sh -c npm run build" did not complete successfully: exit code: 1`
|
||
|
||
**After successful build:**
|
||
```bash
|
||
# Deploy new container
|
||
docker compose up -d --force-recreate trading-bot
|
||
|
||
# Verify it started
|
||
docker logs --tail=30 trading-bot-v4
|
||
|
||
# Confirm deployed version
|
||
docker logs trading-bot-v4 | grep "Server starting" | head -1
|
||
```
|
||
|
||
**DO NOT use:** `docker compose build trading-bot` in foreground - one network hiccup kills 60s of work
|
||
|
||
### Docker Cleanup After Builds
|
||
|
||
**CRITICAL: Prevent disk full issues from build cache accumulation**
|
||
|
||
Docker builds create intermediate layers (1.3+ GB per build) that accumulate over time. Build cache can reach 40-50 GB after frequent rebuilds.
|
||
|
||
**After successful deployment, clean up:**
|
||
```bash
|
||
# Remove dangling images (old builds)
|
||
docker image prune -f
|
||
|
||
# Remove build cache (biggest space hog - 40+ GB typical)
|
||
docker builder prune -f
|
||
|
||
# Optional: Remove dangling volumes (if no important data)
|
||
docker volume prune -f
|
||
|
||
# Check space saved
|
||
docker system df
|
||
```
|
||
|
||
**When to run:**
|
||
- After each successful deployment (recommended)
|
||
- Weekly if building frequently
|
||
- When disk space warnings appear
|
||
- Before major updates/migrations
|
||
|
||
**Space typically freed:**
|
||
- Dangling images: 2-5 GB
|
||
- Build cache: 40-50 GB
|
||
- Dangling volumes: 0.5-1 GB
|
||
- **Total: 40-55 GB per cleanup**
|
||
|
||
**What's safe to delete:**
|
||
- `<none>` tagged images (old builds)
|
||
- Build cache (recreated on next build)
|
||
- Dangling volumes (orphaned from removed containers)
|
||
|
||
**What NOT to delete:**
|
||
- Named volumes (contain data: `trading-bot-postgres`, etc.)
|
||
- Active containers
|
||
- Tagged images currently in use
|
||
|
||
---
|
||
|
||
## Critical Components
|
||
|
||
### 1. Phantom Trade Auto-Closure System
|
||
**Purpose:** Automatically close positions when size mismatch detected (position opened but wrong size)
|
||
|
||
**When triggered:**
|
||
- Position opened on Drift successfully
|
||
- Expected size: $50 (50% @ 1x leverage)
|
||
- Actual size: $1.37 (7% fill - likely oracle price stale or exchange rejection)
|
||
- Size ratio < 50% threshold → phantom detected
|
||
|
||
**Automated response (all happens in <1 second):**
|
||
1. **Immediate closure:** Market order closes 100% of phantom position
|
||
2. **Database logging:** Creates trade record with `status='phantom'`, saves P&L
|
||
3. **n8n notification:** Returns HTTP 200 with full details (not 500 - allows workflow to continue)
|
||
4. **Telegram alert:** Message includes entry/exit prices, P&L, reason, transaction IDs
|
||
|
||
**Why auto-close instead of manual intervention:**
|
||
- User may be asleep, away from devices, unavailable for hours
|
||
- Unmonitored position = unlimited risk exposure
|
||
- Position Manager won't track phantom (by design)
|
||
- No TP/SL protection, no trailing stop, no monitoring
|
||
- Better to exit with small loss/gain than leave position exposed
|
||
- Re-entry always possible if setup was actually good
|
||
|
||
**Example notification:**
|
||
```
|
||
⚠️ PHANTOM TRADE AUTO-CLOSED
|
||
|
||
Symbol: SOL-PERP
|
||
Direction: LONG
|
||
Expected Size: $48.75
|
||
Actual Size: $1.37 (2.8%)
|
||
|
||
Entry: $168.50
|
||
Exit: $168.45
|
||
P&L: -$0.02
|
||
|
||
Reason: Size mismatch detected - likely oracle price issue or exchange rejection
|
||
Action: Position auto-closed for safety (unmonitored positions = risk)
|
||
|
||
TX: 5Yx2Fm8vQHKLdPaw...
|
||
```
|
||
|
||
**Database tracking:**
|
||
- `status='phantom'` field identifies these trades
|
||
- `isPhantom=true`, `phantomReason='ORACLE_PRICE_MISMATCH'`
|
||
- `expectedSizeUSD`, `actualSizeUSD` fields for analysis
|
||
- Exit reason: `'manual'` (phantom auto-close category)
|
||
- Enables post-trade analysis of phantom frequency and patterns
|
||
|
||
**Code location:** `app/api/trading/execute/route.ts` lines 322-445
|
||
|
||
### 2. Signal Quality Scoring (`lib/trading/signal-quality.ts`)
|
||
**Purpose:** Unified quality validation system that scores trading signals 0-100 based on 5 market metrics
|
||
|
||
**Timeframe-aware thresholds:**
|
||
```typescript
|
||
scoreSignalQuality({
|
||
atr, adx, rsi, volumeRatio, pricePosition,
|
||
timeframe?: string // "5" for 5min, undefined for higher timeframes
|
||
})
|
||
```
|
||
|
||
**5min chart adjustments:**
|
||
- ADX healthy range: 12-22 (vs 18-30 for daily)
|
||
- ATR healthy range: 0.2-0.7% (vs 0.4%+ for daily)
|
||
- Anti-chop filter: -20 points for extreme sideways (ADX <10, ATR <0.25%, Vol <0.9x)
|
||
|
||
**Price position penalties (all timeframes):**
|
||
- Long at 90-95%+ range: -15 to -30 points (chasing highs)
|
||
- Short at <5-10% range: -15 to -30 points (chasing lows)
|
||
- Prevents flip-flop losses from entering range extremes
|
||
|
||
**Key behaviors:**
|
||
- Returns score 0-100 and detailed breakdown object
|
||
- Minimum score 60 required to execute trade
|
||
- Called by both `/api/trading/check-risk` and `/api/trading/execute`
|
||
- Scores saved to database for post-trade analysis
|
||
|
||
### 2. Position Manager (`lib/trading/position-manager.ts`)
|
||
**Purpose:** Software-based monitoring loop that checks prices every 2 seconds and closes positions via market orders
|
||
|
||
**Singleton pattern:** Always use `getInitializedPositionManager()` - never instantiate directly
|
||
```typescript
|
||
const positionManager = await getInitializedPositionManager()
|
||
await positionManager.addTrade(activeTrade)
|
||
```
|
||
|
||
**Key behaviors:**
|
||
- Tracks `ActiveTrade` objects in a Map
|
||
- **TP2-as-Runner system**: TP1 (configurable %, default 75%) → TP2 trigger (no close, activate trailing) → Runner (remaining %) with ATR-based trailing stop
|
||
- Dynamic SL adjustments: Moves to breakeven after TP1, locks profit at +1.2%
|
||
- **On-chain order synchronization:** After TP1 hits, calls `cancelAllOrders()` then `placeExitOrders()` with updated SL price at breakeven (uses `retryWithBackoff()` for rate limit handling)
|
||
- **ATR-based trailing stop:** Calculates trail distance as `(atrAtEntry / currentPrice × 100) × trailingStopAtrMultiplier`, clamped between min/max %
|
||
- Trailing stop: Activates when TP2 price hit, tracks `peakPrice` and trails dynamically
|
||
- Closes positions via `closePosition()` market orders when targets hit
|
||
- Acts as backup if on-chain orders don't fill
|
||
- State persistence: Saves to database, restores on restart via `configSnapshot.positionManagerState`
|
||
- **Startup validation:** On container restart, cross-checks last 24h "closed" trades against Drift to detect orphaned positions (see `lib/startup/init-position-manager.ts`)
|
||
- **Grace period for new trades:** Skips "external closure" detection for positions <30 seconds old (Drift positions take 5-10s to propagate)
|
||
- **Exit reason detection:** Uses trade state flags (`tp1Hit`, `tp2Hit`) and realized P&L to determine exit reason, NOT current price (avoids misclassification when price moves after order fills)
|
||
- **Real P&L calculation:** Calculates actual profit based on entry vs exit price, not SDK's potentially incorrect values
|
||
- **Rate limit-aware exit:** On 429 errors during close, keeps trade in monitoring (doesn't mark closed), retries naturally on next price update
|
||
|
||
### 3. Telegram Bot (`telegram_command_bot.py`)
|
||
**Purpose:** Python-based Telegram bot for manual trading commands and position status monitoring
|
||
|
||
**Manual trade commands via plain text:**
|
||
```python
|
||
# User sends plain text message (not slash commands)
|
||
"long sol" → Validates via analytics, then opens SOL-PERP long
|
||
"short eth" → Validates via analytics, then opens ETH-PERP short
|
||
"long btc --force" → Skips analytics validation, opens BTC-PERP long immediately
|
||
```
|
||
|
||
**Key behaviors:**
|
||
- MessageHandler processes all text messages (not just commands)
|
||
- Maps user-friendly symbols (sol, eth, btc) to Drift format (SOL-PERP, etc.)
|
||
- **Analytics validation:** Calls `/api/analytics/reentry-check` before execution
|
||
- Blocks trades with score <55 unless `--force` flag used
|
||
- Uses fresh TradingView data (<5min old) when available
|
||
- Falls back to historical metrics with penalty
|
||
- Considers recent trade performance (last 3 trades)
|
||
- Calls `/api/trading/execute` directly with preset healthy metrics (ATR=0.45, ADX=32, RSI=58/42)
|
||
- Bypasses n8n workflow and TradingView requirements
|
||
- 60-second timeout for API calls
|
||
- Responds with trade confirmation or analytics rejection message
|
||
|
||
**Status command:**
|
||
```python
|
||
/status → Returns JSON of open positions from Drift
|
||
```
|
||
|
||
**Implementation details:**
|
||
- Uses `python-telegram-bot` library
|
||
- Deployed via `docker-compose.telegram-bot.yml`
|
||
- Requires `TELEGRAM_BOT_TOKEN` and `TELEGRAM_CHANNEL_ID` in .env
|
||
- API calls to `http://trading-bot:3000/api/trading/execute`
|
||
|
||
**Drift client integration:**
|
||
- Singleton pattern: Use `initializeDriftService()` and `getDriftService()` - maintains single connection
|
||
```typescript
|
||
const driftService = await initializeDriftService()
|
||
const health = await driftService.getAccountHealth()
|
||
```
|
||
- Wallet handling: Supports both JSON array `[91,24,...]` and base58 string formats from Phantom wallet
|
||
|
||
### 4. Rate Limit Monitoring (`lib/drift/orders.ts` + `app/api/analytics/rate-limits`)
|
||
**Purpose:** Track and analyze Solana RPC rate limiting (429 errors) to prevent silent failures
|
||
|
||
**Helius RPC Limits (Free Tier):**
|
||
- **Burst:** 100 requests/second
|
||
- **Sustained:** 10 requests/second
|
||
- **Monthly:** 100k requests
|
||
- See `docs/HELIUS_RATE_LIMITS.md` for upgrade recommendations
|
||
|
||
**Retry mechanism with exponential backoff (Nov 14, 2025 - Updated):**
|
||
```typescript
|
||
await retryWithBackoff(async () => {
|
||
return await driftClient.cancelOrders(...)
|
||
}, maxRetries = 3, baseDelay = 5000) // Increased from 2s to 5s
|
||
```
|
||
**Progression:** 5s → 10s → 20s (vs old 2s → 4s → 8s)
|
||
**Rationale:** Gives Helius time to recover, reduces cascade pressure by 2.5x
|
||
|
||
**Database logging:** Three event types in SystemEvent table:
|
||
- `rate_limit_hit`: Each 429 error (logged with attempt #, delay, error snippet)
|
||
- `rate_limit_recovered`: Successful retry (logged with total time, retry count)
|
||
- `rate_limit_exhausted`: Failed after max retries (CRITICAL - order operation failed)
|
||
|
||
**Analytics endpoint:**
|
||
```bash
|
||
curl http://localhost:3001/api/analytics/rate-limits
|
||
```
|
||
Returns: Total hits/recoveries/failures, hourly patterns, recovery times, success rate
|
||
|
||
**Key behaviors:**
|
||
- Only RPC calls wrapped: `cancelAllOrders()`, `placeExitOrders()`, `closePosition()`
|
||
- Position Manager monitoring: Event-driven via Pyth WebSocket (not polling)
|
||
- Rate limit-aware exit: Position Manager keeps monitoring on 429 errors (retries naturally)
|
||
- Logs to both console and database for post-trade analysis
|
||
|
||
**Monitoring queries:** See `docs/RATE_LIMIT_MONITORING.md` for SQL queries
|
||
|
||
**Startup Position Validation (Nov 14, 2025 - Added):**
|
||
On container startup, cross-checks last 24h of "closed" trades against actual Drift positions:
|
||
- If DB says closed but Drift shows open → reopens in DB to restore Position Manager tracking
|
||
- Prevents orphaned positions from failed close transactions
|
||
- Logs: `🔴 CRITICAL: ${symbol} marked as CLOSED in DB but still OPEN on Drift!`
|
||
- Implementation: `lib/startup/init-position-manager.ts` - `validateOpenTrades()`
|
||
|
||
### 5. Order Placement (`lib/drift/orders.ts`)
|
||
**Critical functions:**
|
||
- `openPosition()` - Opens market position with transaction confirmation
|
||
- `closePosition()` - Closes position with transaction confirmation
|
||
- `placeExitOrders()` - Places TP/SL orders on-chain
|
||
- `cancelAllOrders()` - Cancels all reduce-only orders for a market
|
||
|
||
**CRITICAL: Transaction Confirmation Pattern**
|
||
Both `openPosition()` and `closePosition()` MUST confirm transactions on-chain:
|
||
```typescript
|
||
const txSig = await driftClient.placePerpOrder(orderParams)
|
||
console.log('⏳ Confirming transaction on-chain...')
|
||
const connection = driftService.getConnection()
|
||
const confirmation = await connection.confirmTransaction(txSig, 'confirmed')
|
||
|
||
if (confirmation.value.err) {
|
||
throw new Error(`Transaction failed: ${JSON.stringify(confirmation.value.err)}`)
|
||
}
|
||
console.log('✅ Transaction confirmed on-chain')
|
||
```
|
||
Without this, the SDK returns signatures for transactions that never execute, causing phantom trades/closes.
|
||
|
||
**CRITICAL: Drift SDK position.size is BASE ASSET TOKENS, not USD**
|
||
The Drift SDK returns `position.size` as token quantity (SOL/ETH/BTC), NOT USD notional:
|
||
```typescript
|
||
// CORRECT: Convert tokens to USD by multiplying by current price
|
||
const positionSizeUSD = Math.abs(position.size) * currentPrice
|
||
|
||
// WRONG: Using position.size directly as USD (off by 150x+ for SOL!)
|
||
const positionSizeUSD = Math.abs(position.size)
|
||
```
|
||
**This affects Position Manager's TP1/TP2 detection** - if position.size is not converted to USD before comparing to tracked USD values, the system will never detect partial closes correctly. See Common Pitfall #22 for the full bug details and fix applied Nov 12, 2025.
|
||
|
||
**Solana RPC Rate Limiting with Exponential Backoff**
|
||
Solana RPC endpoints return 429 errors under load. Always use retry logic for order operations:
|
||
```typescript
|
||
export async function retryWithBackoff<T>(
|
||
operation: () => Promise<T>,
|
||
maxRetries: number = 3,
|
||
initialDelay: number = 5000 // Increased from 2000ms to 5000ms (Nov 14, 2025)
|
||
): Promise<T> {
|
||
for (let attempt = 0; attempt < maxRetries; attempt++) {
|
||
try {
|
||
return await operation()
|
||
} catch (error: any) {
|
||
if (error?.message?.includes('429') && attempt < maxRetries - 1) {
|
||
const delay = initialDelay * Math.pow(2, attempt)
|
||
console.log(`⏳ Rate limited, retrying in ${delay/1000}s... (attempt ${attempt + 1}/${maxRetries})`)
|
||
await new Promise(resolve => setTimeout(resolve, delay))
|
||
continue
|
||
}
|
||
throw error
|
||
}
|
||
}
|
||
throw new Error('Max retries exceeded')
|
||
}
|
||
|
||
// Usage in cancelAllOrders
|
||
await retryWithBackoff(() => driftClient.cancelOrders(...))
|
||
```
|
||
**Note:** Increased from 2s to 5s base delay to give Helius RPC more recovery time. See `docs/HELIUS_RATE_LIMITS.md` for detailed analysis.
|
||
Without this, order cancellations fail silently during TP1→breakeven order updates, leaving ghost orders that cause incorrect fills.
|
||
|
||
**Dual Stop System** (USE_DUAL_STOPS=true):
|
||
```typescript
|
||
// Soft stop: TRIGGER_LIMIT at -1.5% (avoids wicks)
|
||
// Hard stop: TRIGGER_MARKET at -2.5% (guarantees exit)
|
||
```
|
||
|
||
**Order types:**
|
||
- Entry: MARKET (immediate execution)
|
||
- TP1/TP2: LIMIT reduce-only orders
|
||
- Soft SL: TRIGGER_LIMIT reduce-only
|
||
- Hard SL: TRIGGER_MARKET reduce-only
|
||
|
||
### 6. Database (`lib/database/trades.ts` + `prisma/schema.prisma`)
|
||
**Purpose:** PostgreSQL via Prisma ORM for trade history and analytics
|
||
|
||
**Models:** Trade, PriceUpdate, SystemEvent, DailyStats, BlockedSignal
|
||
|
||
**Singleton pattern:** Use `getPrismaClient()` - never instantiate PrismaClient directly
|
||
|
||
**Key functions:**
|
||
- `createTrade()` - Save trade after execution (includes dual stop TX signatures + signalQualityScore)
|
||
- `updateTradeExit()` - Record exit with P&L
|
||
- `addPriceUpdate()` - Track price movements (called by Position Manager)
|
||
- `getTradeStats()` - Win rate, profit factor, avg win/loss
|
||
- `getLastTrade()` - Fetch most recent trade for analytics dashboard
|
||
- `createBlockedSignal()` - Save blocked signals for data-driven optimization analysis
|
||
- `getRecentBlockedSignals()` - Query recent blocked signals
|
||
- `getBlockedSignalsForAnalysis()` - Fetch signals needing price analysis (future automation)
|
||
|
||
**Important fields:**
|
||
- `signalSource` (String?) - Identifies trade origin: 'tradingview', 'manual', or NULL (old trades)
|
||
- **CRITICAL:** Manual Telegram trades are marked `signalSource='manual'` and excluded from TradingView indicator analysis
|
||
- Use filter: `WHERE ("signalSource" IS NULL OR "signalSource" != 'manual')` for indicator optimization queries
|
||
- See `docs/MANUAL_TRADE_FILTERING.md` for complete SQL filtering guide
|
||
- `signalQualityScore` (Int?) - 0-100 score for data-driven optimization
|
||
- `signalQualityVersion` (String?) - Tracks which scoring logic was used ('v1', 'v2', 'v3', 'v4')
|
||
- v1: Original logic (price position < 5% threshold)
|
||
- v2: Added volume compensation for low ADX (2025-11-07)
|
||
- v3: Stricter breakdown requirements: positions < 15% require (ADX > 18 AND volume > 1.2x) OR (RSI < 35 for shorts / RSI > 60 for longs)
|
||
- v4: CURRENT - Blocked signals tracking enabled for data-driven threshold optimization (2025-11-11)
|
||
- All new trades tagged with current version for comparative analysis
|
||
- `maxFavorableExcursion` / `maxAdverseExcursion` - Track best/worst P&L during trade lifetime
|
||
- `maxFavorablePrice` / `maxAdversePrice` - Track prices at MFE/MAE points
|
||
- `configSnapshot` (Json) - Stores Position Manager state for crash recovery
|
||
- `atr`, `adx`, `rsi`, `volumeRatio`, `pricePosition` - Context metrics from TradingView
|
||
|
||
**BlockedSignal model fields (NEW):**
|
||
- Signal metrics: `atr`, `adx`, `rsi`, `volumeRatio`, `pricePosition`, `timeframe`
|
||
- Quality scoring: `signalQualityScore`, `signalQualityVersion`, `scoreBreakdown` (JSON), `minScoreRequired`
|
||
- Block tracking: `blockReason` (QUALITY_SCORE_TOO_LOW, COOLDOWN_PERIOD, HOURLY_TRADE_LIMIT, etc.), `blockDetails`
|
||
- Future analysis: `priceAfter1/5/15/30Min`, `wouldHitTP1/TP2/SL`, `analysisComplete`
|
||
- Automatically saved by check-risk endpoint when signals are blocked
|
||
- Enables data-driven optimization: collect 10-20 blocked signals → analyze patterns → adjust thresholds
|
||
|
||
**Per-symbol functions:**
|
||
- `getLastTradeTimeForSymbol(symbol)` - Get last trade time for specific coin (enables per-symbol cooldown)
|
||
- Each coin (SOL/ETH/BTC) has independent cooldown timer to avoid missed opportunities
|
||
|
||
## Configuration System
|
||
|
||
**Three-layer merge:**
|
||
1. `DEFAULT_TRADING_CONFIG` (config/trading.ts)
|
||
2. Environment variables (.env) via `getConfigFromEnv()`
|
||
3. Runtime overrides via `getMergedConfig(overrides)`
|
||
|
||
**Always use:** `getMergedConfig()` to get final config - never read env vars directly in business logic
|
||
|
||
**Per-symbol position sizing:** Use `getPositionSizeForSymbol(symbol, config)` which returns `{ size, leverage, enabled }`
|
||
```typescript
|
||
const { size, leverage, enabled } = getPositionSizeForSymbol('SOL-PERP', config)
|
||
if (!enabled) {
|
||
return NextResponse.json({ success: false, error: 'Symbol trading disabled' }, { status: 400 })
|
||
}
|
||
```
|
||
|
||
**Symbol normalization:** TradingView sends "SOLUSDT" → must convert to "SOL-PERP" for Drift
|
||
```typescript
|
||
const driftSymbol = normalizeTradingViewSymbol(body.symbol)
|
||
```
|
||
|
||
## API Endpoints Architecture
|
||
|
||
**Authentication:** All `/api/trading/*` endpoints (except `/test`) require `Authorization: Bearer API_SECRET_KEY`
|
||
|
||
**Pattern:** Each endpoint follows same flow:
|
||
1. Auth check
|
||
2. Get config via `getMergedConfig()`
|
||
3. Initialize Drift service
|
||
4. Check account health
|
||
5. Execute operation
|
||
6. Save to database
|
||
7. Add to Position Manager if applicable
|
||
|
||
**Key endpoints:**
|
||
- `/api/trading/execute` - Main entry point from n8n (production, requires auth), **auto-caches market data**
|
||
- `/api/trading/check-risk` - Pre-execution validation (duplicate check, quality score, **per-symbol cooldown**, rate limits, **symbol enabled check**, **saves blocked signals automatically**)
|
||
- `/api/trading/test` - Test trades from settings UI (no auth required, **respects symbol enable/disable**)
|
||
- `/api/trading/close` - Manual position closing (requires symbol normalization)
|
||
- `/api/trading/sync-positions` - **Force Position Manager sync with Drift** (POST, requires auth) - restores tracking for orphaned positions
|
||
- `/api/trading/cancel-orders` - **Manual order cleanup** (for stuck/ghost orders after rate limit failures)
|
||
- `/api/trading/positions` - Query open positions from Drift
|
||
- `/api/trading/market-data` - Webhook for TradingView market data updates (GET for debug, POST for data)
|
||
- `/api/settings` - Get/update config (writes to .env file, **includes per-symbol settings**)
|
||
- `/api/analytics/last-trade` - Fetch most recent trade details for dashboard (includes quality score)
|
||
- `/api/analytics/reentry-check` - **Validate manual re-entry** with fresh TradingView data + recent performance
|
||
- `/api/analytics/version-comparison` - Compare performance across signal quality logic versions (v1/v2/v3/v4)
|
||
- `/api/restart` - Create restart flag for watch-restart.sh script
|
||
|
||
## Critical Workflows
|
||
|
||
### Execute Trade (Production)
|
||
```
|
||
TradingView alert → n8n Parse Signal Enhanced (extracts metrics + timeframe)
|
||
↓ /api/trading/check-risk [validates quality score ≥60, checks duplicates, per-symbol cooldown]
|
||
↓ /api/trading/execute
|
||
↓ normalize symbol (SOLUSDT → SOL-PERP)
|
||
↓ getMergedConfig()
|
||
↓ getPositionSizeForSymbol() [check if symbol enabled + get sizing]
|
||
↓ openPosition() [MARKET order]
|
||
↓ calculate dual stop prices if enabled
|
||
↓ placeExitOrders() [on-chain TP1/TP2/SL orders]
|
||
↓ scoreSignalQuality({ ..., timeframe }) [compute 0-100 score with timeframe-aware thresholds]
|
||
↓ createTrade() [CRITICAL: save to database FIRST - see Common Pitfall #27]
|
||
↓ positionManager.addTrade() [ONLY after DB save succeeds - prevents unprotected positions]
|
||
```
|
||
|
||
**CRITICAL EXECUTION ORDER (Nov 13, 2025 Fix):**
|
||
The order of database save → Position Manager add is NOT arbitrary - it's a safety requirement:
|
||
- If database save fails, API returns HTTP 500 with critical warning
|
||
- User sees: "CLOSE POSITION MANUALLY IMMEDIATELY" with transaction signature
|
||
- Position Manager only tracks database-persisted trades
|
||
- Container restarts can restore all positions from database
|
||
- **Never add to Position Manager before database save** - creates unprotected positions
|
||
|
||
### Position Monitoring Loop
|
||
```
|
||
Position Manager every 2s:
|
||
↓ Verify on-chain position still exists (detect external closures)
|
||
↓ getPythPriceMonitor().getLatestPrice()
|
||
↓ Calculate current P&L and update MAE/MFE metrics
|
||
↓ Check emergency stop (-2%) → closePosition(100%)
|
||
↓ Check SL hit → closePosition(100%)
|
||
↓ Check TP1 hit → closePosition(75%), cancelAllOrders(), placeExitOrders() with SL at breakeven
|
||
↓ Check profit lock trigger (+1.2%) → move SL to +configured%
|
||
↓ Check TP2 hit → closePosition(80% of remaining), activate runner
|
||
↓ Check trailing stop (if runner active) → adjust SL dynamically based on peakPrice
|
||
↓ addPriceUpdate() [save to database every N checks]
|
||
↓ saveTradeState() [persist Position Manager state + MAE/MFE for crash recovery]
|
||
```
|
||
|
||
### Settings Update
|
||
```
|
||
Web UI → /api/settings POST
|
||
↓ Validate new settings
|
||
↓ Write to .env file using string replacement
|
||
↓ Return success
|
||
↓ User clicks "Restart Bot" → /api/restart
|
||
↓ Creates /tmp/trading-bot-restart.flag
|
||
↓ watch-restart.sh detects flag
|
||
↓ Executes: docker restart trading-bot-v4
|
||
```
|
||
|
||
## Docker Context
|
||
|
||
**Multi-stage build:** deps → builder → runner (Node 20 Alpine)
|
||
|
||
**Critical Dockerfile steps:**
|
||
1. Install deps with `npm install --production`
|
||
2. Copy source and `npx prisma generate` (MUST happen before build)
|
||
3. `npm run build` (Next.js standalone output)
|
||
4. Runner stage copies standalone + static + node_modules + Prisma client
|
||
|
||
**Container networking:**
|
||
- External: `trading-bot-v4` on port 3001
|
||
- Internal: Next.js on port 3000
|
||
- Database: `trading-bot-postgres` on 172.28.0.0/16 network
|
||
|
||
**DATABASE_URL caveat:** Use `trading-bot-postgres` (container name) in .env for runtime, but `localhost:5432` for Prisma CLI migrations from host
|
||
|
||
## Project-Specific Patterns
|
||
|
||
### 1. Singleton Services
|
||
Never create multiple instances - always use getter functions:
|
||
```typescript
|
||
const driftService = await initializeDriftService() // NOT: new DriftService()
|
||
const positionManager = getPositionManager() // NOT: new PositionManager()
|
||
const prisma = getPrismaClient() // NOT: new PrismaClient()
|
||
```
|
||
|
||
### 2. Price Calculations
|
||
Direction matters for long vs short:
|
||
```typescript
|
||
function calculatePrice(entry: number, percent: number, direction: 'long' | 'short') {
|
||
if (direction === 'long') {
|
||
return entry * (1 + percent / 100) // Long: +1% = higher price
|
||
} else {
|
||
return entry * (1 - percent / 100) // Short: +1% = lower price
|
||
}
|
||
}
|
||
```
|
||
|
||
### 3. Error Handling
|
||
Database failures should not fail trades - always wrap in try/catch:
|
||
```typescript
|
||
try {
|
||
await createTrade(params)
|
||
console.log('💾 Trade saved to database')
|
||
} catch (dbError) {
|
||
console.error('❌ Failed to save trade:', dbError)
|
||
// Don't fail the trade if database save fails
|
||
}
|
||
```
|
||
|
||
### 4. Reduce-Only Orders
|
||
All exit orders MUST be reduce-only (can only close, not open positions):
|
||
```typescript
|
||
const orderParams = {
|
||
reduceOnly: true, // CRITICAL for TP/SL orders
|
||
// ... other params
|
||
}
|
||
```
|
||
|
||
### 5. Nextcloud Deck Roadmap Sync
|
||
**Purpose:** Visual kanban board for tracking optimization roadmap progress
|
||
|
||
**Key Components:**
|
||
- `scripts/discover-deck-ids.sh` - Find Nextcloud Deck board/stack IDs
|
||
- `scripts/sync-roadmap-to-deck.py` - Sync roadmap files to Deck cards
|
||
- `docs/NEXTCLOUD_DECK_SYNC.md` - Complete documentation
|
||
|
||
**Workflow:**
|
||
```bash
|
||
# One-time setup (already done)
|
||
bash scripts/discover-deck-ids.sh # Creates /tmp/deck-config.json
|
||
|
||
# Sync roadmap to Deck (creates/updates cards)
|
||
python3 scripts/sync-roadmap-to-deck.py --init
|
||
|
||
# Always dry-run first to preview changes
|
||
python3 scripts/sync-roadmap-to-deck.py --init --dry-run
|
||
```
|
||
|
||
**Stack Mapping:**
|
||
- 📥 **Backlog:** Future phases, ideas, ML work (status: FUTURE)
|
||
- 📋 **Planning:** Next phases, ready to implement (status: PENDING, NEXT)
|
||
- 🚀 **In Progress:** Currently active work (status: CURRENT, IN PROGRESS, DEPLOYED)
|
||
- ✅ **Complete:** Finished phases (status: COMPLETE)
|
||
|
||
**Card Structure:**
|
||
- 3 high-level initiative cards (from `OPTIMIZATION_MASTER_ROADMAP.md`)
|
||
- 18 detailed phase cards (from individual roadmap files)
|
||
- Total: 21 cards tracking all optimization work
|
||
|
||
**When to Sync:**
|
||
- After completing a phase (update markdown status → re-sync)
|
||
- When starting new phase (move card in Deck UI)
|
||
- Weekly during active development to keep visual state current
|
||
|
||
**Important Notes:**
|
||
- API doesn't support duplicate detection - always use `--dry-run` first
|
||
- Manual card deletion required (API returns 405 on DELETE)
|
||
- Code blocks auto-removed from descriptions (prevent API errors)
|
||
- Card titles cleaned (no markdown, emojis removed for readability)
|
||
|
||
## Testing Commands
|
||
|
||
```bash
|
||
# Local development
|
||
npm run dev
|
||
|
||
# Build production
|
||
npm run build && npm start
|
||
|
||
# Docker build and restart
|
||
docker compose build trading-bot
|
||
docker compose up -d --force-recreate trading-bot
|
||
docker logs -f trading-bot-v4
|
||
|
||
# Database operations
|
||
npx prisma generate # Generate client
|
||
DATABASE_URL="postgresql://...@localhost:5432/..." npx prisma migrate dev
|
||
docker exec trading-bot-postgres psql -U postgres -d trading_bot_v4 -c "\dt"
|
||
|
||
# Test trade from UI
|
||
# Go to http://localhost:3001/settings
|
||
# Click "Test LONG" or "Test SHORT"
|
||
```
|
||
|
||
## SQL Analysis Queries
|
||
|
||
Essential queries for monitoring signal quality and blocked signals. Run via:
|
||
```bash
|
||
docker exec trading-bot-postgres psql -U postgres -d trading_bot_v4 -c "YOUR_QUERY"
|
||
```
|
||
|
||
### Phase 1: Monitor Data Collection Progress
|
||
```sql
|
||
-- Check blocked signals count (target: 10-20 for Phase 2)
|
||
SELECT COUNT(*) as total_blocked FROM "BlockedSignal";
|
||
|
||
-- Score distribution of blocked signals
|
||
SELECT
|
||
CASE
|
||
WHEN signalQualityScore >= 60 THEN '60-64 (Close Call)'
|
||
WHEN signalQualityScore >= 55 THEN '55-59 (Marginal)'
|
||
WHEN signalQualityScore >= 50 THEN '50-54 (Weak)'
|
||
ELSE '0-49 (Very Weak)'
|
||
END as tier,
|
||
COUNT(*) as count,
|
||
ROUND(AVG(signalQualityScore)::numeric, 1) as avg_score
|
||
FROM "BlockedSignal"
|
||
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
|
||
GROUP BY tier
|
||
ORDER BY MIN(signalQualityScore) DESC;
|
||
|
||
-- Recent blocked signals with full details
|
||
SELECT
|
||
symbol,
|
||
direction,
|
||
signalQualityScore as score,
|
||
ROUND(adx::numeric, 1) as adx,
|
||
ROUND(atr::numeric, 2) as atr,
|
||
ROUND(pricePosition::numeric, 1) as pos,
|
||
ROUND(volumeRatio::numeric, 2) as vol,
|
||
blockReason,
|
||
TO_CHAR(createdAt, 'MM-DD HH24:MI') as time
|
||
FROM "BlockedSignal"
|
||
ORDER BY createdAt DESC
|
||
LIMIT 10;
|
||
```
|
||
|
||
### Phase 2: Compare Blocked vs Executed Trades
|
||
```sql
|
||
-- Compare executed trades in 60-69 score range
|
||
SELECT
|
||
signalQualityScore as score,
|
||
COUNT(*) as trades,
|
||
ROUND(AVG(realizedPnL)::numeric, 2) as avg_pnl,
|
||
ROUND(SUM(realizedPnL)::numeric, 2) as total_pnl,
|
||
ROUND(100.0 * SUM(CASE WHEN realizedPnL > 0 THEN 1 ELSE 0 END) / COUNT(*)::numeric, 1) as win_rate
|
||
FROM "Trade"
|
||
WHERE exitReason IS NOT NULL
|
||
AND signalQualityScore BETWEEN 60 AND 69
|
||
GROUP BY signalQualityScore
|
||
ORDER BY signalQualityScore;
|
||
|
||
-- Block reason breakdown
|
||
SELECT
|
||
blockReason,
|
||
COUNT(*) as count,
|
||
ROUND(AVG(signalQualityScore)::numeric, 1) as avg_score
|
||
FROM "BlockedSignal"
|
||
GROUP BY blockReason
|
||
ORDER BY count DESC;
|
||
```
|
||
|
||
### Analyze Specific Patterns
|
||
```sql
|
||
-- Blocked signals at range extremes (price position)
|
||
SELECT
|
||
direction,
|
||
signalQualityScore as score,
|
||
ROUND(pricePosition::numeric, 1) as pos,
|
||
ROUND(adx::numeric, 1) as adx,
|
||
ROUND(volumeRatio::numeric, 2) as vol,
|
||
symbol,
|
||
TO_CHAR(createdAt, 'MM-DD HH24:MI') as time
|
||
FROM "BlockedSignal"
|
||
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
|
||
AND (pricePosition < 10 OR pricePosition > 90)
|
||
ORDER BY signalQualityScore DESC;
|
||
|
||
-- ADX distribution in blocked signals
|
||
SELECT
|
||
CASE
|
||
WHEN adx >= 25 THEN 'Strong (25+)'
|
||
WHEN adx >= 20 THEN 'Moderate (20-25)'
|
||
WHEN adx >= 15 THEN 'Weak (15-20)'
|
||
ELSE 'Very Weak (<15)'
|
||
END as adx_tier,
|
||
COUNT(*) as count,
|
||
ROUND(AVG(signalQualityScore)::numeric, 1) as avg_score
|
||
FROM "BlockedSignal"
|
||
WHERE blockReason = 'QUALITY_SCORE_TOO_LOW'
|
||
AND adx IS NOT NULL
|
||
GROUP BY adx_tier
|
||
ORDER BY MIN(adx) DESC;
|
||
```
|
||
|
||
**Usage Pattern:**
|
||
1. Run "Monitor Data Collection" queries weekly during Phase 1
|
||
2. Once 10+ blocked signals collected, run "Compare Blocked vs Executed" queries
|
||
3. Use "Analyze Specific Patterns" to identify optimization opportunities
|
||
4. Full query reference: `BLOCKED_SIGNALS_TRACKING.md`
|
||
|
||
## Common Pitfalls
|
||
|
||
1. **WRONG RPC PROVIDER (CRITICAL - CATASTROPHIC SYSTEM FAILURE):**
|
||
- **FINAL CONCLUSION Nov 14, 2025 (INVESTIGATION COMPLETE):** Helius is the ONLY reliable RPC provider for Drift SDK
|
||
- **Root Cause CONFIRMED:** Alchemy's rate limiting breaks Drift SDK's burst subscription pattern during initialization
|
||
- **Definitive Proof (Nov 14, 21:14 CET):**
|
||
* Created diagnostic endpoint `/api/testing/drift-init`
|
||
* Alchemy: 17-71 subscription errors EVERY init (49 avg over 5 runs), 1644ms avg init time
|
||
* Helius: 0 subscription errors EVERY init, 800ms avg init time
|
||
* See `docs/ALCHEMY_RPC_INVESTIGATION_RESULTS.md` for full test data
|
||
|
||
- **Why Alchemy Fails:**
|
||
* Drift SDK subscribes to 30-50+ accounts simultaneously during init (burst pattern)
|
||
* Alchemy's CUPS enforcement rate limits these burst requests
|
||
* Drift SDK does NOT retry failed subscriptions
|
||
* SDK reports "initialized successfully" but with incomplete subscription set
|
||
* Subsequent operations fail/timeout due to missing account data
|
||
* Error message: "Received JSON-RPC error calling `accountSubscribe`"
|
||
|
||
- **Why "Breakthrough" at 14:25 Wasn't Real:**
|
||
* First Alchemy test had 17-71 subscription errors (random variation)
|
||
* Sometimes gets lucky with "just enough" subscriptions for one operation
|
||
* SDK in degraded state from the start, just not obvious until second operation
|
||
* This explains why first trade "worked" but subsequent trades failed
|
||
|
||
- **Why Helius Works:**
|
||
* Higher burst tolerance for Solana dApp subscription patterns
|
||
* Zero subscription errors during init
|
||
* Faster initialization (800ms vs 1600ms)
|
||
* Stable for continuous operations
|
||
|
||
- **Technical Reality vs Documentation:**
|
||
* Alchemy DOES support WebSocket subscriptions (research confirmed)
|
||
* Alchemy DOES support accountSubscribe method (not -32601 error)
|
||
* BUT: Rate limit enforcement model incompatible with Drift's burst pattern
|
||
* Documentation doesn't mention burst subscription limits
|
||
|
||
- **Production Status:**
|
||
* Using: Helius RPC (https://mainnet.helius-rpc.com/?api-key=...)
|
||
* Retry logic: 5s exponential backoff for rate limits
|
||
* System: Stable, TP1/TP2/SL working, Position Manager tracking correctly
|
||
|
||
- **Investigation Closed:** This is DEFINITIVE. Use Helius. Do not use Alchemy.
|
||
- **Test Yourself:** `curl 'http://localhost:3001/api/testing/drift-init?rpc=alchemy'`
|
||
|
||
2. **Prisma not generated in Docker:** Must run `npx prisma generate` in Dockerfile BEFORE `npm run build`
|
||
|
||
3. **Wrong DATABASE_URL:** Container runtime needs `trading-bot-postgres`, Prisma CLI from host needs `localhost:5432`
|
||
|
||
4. **Symbol format mismatch:** Always normalize with `normalizeTradingViewSymbol()` before calling Drift (applies to ALL endpoints including `/api/trading/close`)
|
||
|
||
5. **Missing reduce-only flag:** Exit orders without `reduceOnly: true` can accidentally open new positions
|
||
|
||
6. **Singleton violations:** Creating multiple DriftClient or Position Manager instances causes connection/state issues
|
||
|
||
7. **Type errors with Prisma:** The Trade type from Prisma is only available AFTER `npx prisma generate` - use explicit types or `// @ts-ignore` carefully
|
||
|
||
8. **Quality score duplication:** Signal quality calculation exists in BOTH `check-risk` and `execute` endpoints - keep logic synchronized
|
||
|
||
9. **TP2-as-Runner configuration:**
|
||
- `takeProfit2SizePercent: 0` means "TP2 activates trailing stop, no position close"
|
||
- This creates runner of remaining % after TP1 (default 25%, configurable via TAKE_PROFIT_1_SIZE_PERCENT)
|
||
- `TAKE_PROFIT_2_PERCENT=0.7` sets TP2 trigger price, `TAKE_PROFIT_2_SIZE_PERCENT` should be 0
|
||
- Settings UI correctly shows "TP2 activates trailing stop" with dynamic runner % calculation
|
||
|
||
9. **P&L calculation CRITICAL:** Use actual entry vs exit price calculation, not SDK values:
|
||
```typescript
|
||
const profitPercent = this.calculateProfitPercent(trade.entryPrice, exitPrice, trade.direction)
|
||
const actualRealizedPnL = (closedSizeUSD * profitPercent) / 100
|
||
trade.realizedPnL += actualRealizedPnL // NOT: result.realizedPnL from SDK
|
||
```
|
||
|
||
10. **Transaction confirmation CRITICAL:** Both `openPosition()` AND `closePosition()` MUST call `connection.confirmTransaction()` after `placePerpOrder()`. Without this, the SDK returns transaction signatures that aren't confirmed on-chain, causing "phantom trades" or "phantom closes". Always check `confirmation.value.err` before proceeding.
|
||
|
||
11. **Execution order matters:** When creating trades via API endpoints, the order MUST be:
|
||
1. Open position + place exit orders
|
||
2. Save to database (`createTrade()`)
|
||
3. Add to Position Manager (`positionManager.addTrade()`)
|
||
|
||
If Position Manager is added before database save, race conditions occur where monitoring checks before the trade exists in DB.
|
||
|
||
12. **New trade grace period:** Position Manager skips "external closure" detection for trades <30 seconds old because Drift positions take 5-10 seconds to propagate after opening. Without this grace period, new positions are immediately detected as "closed externally" and cancelled.
|
||
|
||
13. **Drift minimum position sizes:** Actual minimums differ from documentation:
|
||
- SOL-PERP: 0.1 SOL (~$5-15 depending on price)
|
||
- ETH-PERP: 0.01 ETH (~$38-40 at $4000/ETH)
|
||
- BTC-PERP: 0.0001 BTC (~$10-12 at $100k/BTC)
|
||
|
||
Always calculate: `minOrderSize × currentPrice` must exceed Drift's $4 minimum. Add buffer for price movement.
|
||
|
||
14. **Exit reason detection bug:** Position Manager was using current price to determine exit reason, but on-chain orders filled at a DIFFERENT price in the past. Now uses `trade.tp1Hit` / `trade.tp2Hit` flags and realized P&L to correctly identify whether TP1, TP2, or SL triggered. Prevents profitable trades being mislabeled as "SL" exits.
|
||
|
||
15. **Per-symbol cooldown:** Cooldown period is per-symbol, NOT global. ETH trade at 10:00 does NOT block SOL trade at 10:01. Each coin (SOL/ETH/BTC) has independent cooldown timer to avoid missing opportunities on different assets.
|
||
|
||
16. **Timeframe-aware scoring crucial:** Signal quality thresholds MUST adjust for 5min vs higher timeframes:
|
||
- 5min charts naturally have lower ADX (12-22 healthy) and ATR (0.2-0.7% healthy) than daily charts
|
||
- Without timeframe awareness, valid 5min breakouts get blocked as "low quality"
|
||
- Anti-chop filter applies -20 points for extreme sideways regardless of timeframe
|
||
- Always pass `timeframe` parameter from TradingView alerts to `scoreSignalQuality()`
|
||
|
||
17. **Price position chasing causes flip-flops:** Opening longs at 90%+ range or shorts at <10% range reliably loses money:
|
||
- Database analysis showed overnight flip-flop losses all had price position 9-94% (chasing extremes)
|
||
- These trades had valid ADX (16-18) but entered at worst possible time
|
||
- Quality scoring now penalizes -15 to -30 points for range extremes
|
||
- Prevents rapid reversals when price is already overextended
|
||
|
||
18. **TradingView ADX minimum for 5min:** Set ADX filter to 15 (not 20+) in TradingView alerts for 5min charts:
|
||
- Higher timeframes can use ADX 20+ for strong trends
|
||
- 5min charts need lower threshold to catch valid breakouts
|
||
- Bot's quality scoring provides second-layer filtering with context-aware metrics
|
||
- Two-stage filtering (TradingView + bot) prevents both overtrading and missing valid signals
|
||
|
||
19. **Prisma Decimal type handling:** Raw SQL queries return Prisma `Decimal` objects, not plain numbers:
|
||
- Use `any` type for numeric fields in `$queryRaw` results: `total_pnl: any`
|
||
- Convert with `Number()` before returning to frontend: `totalPnL: Number(stat.total_pnl) || 0`
|
||
- Frontend uses `.toFixed()` which doesn't exist on Decimal objects
|
||
- Applies to all aggregations: SUM(), AVG(), ROUND() - all return Decimal types
|
||
- Example: `/api/analytics/version-comparison` converts all numeric fields
|
||
|
||
20. **ATR-based trailing stop implementation (Nov 11, 2025):** Runner system was using FIXED 0.3% trailing, causing immediate stops:
|
||
- **Problem:** At $168 SOL, 0.3% = $0.50 wiggle room. Trades with +7-9% MFE exited for losses.
|
||
- **Fix:** `trailingDistancePercent = (atrAtEntry / currentPrice * 100) × trailingStopAtrMultiplier`
|
||
- **Config:** `TRAILING_STOP_ATR_MULTIPLIER=1.5`, `MIN=0.25%`, `MAX=0.9%`, `ACTIVATION=0.5%`
|
||
- **Typical improvement:** 0.45% ATR × 1.5 = 0.675% trail ($1.13 vs $0.50 = 2.26x more room)
|
||
- **Fallback:** If `atrAtEntry` unavailable, uses clamped legacy `trailingStopPercent`
|
||
- **Log verification:** Look for "📊 ATR-based trailing: 0.0045 (0.52%) × 1.5x = 0.78%" messages
|
||
- **ActiveTrade interface:** Must include `atrAtEntry?: number` field for calculation
|
||
- See `ATR_TRAILING_STOP_FIX.md` for full details and database analysis
|
||
|
||
21. **CreateTradeParams interface sync:** When adding new database fields to Trade model, MUST update `CreateTradeParams` interface in `lib/database/trades.ts`:
|
||
- Interface defines what parameters `createTrade()` accepts
|
||
- Must add new field to interface (e.g., `indicatorVersion?: string`)
|
||
- Must add field to Prisma create data object in `createTrade()` function
|
||
- TypeScript build will fail if endpoint passes field not in interface
|
||
- Example: indicatorVersion tracking required 3-file update (execute route.ts, CreateTradeParams interface, createTrade function)
|
||
|
||
22. **Position.size tokens vs USD bug (CRITICAL - Fixed Nov 12, 2025):**
|
||
- **Symptom:** Position Manager detects false TP1 hits, moves SL to breakeven prematurely
|
||
- **Root Cause:** `lib/drift/client.ts` returns `position.size` as BASE ASSET TOKENS (12.28 SOL), not USD ($1,950)
|
||
- **Bug:** Comparing tokens (12.28) directly to USD ($1,950) → 12.28 < 1,950 × 0.95 = "99.4% reduction" → FALSE TP1!
|
||
- **Fix:** Always convert to USD before comparisons:
|
||
```typescript
|
||
// In Position Manager (lines 322, 519, 558, 591)
|
||
const positionSizeUSD = Math.abs(position.size) * currentPrice
|
||
|
||
// Now compare USD to USD
|
||
if (positionSizeUSD < trade.currentSize * 0.95) {
|
||
// Actual 5%+ reduction detected
|
||
}
|
||
```
|
||
- **Impact:** Without this fix, TP1 never triggers correctly, SL moves at wrong times, runner system fails
|
||
- **Where it matters:** Position Manager, any code querying Drift positions
|
||
- **Database evidence:** Trade showed `tp1Hit: true` when 100% still open, `slMovedToBreakeven: true` prematurely
|
||
|
||
23. **Leverage display showing global config instead of symbol-specific (Fixed Nov 12, 2025):**
|
||
- **Symptom:** Telegram notifications showing "⚡ Leverage: 10x" when actual position uses 15x or 20x
|
||
- **Root Cause:** API response returning `config.leverage` (global default) instead of symbol-specific value
|
||
- **Fix:** Use actual leverage from `getPositionSizeForSymbol()`:
|
||
```typescript
|
||
// app/api/trading/execute/route.ts (lines 345, 448, 522, 557)
|
||
const { size, leverage, enabled } = getPositionSizeForSymbol(driftSymbol, config)
|
||
|
||
// Return symbol-specific leverage
|
||
leverage: leverage, // NOT: config.leverage
|
||
```
|
||
- **Impact:** Misleading notifications, user confusion about actual position risk
|
||
- **Hierarchy:** Per-symbol ENV (SOLANA_LEVERAGE) → Market config → Global ENV (LEVERAGE) → Defaults
|
||
|
||
24. **Indicator version tracking (Nov 12, 2025+):**
|
||
- Database field `indicatorVersion` tracks which TradingView strategy generated the signal
|
||
- **v5:** Buy/Sell Signal strategy (pre-Nov 12)
|
||
- **v6:** HalfTrend + BarColor strategy (Nov 12+)
|
||
- Used for performance comparison between strategies
|
||
|
||
26. **External closure duplicate updates bug (CRITICAL - Fixed Nov 12, 2025):**
|
||
- **Symptom:** Trades showing 7-8x larger losses than actual ($58 loss when Drift shows $7 loss)
|
||
- **Root Cause:** Position Manager monitoring loop re-processes external closures multiple times before trade removed from activeTrades Map
|
||
- **Bug sequence:**
|
||
1. Trade closed externally (on-chain SL order fills at -$7.98)
|
||
2. Position Manager detects closure: `position === null`
|
||
3. Calculates P&L and calls `updateTradeExit()` → -$7.50 in DB
|
||
4. **BUT:** Trade still in `activeTrades` Map (removal happens after DB update)
|
||
5. Next monitoring loop (2s later) detects closure AGAIN
|
||
6. Accumulates P&L: `previouslyRealized (-$7.50) + runnerRealized (-$7.50) = -$15.00`
|
||
7. Updates database AGAIN → -$15.00 in DB
|
||
8. Repeats 8 times → final -$58.43 (8× the actual loss)
|
||
- **Fix:** Remove trade from `activeTrades` Map BEFORE database update:
|
||
```typescript
|
||
// BEFORE (BROKEN):
|
||
await updateTradeExit({ ... })
|
||
await this.removeTrade(trade.id) // Too late! Loop already ran again
|
||
|
||
// AFTER (FIXED):
|
||
this.activeTrades.delete(trade.id) // Remove FIRST
|
||
await updateTradeExit({ ... }) // Then update DB
|
||
if (this.activeTrades.size === 0) {
|
||
this.stopMonitoring()
|
||
}
|
||
```
|
||
- **Impact:** Without this fix, every external closure is recorded 5-8 times with compounding P&L
|
||
- **Root cause:** Async timing issue - `removeTrade()` is async but monitoring loop continues synchronously
|
||
- **Evidence:** Logs showed 8 consecutive "External closure recorded" messages with increasing P&L
|
||
- **Line:** `lib/trading/position-manager.ts` line 493 (external closure detection block)
|
||
- Must update `CreateTradeParams` interface when adding new database fields (see pitfall #21)
|
||
- Analytics endpoint `/api/analytics/version-comparison` compares v5 vs v6 performance
|
||
|
||
25. **Signal quality threshold adjustment (Nov 12, 2025):**
|
||
- **Lowered from 65 → 60** based on data analysis of 161 trades
|
||
- **Reason:** Score 60-64 tier outperformed higher scores:
|
||
- 60-64: 2 trades, +$45.78 total, 100% WR, +$22.89 avg
|
||
- 65-69: 13 trades, +$28.28 total, 53.8% WR, +$2.18 avg
|
||
- 70-79: 67 trades, +$8.28 total, 44.8% WR (worst performance!)
|
||
- **Paradox:** Higher quality scores don't correlate with better performance in current data
|
||
- **Expected impact:** 2-3 additional trades/week, +$46-69 weekly profit potential
|
||
- **Data collection:** Enables blocked signals at 55-59 range for Phase 2 optimization
|
||
- **Risk:** Small sample size (2 trades) could be outliers, but downside limited
|
||
- SQL analysis showed clear pattern: stricter filtering was blocking profitable setups
|
||
|
||
27. **Database-First Pattern (CRITICAL - Fixed Nov 13, 2025):**
|
||
- **Symptom:** Positions opened on Drift with NO database record, NO Position Manager tracking, NO TP/SL protection
|
||
- **Root Cause:** Execute endpoint saved to database AFTER adding to Position Manager, with silent error catch
|
||
- **Bug sequence:**
|
||
1. TradingView signal → `/api/trading/execute`
|
||
2. Position opened on Drift ✅
|
||
3. Position Manager tracking added ✅
|
||
4. Database save attempted ❌ (fails silently)
|
||
5. API returns success to user ❌
|
||
6. Container restarts → Position Manager loses in-memory state ❌
|
||
7. Result: Unprotected position with no monitoring or TP/SL orders
|
||
- **Fix:** Database-first execution order in `app/api/trading/execute/route.ts`:
|
||
```typescript
|
||
// CRITICAL: Save to database FIRST before adding to Position Manager
|
||
try {
|
||
await createTrade({...})
|
||
} catch (dbError) {
|
||
console.error('❌ CRITICAL: Failed to save trade to database:', dbError)
|
||
return NextResponse.json({
|
||
success: false,
|
||
error: 'Database save failed - position unprotected',
|
||
message: `Position opened on Drift but database save failed. CLOSE POSITION MANUALLY IMMEDIATELY. Transaction: ${openResult.transactionSignature}`,
|
||
}, { status: 500 })
|
||
}
|
||
|
||
// ONLY add to Position Manager if database save succeeded
|
||
await positionManager.addTrade(activeTrade)
|
||
```
|
||
- **Impact:** Without this fix, ANY database failure creates unprotected positions
|
||
- **Verification:** Test trade cmhxj8qxl0000od076m21l58z (Nov 13) confirmed fix working
|
||
- **Documentation:** See `CRITICAL_INCIDENT_UNPROTECTED_POSITION.md` for full incident report
|
||
- **Rule:** Database persistence ALWAYS comes before in-memory state updates
|
||
|
||
28. **DNS retry logic (Nov 13, 2025):**
|
||
- **Problem:** Trading bot fails with "fetch failed" errors when DNS resolution temporarily fails for `mainnet.helius-rpc.com`
|
||
- **Impact:** n8n workflow failures, missed trades, container restart failures
|
||
- **Root Cause:** `EAI_AGAIN` errors are transient DNS issues that resolve in seconds, but bot treated them as permanent failures
|
||
- **Fix:** Automatic retry in `lib/drift/client.ts` - `retryOperation()` wrapper:
|
||
```typescript
|
||
// Detects transient errors: fetch failed, EAI_AGAIN, ENOTFOUND, ETIMEDOUT
|
||
// Retries up to 3 times with 2s delay between attempts (DNS-specific, separate from rate limit retries)
|
||
// Fails fast on non-transient errors (auth, config, permanent network issues)
|
||
await this.retryOperation(async () => {
|
||
// Initialize Drift SDK, subscribe, get user account
|
||
}, 3, 2000, 'Drift initialization')
|
||
```
|
||
- **Success logs:** `⚠️ Drift initialization failed (attempt 1/3): fetch failed` → `⏳ Retrying in 2000ms...` → `✅ Drift service initialized successfully`
|
||
- **Impact:** 99% of transient DNS failures now auto-recover, preventing missed trades
|
||
- **Note:** DNS retries use 2s delays (fast recovery), rate limit retries use 5s delays (RPC cooldown)
|
||
- **Documentation:** See `docs/DNS_RETRY_LOGIC.md` for monitoring queries and metrics
|
||
|
||
29. **Declaring fixes "working" before deployment (CRITICAL - Nov 13, 2025):**
|
||
- **Symptom:** AI says "position is protected" or "fix is deployed" when container still running old code
|
||
- **Root Cause:** Conflating "code committed to git" with "code running in production"
|
||
- **Real Incident:** Database-first fix committed 15:56, declared "working" at 19:42, but container started 15:06 (old code)
|
||
- **Result:** Unprotected position opened, database save failed silently, Position Manager never tracked it
|
||
- **Financial Impact:** User discovered $250+ unprotected position 3.5 hours after opening
|
||
- **Verification Required:**
|
||
```bash
|
||
# ALWAYS check before declaring fix deployed:
|
||
docker logs trading-bot-v4 | grep "Server starting" | head -1
|
||
# Compare container start time to git commit timestamp
|
||
# If container older: FIX NOT DEPLOYED
|
||
```
|
||
- **Rule:** NEVER say "fixed", "working", "protected", or "deployed" without verifying container restart timestamp
|
||
- **Impact:** This is a REAL MONEY system - premature declarations cause financial losses
|
||
- **Documentation:** Added mandatory deployment verification to VERIFICATION MANDATE section
|
||
|
||
30. **Phantom trade notification workflow breaks (Nov 14, 2025):**
|
||
- **Symptom:** Phantom trade detected, position opened on Drift, but n8n workflow stops with HTTP 500 error. User NOT notified.
|
||
- **Root Cause:** Execute endpoint returned HTTP 500 when phantom detected, causing n8n chain to halt before Telegram notification
|
||
- **Problem:** Unmonitored phantom position on exchange while user is asleep/away = unlimited risk exposure
|
||
- **Fix:** Auto-close phantom trades immediately + return HTTP 200 with warning (allows n8n to continue)
|
||
```typescript
|
||
// When phantom detected in app/api/trading/execute/route.ts:
|
||
// 1. Immediately close position via closePosition()
|
||
// 2. Save to database (create trade + update with exit info)
|
||
// 3. Return HTTP 200 with full notification message in response
|
||
// 4. n8n workflow continues to Telegram notification step
|
||
```
|
||
- **Response format change:** `{ success: true, warning: 'Phantom trade detected and auto-closed', isPhantom: true, message: '[Full notification text]', phantomDetails: {...} }`
|
||
- **Why auto-close:** User can't always respond (sleeping, no phone, traveling). Better to exit with small loss/gain than leave unmonitored position exposed.
|
||
- **Impact:** Protects user from unlimited risk during unavailable hours. Phantom trades are rare edge cases (oracle issues, exchange rejections).
|
||
- **Database tracking:** `status='phantom'`, `exitReason='manual'`, enables analysis of phantom frequency and patterns
|
||
|
||
31. **Flip-flop price context using wrong data (CRITICAL - Fixed Nov 14, 2025):**
|
||
- **Symptom:** Flip-flop detection showing "100% price move" when actual movement was 0.2%, allowing trades that should be blocked
|
||
- **Root Cause:** `currentPrice` parameter not available in check-risk endpoint (trade hasn't opened yet), so calculation used undefined/zero
|
||
- **Real incident:** Nov 14, 06:05 CET - SHORT allowed with 0.2% flip-flop, lost -$1.56 in 5 minutes
|
||
- **Bug sequence:**
|
||
1. LONG opened at $143.86 (06:00)
|
||
2. SHORT signal 4min later at $143.58 (0.2% move)
|
||
3. Flip-flop check: `(undefined - 143.86) / 143.86 * 100` = garbage → showed "100%"
|
||
4. System thought it was reversal → allowed trade
|
||
5. Should have been blocked as tight-range chop
|
||
- **Fix:** Two-part fix in commits 77a9437 and 795026a:
|
||
```typescript
|
||
// In app/api/trading/check-risk/route.ts:
|
||
// Get current price from Pyth BEFORE quality scoring
|
||
const priceMonitor = getPythPriceMonitor()
|
||
const latestPrice = priceMonitor.getCachedPrice(body.symbol)
|
||
const currentPrice = latestPrice?.price || body.currentPrice
|
||
|
||
// In lib/trading/signal-quality.ts:
|
||
// Validate price data exists before calculation
|
||
if (!params.currentPrice || params.currentPrice === 0) {
|
||
// No current price available - apply penalty (conservative)
|
||
console.warn(`⚠️ Flip-flop check: No currentPrice available, applying penalty`)
|
||
frequencyPenalties.flipFlop = -25
|
||
score -= 25
|
||
} else {
|
||
const priceChangePercent = Math.abs(
|
||
(params.currentPrice - recentSignals.oppositeDirectionPrice) /
|
||
recentSignals.oppositeDirectionPrice * 100
|
||
)
|
||
console.log(`🔍 Flip-flop price check: $${recentSignals.oppositeDirectionPrice.toFixed(2)} → $${params.currentPrice.toFixed(2)} = ${priceChangePercent.toFixed(2)}%`)
|
||
// Apply penalty only if < 2% move
|
||
}
|
||
```
|
||
- **Impact:** Without this fix, flip-flop detection is useless - blocks reversals, allows chop
|
||
- **Lesson:** Always validate input data for financial calculations, especially when data might not exist yet
|
||
- **Monitoring:** Watch logs for "🔍 Flip-flop price check: $X → $Y = Z%" to verify correct calculations
|
||
|
||
## File Conventions
|
||
|
||
- **API routes:** `app/api/[feature]/[action]/route.ts` (Next.js 15 App Router)
|
||
- **Services:** `lib/[service]/[module].ts` (drift, pyth, trading, database)
|
||
- **Config:** Single source in `config/trading.ts` with env merging
|
||
- **Types:** Define interfaces in same file as implementation (not separate types directory)
|
||
- **Console logs:** Use emojis for visual scanning: 🎯 🚀 ✅ ❌ 💰 📊 🛡️
|
||
|
||
## Re-Entry Analytics System (Phase 1)
|
||
|
||
**Purpose:** Validate manual Telegram trades using fresh TradingView data + recent performance analysis
|
||
|
||
**Components:**
|
||
1. **Market Data Cache** (`lib/trading/market-data-cache.ts`)
|
||
- Singleton service storing TradingView metrics
|
||
- 5-minute expiry on cached data
|
||
- Tracks: ATR, ADX, RSI, volume ratio, price position, timeframe
|
||
|
||
2. **Market Data Webhook** (`app/api/trading/market-data/route.ts`)
|
||
- Receives TradingView alerts every 1-5 minutes
|
||
- POST: Updates cache with fresh metrics
|
||
- GET: View cached data (debugging)
|
||
|
||
3. **Re-Entry Check Endpoint** (`app/api/analytics/reentry-check/route.ts`)
|
||
- Validates manual trade requests
|
||
- Uses fresh TradingView data if available (<5min old)
|
||
- Falls back to historical metrics from last trade
|
||
- Scores signal quality + applies performance modifiers:
|
||
- **-20 points** if last 3 trades lost money (avgPnL < -5%)
|
||
- **+10 points** if last 3 trades won (avgPnL > +5%, WR >= 66%)
|
||
- **-5 points** for stale data, **-10 points** for no data
|
||
- Minimum score: 55 (vs 60 for new signals)
|
||
|
||
4. **Auto-Caching** (`app/api/trading/execute/route.ts`)
|
||
- Every trade signal from TradingView auto-caches metrics
|
||
- Ensures fresh data available for manual re-entries
|
||
|
||
5. **Telegram Integration** (`telegram_command_bot.py`)
|
||
- Calls `/api/analytics/reentry-check` before executing manual trades
|
||
- Shows data freshness ("✅ FRESH 23s old" vs "⚠️ Historical")
|
||
- Blocks low-quality re-entries unless `--force` flag used
|
||
- Fail-open: Proceeds if analytics check fails
|
||
|
||
**User Flow:**
|
||
```
|
||
User: "long sol"
|
||
↓ Check cache for SOL-PERP
|
||
↓ Fresh data? → Use real TradingView metrics
|
||
↓ Stale/missing? → Use historical + penalty
|
||
↓ Score quality + recent performance
|
||
↓ Score >= 55? → Execute
|
||
↓ Score < 55? → Block (unless --force)
|
||
```
|
||
|
||
**TradingView Setup:**
|
||
Create alerts that fire every 1-5 minutes with this webhook message:
|
||
```json
|
||
{
|
||
"action": "market_data",
|
||
"symbol": "{{ticker}}",
|
||
"timeframe": "{{interval}}",
|
||
"atr": {{ta.atr(14)}},
|
||
"adx": {{ta.dmi(14, 14)}},
|
||
"rsi": {{ta.rsi(14)}},
|
||
"volumeRatio": {{volume / ta.sma(volume, 20)}},
|
||
"pricePosition": {{(close - ta.lowest(low, 100)) / (ta.highest(high, 100) - ta.lowest(low, 100)) * 100}},
|
||
"currentPrice": {{close}}
|
||
}
|
||
```
|
||
|
||
Webhook URL: `https://your-domain.com/api/trading/market-data`
|
||
|
||
## Per-Symbol Trading Controls
|
||
|
||
**Purpose:** Independent enable/disable toggles and position sizing for SOL and ETH to support different trading strategies (e.g., ETH for data collection at minimal size, SOL for profit generation).
|
||
|
||
**Configuration Priority:**
|
||
1. **Per-symbol ENV vars** (highest priority)
|
||
- `SOLANA_ENABLED`, `SOLANA_POSITION_SIZE`, `SOLANA_LEVERAGE`
|
||
- `ETHEREUM_ENABLED`, `ETHEREUM_POSITION_SIZE`, `ETHEREUM_LEVERAGE`
|
||
2. **Market-specific config** (from `MARKET_CONFIGS` in config/trading.ts)
|
||
3. **Global ENV vars** (fallback for BTC and other symbols)
|
||
- `MAX_POSITION_SIZE_USD`, `LEVERAGE`
|
||
4. **Default config** (lowest priority)
|
||
|
||
**Settings UI:** `app/settings/page.tsx` has dedicated sections:
|
||
- 💎 Solana section: Toggle + position size + leverage + risk calculator
|
||
- ⚡ Ethereum section: Toggle + position size + leverage + risk calculator
|
||
- 💰 Global fallback: For BTC-PERP and future symbols
|
||
|
||
**Example usage:**
|
||
```typescript
|
||
// In execute/test endpoints
|
||
const { size, leverage, enabled } = getPositionSizeForSymbol(driftSymbol, config)
|
||
if (!enabled) {
|
||
return NextResponse.json({
|
||
success: false,
|
||
error: 'Symbol trading disabled'
|
||
}, { status: 400 })
|
||
}
|
||
```
|
||
|
||
**Test buttons:** Settings UI has symbol-specific test buttons:
|
||
- 💎 Test SOL LONG/SHORT (disabled when `SOLANA_ENABLED=false`)
|
||
- ⚡ Test ETH LONG/SHORT (disabled when `ETHEREUM_ENABLED=false`)
|
||
|
||
## When Making Changes
|
||
|
||
1. **Adding new config:** Update DEFAULT_TRADING_CONFIG + getConfigFromEnv() + .env file
|
||
2. **Adding database fields:** Update prisma/schema.prisma → `npx prisma migrate dev` → `npx prisma generate` → rebuild Docker
|
||
3. **Changing order logic:** Test with DRY_RUN=true first, use small position sizes ($10)
|
||
4. **API endpoint changes:** Update both endpoint + corresponding n8n workflow JSON (Check Risk and Execute Trade nodes)
|
||
5. **Docker changes:** Rebuild with `docker compose build trading-bot` then restart container
|
||
6. **Modifying quality score logic:** Update BOTH `/api/trading/check-risk` and `/api/trading/execute` endpoints, ensure timeframe-aware thresholds are synchronized
|
||
7. **Exit strategy changes:** Modify Position Manager logic + update on-chain order placement in `placeExitOrders()`
|
||
8. **TradingView alert changes:** Ensure alerts pass `timeframe` field (e.g., `"timeframe": "5"`) to enable proper signal quality scoring
|
||
9. **Position Manager changes:** ALWAYS execute test trade after deployment
|
||
- Use `/api/trading/test` endpoint or Telegram `long sol --force`
|
||
- Monitor `docker logs -f trading-bot-v4` for full cycle
|
||
- Verify TP1 hit → 75% close → SL moved to breakeven
|
||
- SQL: Check `tp1Hit`, `slMovedToBreakeven`, `currentSize` in Trade table
|
||
- Compare: Position Manager logs vs actual Drift position size
|
||
10. **Calculation changes:** Add verbose logging and verify with SQL
|
||
- Log every intermediate step, especially unit conversions
|
||
- Never assume SDK data format - log raw values to verify
|
||
- SQL query with manual calculation to compare results
|
||
- Test boundary cases: 0%, 100%, min/max values
|
||
11. **DEPLOYMENT VERIFICATION (MANDATORY):** Before declaring ANY fix working:
|
||
- Check container start time vs commit timestamp
|
||
- If container older than commit: CODE NOT DEPLOYED
|
||
- Restart container and verify new code is running
|
||
- Never say "fixed" or "protected" without deployment confirmation
|
||
- This is a REAL MONEY system - unverified fixes cause losses
|
||
12. **GIT COMMIT AND PUSH (MANDATORY):** After completing ANY feature, fix, or significant change:
|
||
- ALWAYS commit changes with descriptive message
|
||
- ALWAYS push to remote repository
|
||
- User should NOT have to ask for this - it's part of completion
|
||
- Commit message format:
|
||
```bash
|
||
git add -A
|
||
git commit -m "type: brief description
|
||
|
||
- Bullet point details
|
||
- Files changed
|
||
- Why the change was needed
|
||
"
|
||
git push
|
||
```
|
||
- Types: `feat:` (feature), `fix:` (bug fix), `docs:` (documentation), `refactor:` (code restructure)
|
||
- This is NOT optional - code exists only when committed and pushed
|
||
13. **NEXTCLOUD DECK SYNC (MANDATORY):** After completing phases or making significant roadmap progress:
|
||
- Update roadmap markdown files with new status (🔄 IN PROGRESS, ✅ COMPLETE, 🔜 NEXT)
|
||
- Run sync to update Deck cards: `python3 scripts/sync-roadmap-to-deck.py --init`
|
||
- Move cards between stacks in Nextcloud Deck UI to reflect progress visually
|
||
- Backlog (📥) → Planning (📋) → In Progress (🚀) → Complete (✅)
|
||
- Keep Deck in sync with actual work - it's the visual roadmap tracker
|
||
- Documentation: `docs/NEXTCLOUD_DECK_SYNC.md`
|
||
|
||
## Development Roadmap
|
||
|
||
**Current Status (Nov 14, 2025):**
|
||
- **168 trades executed** with quality scores and MAE/MFE tracking
|
||
- **Capital:** $97.55 USDC at 100% health (zero debt, all USDC collateral)
|
||
- **Leverage:** 15x SOL (reduced from 20x for safer liquidation cushion)
|
||
- **Three active optimization initiatives** in data collection phase:
|
||
1. **Signal Quality:** 0/20 blocked signals collected → need 10-20 for analysis
|
||
2. **Position Scaling:** 161 v5 trades, collecting v6 data → need 50+ v6 trades
|
||
3. **ATR-based TP:** 1/50 trades with ATR data → need 50 for validation
|
||
- **Expected combined impact:** 35-40% P&L improvement when all three optimizations complete
|
||
- **Master roadmap:** See `OPTIMIZATION_MASTER_ROADMAP.md` for consolidated view
|
||
|
||
See `SIGNAL_QUALITY_OPTIMIZATION_ROADMAP.md` for systematic signal quality improvements:
|
||
- **Phase 1 (🔄 IN PROGRESS):** Collect 10-20 blocked signals with quality scores (1-2 weeks)
|
||
- **Phase 2 (🔜 NEXT):** Analyze patterns and make data-driven threshold decisions
|
||
- **Phase 3 (🎯 FUTURE):** Implement dual-threshold system or other optimizations based on data
|
||
- **Phase 4 (🤖 FUTURE):** Automated price analysis for blocked signals
|
||
- **Phase 5 (🧠 DISTANT):** ML-based scoring weight optimization
|
||
|
||
See `POSITION_SCALING_ROADMAP.md` for planned position management optimizations:
|
||
- **Phase 1 (✅ COMPLETE):** Collect data with quality scores (20-50 trades needed)
|
||
- **Phase 2:** ATR-based dynamic targets (adapt to volatility)
|
||
- **Phase 3:** Signal quality-based scaling (high quality = larger runners)
|
||
- **Phase 4:** Direction-based optimization (shorts vs longs have different performance)
|
||
- **Phase 5 (✅ COMPLETE):** TP2-as-runner system implemented - configurable runner (default 25%, adjustable via TAKE_PROFIT_1_SIZE_PERCENT) with ATR-based trailing stop
|
||
- **Phase 6:** ML-based exit prediction (future)
|
||
|
||
**Recent Implementation:** TP2-as-runner system provides 5x larger runner (default 25% vs old 5%) for better profit capture on extended moves. When TP2 price is hit, trailing stop activates on full remaining position instead of closing partial amount. Runner size is configurable (100% - TP1 close %).
|
||
|
||
**Blocked Signals Tracking (Nov 11, 2025):** System now automatically saves all blocked signals to database for data-driven optimization. See `BLOCKED_SIGNALS_TRACKING.md` for SQL queries and analysis workflows.
|
||
|
||
**Data-driven approach:** Each phase requires validation through SQL analysis before implementation. No premature optimization.
|
||
|
||
**Signal Quality Version Tracking:** Database tracks `signalQualityVersion` field to compare algorithm performance:
|
||
- Analytics dashboard shows version comparison: trades, win rate, P&L, extreme position stats
|
||
- v4 (current) includes blocked signals tracking for data-driven optimization
|
||
- Focus on extreme positions (< 15% range) - v3 aimed to reduce losses from weak ADX entries
|
||
- SQL queries in `docs/analysis/SIGNAL_QUALITY_VERSION_ANALYSIS.sql` for deep-dive analysis
|
||
- Need 20+ trades per version before meaningful comparison
|
||
|
||
**Financial Roadmap Integration:**
|
||
All technical improvements must align with current phase objectives (see top of document):
|
||
- **Phase 1 (CURRENT):** Prove system works, compound aggressively, 60%+ win rate mandatory
|
||
- **Phase 2-3:** Transition to sustainable growth while funding withdrawals
|
||
- **Phase 4+:** Scale capital while reducing risk progressively
|
||
- See `TRADING_GOALS.md` for complete 8-phase plan ($106 → $1M+)
|
||
- SQL queries in `docs/analysis/SIGNAL_QUALITY_VERSION_ANALYSIS.sql` for deep-dive analysis
|
||
- Need 20+ trades per version before meaningful comparison
|
||
|
||
**Blocked Signals Analysis:** See `BLOCKED_SIGNALS_TRACKING.md` for:
|
||
- SQL queries to analyze blocked signal patterns
|
||
- Score distribution and metric analysis
|
||
- Comparison with executed trades at similar quality levels
|
||
- Future automation of price tracking (would TP1/TP2/SL have hit?)
|
||
|
||
## Integration Points
|
||
|
||
- **n8n:** Expects exact response format from `/api/trading/execute` (see n8n-complete-workflow.json)
|
||
- **Drift Protocol:** Uses SDK v2.75.0 - check docs at docs.drift.trade for API changes
|
||
- **Pyth Network:** WebSocket + HTTP fallback for price feeds (handles reconnection)
|
||
- **PostgreSQL:** Version 16-alpine, must be running before bot starts
|
||
|
||
---
|
||
|
||
**Key Mental Model:** Think of this as two parallel systems (on-chain orders + software monitoring) working together. The Position Manager is the "backup brain" that constantly watches and acts if on-chain orders fail. Both write to the same database for complete trade history.
|