filtering_analysis

Phase 3: First-Pass Filtering Analysis

Novelty Quick-Check Results

⚠️ PARTIAL OVERLAP - Need Differentiation

Idea 1: Modality-Aware Adaptive LoRA (MA-LoRA)

Idea 2: Cross-Modal Budget Allocation (CMBA)

Idea 3: Zero-Shot Rank Predictor (ZRP)

✅ NOVEL - Clear Differentiation

Idea 4: Hierarchical Rank Allocation (HRA)

Idea 5: Modality-Specific Learning Rate Scaling (MSLR)

Idea 10: Dynamic Rank Adjustment During Training

❌ HIGH RISK / HIGH COMPUTE

Idea 6: Task-Conditioned Rank Allocation (TCRA)

Idea 7: Gradient-Free Rank Search via Evolutionary Algorithm

Idea 8: Cross-Architecture Rank Transfer

Idea 9: Information Bottleneck-Guided Rank Allocation

Feasibility Check

Idea Compute Data Implementation Verdict
1. MA-LoRA 40h VQAv2 ✅ Medium ⚠️ Needs differentiation
2. CMBA 24h VQAv2 ✅ Easy ✅ PASS
3. ZRP 60h VQAv2 ✅ Hard ⚠️ Overlaps SR-LoRA
4. HRA 32h VQAv2 ✅ Medium ✅ PASS
5. MSLR 24h VQAv2 ✅ Easy ✅ PASS
6. TCRA 72h 3 datasets Hard ❌ Too expensive
7. Evolutionary 80h 2 datasets Medium ❌ Too expensive
8. Transfer 96h 3 models Hard ❌ Too expensive
9. IB-Guided 48h VQAv2 ✅ Very Hard ❌ High risk + cost
10. Dynamic 24h VQAv2 ✅ Easy ✅ PASS

Impact Estimation

High Impact (clear "so what"):

Medium Impact:

Unclear Impact:

Surviving Ideas (6 → 4)

✅ Top Tier (pilot these)

  1. Idea 2: Cross-Modal Budget Allocation (CMBA) - LOW risk, HIGH impact, NOVEL
  2. Idea 4: Hierarchical Rank Allocation (HRA) - MEDIUM risk, HIGH impact, NOVEL
  3. Idea 10: Dynamic Rank Adjustment (Dynamic) - LOW risk, HIGH impact, NOVEL

⚠️ Second Tier (validate on paper, pilot if budget allows)

  1. Idea 5: Modality-Specific Learning Rate Scaling (MSLR) - LOW risk, MEDIUM impact, NOVEL

❌ Eliminated (6 ideas)

Recommendation for Phase 4

Pilot these 3 ideas in parallel (total: 24+32+24 = 80h, but pilots are scaled down):

  1. CMBA - 6 ratio ablations × 2h = 12h pilot
  2. HRA - 3 allocation strategies × 3h = 9h pilot
  3. Dynamic - 3 rank schedules × 2h = 6h pilot

Total pilot budget: ~27 GPU-hours (well under MAX_TOTAL_GPU_HOURS=8 per idea)

If pilots show positive signal, proceed to deep validation (Phase 4).


Key Findings from Literature

Recent relevant work (2024-2025):

Structural gap confirmed: No work systematically studies budget allocation ratios across modalities (vision:projector:language). This is Idea 2's unique angle.