Skip to content

Integration Hub

Everything in this section supports the phased escalation strategy documented in the Integration Plan (Nov 2025). Use it as the connective tissue between per-modality preprocessing, harmonization, and experiment execution.

Overview

This hub provides end-to-end guidance for integrating genetics, brain, and behavioral data using foundation models.

πŸ“‹ Key Resources

πŸ”¬ Analysis Recipes

Copy-ready runbooks for common integration tasks:

🧬🧠 Modality Features

Concrete instructions for extracting and harmonizing features:

  • Genomics β€” Genetics embeddings, RC-equivariance, gene attribution
  • sMRI β€” FreeSurfer ROIs, PCA compression, site harmonization
  • fMRI β€” Functional connectivity, BrainLM/SwiFT embeddings, preprocessing

🎨 Integration Cards

Comprehensive multimodal fusion guidance:

Quick Start

Before running any analysis, grab the relevant strategy IDs and log them with your experiment configs:

# Show sMRI baseline recipe
python scripts/manage_kb.py ops strategy smri_free_surfer_pca512_v1

# Inspect harmonization metadata (e.g., MURD)
python scripts/manage_kb.py ops harmonization murd_t1_t2

# Show rs-fMRI preprocessing stack
python scripts/manage_kb.py ops strategy rsfmri_swift_segments_v1

This keeps downstream reports auditable even when raw datasets (e.g., UKB) cannot be shared.

Integration Phases

We follow a phased escalation strategy to avoid premature complexity:

Phase Status Pattern Trigger Documentation
Phase 1 βœ… Active Late Fusion Baseline Integration Plan
Phase 2 🚧 Prep Two-Tower Contrastive CCA p<0.001, Ξ”AUROC>5% Integration Plan
Phase 3 ⏳ Future Unified Multimodal (MoT/BAGEL/LLM-Bridge) Ξ”AUROC>10%, cross-modal reasoning Integration Plan

For Late Fusion Workflows (Phase 1)

  1. Read Integration Strategy
  2. Pick analysis recipe: CCA, Prediction, or Partial Correlations
  3. Extract features: Genomics, sMRI, fMRI
  4. Review Ensemble Integration card for stacking strategies
  5. Run analysis with logged strategy IDs

For Multimodal Architecture Design (Phase 2+)

  1. Read Design Patterns for escalation logic
  2. Study Multimodal Architectures for BAGEL/MoT/M3FM/Me-LLaMA/TITAN patterns
  3. Review Multimodal FM Patterns integration card
  4. Consult Oncology Multimodal Review for fusion taxonomy
  5. Check Integration Plan decision table for recommended pattern

For Adding New Integration Strategies

  1. Start from Integration card template
  2. Review existing cards for structure and style
  3. Document mechanics, use cases, caveats, and BOM integration
  4. Add to models/integrations/ directory
  5. Update mkdocs.yml navigation

Key Principles

βœ… Late fusion first β€” Preserve modality-specific signal under heterogeneous semantics
βœ… Unimodal baselines β€” Establish gene-only and brain-only performance before multimodal claims
βœ… Covariate control β€” Z-score + residualize vs age/sex/site before interpreting effects
βœ… Reproducibility β€” Log embedding strategy IDs, harmonization methods, CV folds
βœ… Phased escalation β€” Only escalate when data and compute justify the complexity

Read full integration plan β†’