FMS-Medical: Foundation Models for Advancing Healthcare (Catalog)¶

Overview¶

Type: Knowledge Base / Survey Resource
Format: Curated repository of medical foundation models + datasets
Coverage: Language (LFM), Vision (VFM), Bioinformatics (BFM), Multimodal (MFM)
Primary use: Model and dataset discovery, benchmarking reference, literature review

Purpose & Design Philosophy¶

FMS-Medical is an "awesome list" style knowledge base tracking foundation model research across healthcare modalities. Maintained as a GitHub repository with bilingual documentation (English + Chinese), it provides structured references to 100+ medical FM papers, datasets, tutorials, and related resources organized by modality and year. The resource is anchored by an IEEE Reviews in Biomedical Engineering survey paper and serves as a comprehensive entry point for medical AI research.

Key value: Centralized, actively maintained catalog of medical FMs and datasets—ideal for systematic literature review and baseline selection.

Catalog Structure¶

Model Taxonomies¶

Category	Coverage	Use Case
LFM (Language)	Medical LLMs, clinical NLP	Text understanding, report generation, QA
VFM (Vision)	Medical image encoders, segmentation	Radiology, pathology, ultrasound analysis
BFM (Bioinformatics)	Genomics, proteomics, drug discovery	Sequence modeling, variant interpretation
MFM (Multimodal)	Vision-language, integrated models	Unified diagnosis, multimodal reasoning

Dataset Catalogs¶

Text datasets: Clinical notes, radiology reports, biomedical literature
Imaging datasets: CXR, CT, MRI, pathology, ultrasound, retinal
Omics datasets: Genomics, transcriptomics, proteomics
Multimodal datasets: Image-text pairs, integrated EHR + imaging

Integration Strategy¶

For Neuro-Omics KB¶

FMS-Medical provides dataset discovery and model benchmarking:

Key uses: - Literature review: Identify related work in medical AI for neuro-omics - Dataset selection: Find imaging and genomics datasets for validation - Baseline comparison: Track state-of-the-art methods for benchmarking - Survey reference: Cite comprehensive medical FM survey

Application to KB pipeline:

FMS-Medical catalog
    ↓
Extract relevant entries:
    - Brain imaging models
    - Genomics foundation models
    - Multimodal medical datasets
    ↓
Populate KB model/dataset cards
    ↓
Benchmark neuro-omics methods against medical FM baselines

For ARPA-H Brain-Omics Model (BOM)¶

FMS-Medical informs medical AI landscape understanding:

Survey medical FMs
    ↓
Identify integration patterns:
    - CLIP-style alignment (M3FM, TITAN)
    - MoT/MoE architectures
    - LLM continual pretraining (Me-LLaMA)
    ↓
Apply to BOM design

Transfer insights: - Comprehensive coverage: Survey spans all medical FM modalities—identify gaps for neuro-omics - Dataset catalogs: Find publicly available datasets for cross-validation - Benchmark references: Track medical FM performance for comparison - Bilingual support: Chinese documentation aids international collaboration

Catalog Access Workflow¶

# 1. Clone or sync FMS-Medical repository
git clone https://github.com/YutingHe-list/Awesome-Foundation-Models-for-Advancing-Healthcare

# 2. Review README for model/dataset tables

# 3. Extract entries relevant to neuro-omics:
#    - VFM: Brain imaging models
#    - BFM: Genomics models
#    - MFM: Multimodal integration patterns

# 4. Populate KB YAML cards from extracted entries

# 5. Track updates via GitHub (actively maintained)

For KB automation: - Parse README tables to generate candidate model/dataset cards - Link to survey PDFs for detailed descriptions - Monitor repository updates for new medical FMs

Strengths & Limitations¶

Strengths¶

Comprehensive coverage: 100+ medical FMs across all modalities
Actively maintained: Regular updates with publication news
Bilingual: English + Chinese documentation
Well-organized: Structured by modality, year, and venue
Survey anchor: Peer-reviewed IEEE Reviews paper provides synthesis

Limitations¶

Documentation-only: No executable code or model weights
General medical focus: Limited neuro-omics specific content
No unified schema: Markdown tables, not structured YAML/JSON
Citation lag: New models may take time to appear in catalog

When to Use FMS-Medical¶

✅ Use when: - Starting literature review on medical FMs - Selecting baseline models for benchmarking - Finding publicly available medical datasets - Identifying integration patterns from related work - Citing comprehensive medical FM surveys

⚠️ Not a substitute for: - Model implementation code (links to external repos) - Pretrained model weights (links to paper/HuggingFace) - Executable benchmarking pipelines (reference only)

⚠️ Complement with: - Papers With Code: For benchmark leaderboards - HuggingFace Model Hub: For pretrained weights - Model-specific repos: For implementation details

Reference Materials¶

Knowledge Base Resources¶

Curated materials in this KB: - Code walkthrough: FMS-Medical walkthrough - Dataset card (YAML): kb/datasets/fms_medical_catalog.yaml

Integration recipes: - Integration Strategy — Dataset selection - KB Overview — Catalog integration patterns

Original Sources¶

Source repositories: - Local copy: external_repos/fms-medical/ - Official GitHub: YutingHe-list/Awesome-Foundation-Models-for-Advancing-Healthcare

Original survey paper: - Title: "Foundation Models for Healthcare" - Authors: Yuting He, et al. - Published: IEEE Reviews in Biomedical Engineering, 2024 - Link (IEEE): IEEE Xplore: 10750441 - Link (arXiv): arXiv:2404.03264

Next Steps in Our Pipeline¶

Systematic extraction: Parse FMS-Medical tables to populate KB model/dataset cards
Gap analysis: Identify medical FM capabilities missing from neuro-omics KB
Benchmark selection: Choose medical FM baselines for comparison experiments
Dataset discovery: Find publicly available datasets for cross-validation
Literature tracking: Monitor repository updates for new medical FM methods

Engineering Notes¶

FMS-Medical is pure documentation—no code dependencies
Bilingual PDFs in files/ directory useful for international teams
Citation metadata in tables ready for automated KB card generation
Active maintenance—check GitHub for recent updates before citing
Survey paper provides narrative synthesis complementing tabular catalog