FMS-Medical: Foundation Models for Advancing Healthcare (Catalog)¶
Overview¶
Type: Knowledge Base / Survey Resource
Format: Curated repository of medical foundation models + datasets
Coverage: Language (LFM), Vision (VFM), Bioinformatics (BFM), Multimodal (MFM)
Primary use: Model and dataset discovery, benchmarking reference, literature review
Purpose & Design Philosophy¶
FMS-Medical is an "awesome list" style knowledge base tracking foundation model research across healthcare modalities. Maintained as a GitHub repository with bilingual documentation (English + Chinese), it provides structured references to 100+ medical FM papers, datasets, tutorials, and related resources organized by modality and year. The resource is anchored by an IEEE Reviews in Biomedical Engineering survey paper and serves as a comprehensive entry point for medical AI research.
Key value: Centralized, actively maintained catalog of medical FMs and datasets—ideal for systematic literature review and baseline selection.
Catalog Structure¶
Model Taxonomies¶
| Category | Coverage | Use Case |
|---|---|---|
| LFM (Language) | Medical LLMs, clinical NLP | Text understanding, report generation, QA |
| VFM (Vision) | Medical image encoders, segmentation | Radiology, pathology, ultrasound analysis |
| BFM (Bioinformatics) | Genomics, proteomics, drug discovery | Sequence modeling, variant interpretation |
| MFM (Multimodal) | Vision-language, integrated models | Unified diagnosis, multimodal reasoning |
Dataset Catalogs¶
- Text datasets: Clinical notes, radiology reports, biomedical literature
- Imaging datasets: CXR, CT, MRI, pathology, ultrasound, retinal
- Omics datasets: Genomics, transcriptomics, proteomics
- Multimodal datasets: Image-text pairs, integrated EHR + imaging
Integration Strategy¶
For Neuro-Omics KB¶
FMS-Medical provides dataset discovery and model benchmarking:
Key uses: - Literature review: Identify related work in medical AI for neuro-omics - Dataset selection: Find imaging and genomics datasets for validation - Baseline comparison: Track state-of-the-art methods for benchmarking - Survey reference: Cite comprehensive medical FM survey
Application to KB pipeline:
FMS-Medical catalog
↓
Extract relevant entries:
- Brain imaging models
- Genomics foundation models
- Multimodal medical datasets
↓
Populate KB model/dataset cards
↓
Benchmark neuro-omics methods against medical FM baselines
For ARPA-H Brain-Omics Model (BOM)¶
FMS-Medical informs medical AI landscape understanding:
Survey medical FMs
↓
Identify integration patterns:
- CLIP-style alignment (M3FM, TITAN)
- MoT/MoE architectures
- LLM continual pretraining (Me-LLaMA)
↓
Apply to BOM design
Transfer insights: - Comprehensive coverage: Survey spans all medical FM modalities—identify gaps for neuro-omics - Dataset catalogs: Find publicly available datasets for cross-validation - Benchmark references: Track medical FM performance for comparison - Bilingual support: Chinese documentation aids international collaboration
Catalog Access Workflow¶
# 1. Clone or sync FMS-Medical repository
git clone https://github.com/YutingHe-list/Awesome-Foundation-Models-for-Advancing-Healthcare
# 2. Review README for model/dataset tables
# 3. Extract entries relevant to neuro-omics:
# - VFM: Brain imaging models
# - BFM: Genomics models
# - MFM: Multimodal integration patterns
# 4. Populate KB YAML cards from extracted entries
# 5. Track updates via GitHub (actively maintained)
For KB automation: - Parse README tables to generate candidate model/dataset cards - Link to survey PDFs for detailed descriptions - Monitor repository updates for new medical FMs
Strengths & Limitations¶
Strengths¶
- Comprehensive coverage: 100+ medical FMs across all modalities
- Actively maintained: Regular updates with publication news
- Bilingual: English + Chinese documentation
- Well-organized: Structured by modality, year, and venue
- Survey anchor: Peer-reviewed IEEE Reviews paper provides synthesis
Limitations¶
- Documentation-only: No executable code or model weights
- General medical focus: Limited neuro-omics specific content
- No unified schema: Markdown tables, not structured YAML/JSON
- Citation lag: New models may take time to appear in catalog
When to Use FMS-Medical¶
✅ Use when: - Starting literature review on medical FMs - Selecting baseline models for benchmarking - Finding publicly available medical datasets - Identifying integration patterns from related work - Citing comprehensive medical FM surveys
⚠️ Not a substitute for: - Model implementation code (links to external repos) - Pretrained model weights (links to paper/HuggingFace) - Executable benchmarking pipelines (reference only)
⚠️ Complement with: - Papers With Code: For benchmark leaderboards - HuggingFace Model Hub: For pretrained weights - Model-specific repos: For implementation details
Reference Materials¶
Knowledge Base Resources¶
Curated materials in this KB:
- Code walkthrough: FMS-Medical walkthrough
- Dataset card (YAML): kb/datasets/fms_medical_catalog.yaml
Integration recipes: - Integration Strategy — Dataset selection - KB Overview — Catalog integration patterns
Original Sources¶
Source repositories:
- Local copy: external_repos/fms-medical/
- Official GitHub: YutingHe-list/Awesome-Foundation-Models-for-Advancing-Healthcare
Original survey paper: - Title: "Foundation Models for Healthcare" - Authors: Yuting He, et al. - Published: IEEE Reviews in Biomedical Engineering, 2024 - Link (IEEE): IEEE Xplore: 10750441 - Link (arXiv): arXiv:2404.03264
Next Steps in Our Pipeline¶
- Systematic extraction: Parse FMS-Medical tables to populate KB model/dataset cards
- Gap analysis: Identify medical FM capabilities missing from neuro-omics KB
- Benchmark selection: Choose medical FM baselines for comparison experiments
- Dataset discovery: Find publicly available datasets for cross-validation
- Literature tracking: Monitor repository updates for new medical FM methods
Engineering Notes¶
- FMS-Medical is pure documentation—no code dependencies
- Bilingual PDFs in
files/directory useful for international teams - Citation metadata in tables ready for automated KB card generation
- Active maintenance—check GitHub for recent updates before citing
- Survey paper provides narrative synthesis complementing tabular catalog