A comprehensive guide to navigating and understanding the METAINFORMANT documentation system.
If you're new to METAINFORMANT, follow this path:
- QUICKSTART.md - Installation and basic usage examples
- Architecture - System design and module relationships
- CLI Reference - Command-line interface for all modules
- Domain-specific documentation - See below for your area of interest
METAINFORMANT documentation is organized hierarchically by domain and purpose:
- README.md - Project overview, features, and quick examples
- QUICKSTART.md - Fast setup and common commands
- AGENTS.md - AI assistance in development
- index.md - Complete documentation index with navigation
- architecture.md - System design, module dependencies, data flow
- cli.md - Unified CLI reference for all modules
- setup.md - Installation and environment configuration
- testing.md - Test suite documentation and guidelines
- UV_SETUP.md - Package management with
uv
Each biological domain has its own subdirectory with consistent structure:
| Domain | Location | Primary Focus |
|---|---|---|
| Core | docs/core/ |
Shared utilities (I/O, config, logging, paths) |
| DNA | docs/dna/ |
Sequence analysis, alignment, phylogeny, population genetics |
| RNA | docs/rna/ |
RNA-seq workflows, amalgkit integration |
| GWAS | docs/gwas/ |
Genome-wide association studies |
| eQTL | docs/eqtl/ |
Expression QTL integration pipeline |
| Protein | docs/protein/ |
Protein sequences, structures, databases |
| Single-Cell | docs/singlecell/ |
scRNA-seq preprocessing, clustering, trajectory |
| Networks | docs/networks/ |
Biological networks, community detection |
| Multi-Omics | docs/multiomics/ |
Cross-omics data integration |
| Math | docs/math/ |
Population genetics theory, epidemiology |
| ML | docs/ml/ |
Machine learning methods |
| Visualization | docs/visualization/ |
Plots, animations, trees |
| Simulation | docs/simulation/ |
Synthetic data generation |
| Quality | docs/quality/ |
Data quality assessment |
| Information | docs/information/ |
Information theory methods |
| Life Events | docs/life_events/ |
Life course event analysis |
| Ontology | docs/ontology/ |
Gene Ontology, functional annotation |
| Phenotype | docs/phenotype/ |
Phenotypic trait analysis |
| Epigenome | docs/epigenome/ |
Methylation, ChIP-seq, ATAC-seq |
| Ecology | docs/ecology/ |
Community analysis, diversity metrics |
| Long-read | docs/longread/ |
PacBio/Nanopore sequencing, assembly |
| Metagenomics | docs/metagenomics/ |
Amplicon, shotgun, functional annotation |
| Structural Variants | docs/structural_variants/ |
CNV/SV detection, annotation, visualization |
| Spatial | docs/spatial/ |
Spatial transcriptomics (Visium, MERFISH, Xenium) |
| Pharmacogenomics | docs/pharmacogenomics/ |
Clinical variants, CPIC, PharmGKB |
| Metabolomics | docs/metabolomics/ |
Mass spectrometry, pathway mapping |
| Cloud | docs/cloud/ |
GCP deployment, Docker pipelines, VM lifecycle |
| Agents | docs/agents/ |
Agent-based modeling, ecosystem simulation |
Each domain directory follows a consistent pattern:
docs/<domain>/
├── README.md # Domain overview and quick start
├── index.md # Detailed index of all submodules
└── <topic>.md # Topic-specific documentation
| Task | Documentation |
|---|---|
| Install METAINFORMANT | QUICKSTART.md, setup.md |
| Run CLI commands | cli.md |
| Analyze DNA sequences | docs/dna/ |
| Run RNA-seq workflow | docs/rna/workflow.md |
| Perform GWAS | docs/gwas/workflow.md |
| Analyze single-cell data | docs/singlecell/ |
| Visualize results | docs/visualization/ |
| Run tests | testing.md |
| Understand architecture | architecture.md |
Source code documentation mirrors the src/metainformant/ structure:
| Source Module | Source README | User Documentation |
|---|---|---|
core/ |
src/metainformant/core/README.md |
docs/core/ |
dna/ |
src/metainformant/dna/README.md |
docs/dna/ |
rna/ |
src/metainformant/rna/README.md |
docs/rna/ |
| ... | ... | ... |
- Located in each module directory
- Provide quick overview and basic usage
- Link to detailed documentation
- Document AI contributions to the project
- Located at repository root and in
src/metainformant/module directories - Track development history and design decisions
- Cursor project skills under
.cursor/skills/are generated from eachAGENTS.md; regenerate withuv run python scripts/package/generate_cursor_skills.py(see.cursor/skills/README.md)
- Detailed coverage of specific functionality
- Code examples and API references
- Performance considerations
- YAML configuration file format
- Environment variable overrides
- Species-specific configurations
- Start with domain
README.mdfor overview - Check
index.mdfor available topics - Read topic-specific files for details
- Reference source code READMEs for API specifics
- Follow existing structure and formatting
- Include practical code examples
- Update cross-references when adding new docs
- Place new docs in appropriate
docs/<domain>/directory - Never create documentation in
output/directory
- Clear, technical writing style
- Consistent markdown formatting
- Runnable code examples
- Cross-references between related topics
- Regular updates with code changes
- Documentation Index - Complete hierarchical navigation
- Architecture - Module relationships diagram
- CLI Reference - All available commands
- cursorrules/README.md - Module-specific coding rules
- tests/README.md - Test suite organization
- scripts/README.md - Workflow scripts
- Documentation Issues: Check for broken links or outdated content
- Code Questions: Reference source code READMEs and docstrings
- Workflow Help: See domain-specific workflow documentation
- Bug Reports: https://github.com/docxology/metainformant/issues
This guide provides navigation for documentation organized across 29 domains, ensuring coverage of METAINFORMANT's bioinformatics capabilities.