Covary protocol portfolio and 3.0.1 beta are now available.
A Genomic Intelligence Framework

A translation-aware framework for alignment-free phylogenetics using machine learning

Covary is a computational framework designed for large-scale biological sequence analysis, powered by TIPs-VF. Leverage alignment-free, translation-aware embeddings to compare, cluster, and analyze genetic sequences at scale.

Built for large-scale sequence analysis

Covary is a computational framework designed for large-scale biological sequence analysis, powered by TIPs-VF (Translator-Interpreter Pre-seeding for Variable-length Fragments).

Covary feature overview — PCA, t-SNE, UMAP embeddings and dendrogram outputs
Covary t-SNE sequence embedding scatter plot
Covary UMAP distance heatmap
Covary t-SNE dendrogram single linkage
Learn more
🧬
Analysis

Alignment-free

Circumvents computationally expensive multiple sequence alignments (MSA), enabling scalable analyses for large datasets.

🔤
Encoding

Translation-aware embeddings

Incorporates codon-bound information for biologically meaningful sequence comparison — works in both coding and non-coding sequences.

📊
Clustering

Distance-based clustering

Computes embeddings and distance matrices for downstream clustering and visualization across PCA, t-SNE, and UMAP.

🌿
Resolution

Phylogenetic resolution

Resolves sequences at species, genus, family, and order levels. Supports multi-FASTA files from a variety of organisms.

Phylogenomic tools & data workflows

Phylogenomic tools, data processing workflows and simulation pipelines built for Covary.

Covary-encoder toolkit screenshot
Encoding

Covary-encoder

A k-mer-derived, non-overlapping, and frequency-independent encoding logic. Represents genetic sequences based on relative proximity, directional alignment, and translation awareness.

Explore on GitHub
Mutagen-PX toolkit screenshot
Simulation

Mutagen-PX

A lightweight Python toolkit that simulates tumor-specific gene sequence profiles by applying patient mutation data from TCGA cohorts to a reference sequence. Recreates "mutated" FASTA outputs per patient.

Explore on GitHub
Seed Aligner toolkit screenshot
Pre-processing

Seed Aligner

A computationally-optimized tool that detects a common seed region across genetic sequences and reorders them to start at the same point, standardizing FASTA inputs for Covary without full MSA.

Explore on GitHub

What you can do with Covary

Covary leverages alignment-free, translation-aware embeddings to compare, cluster, and analyze genetic sequences.

Covary phylogenomics usage illustration
Phylogenomics

Perform large-scale phylogenomic analyses

Covary phylogenomics usage illustration
Embeddings

Embed sequences with translation-aware context

Covary phylogenomics usage illustration
Exploration

Run fast, scalable exploratory workflows

Covary phylogenomics usage illustration
Alignment-free

Analyze sequences without alignment or gaps

Discover more in the publication

Suitable for diverse research questions

Covary is suitable for use in solving a number of research questions spanning phylogenetics, pathogen surveillance, and more.

🔬
Use Case

Taxonomic studies

Covary t-SNE dendrogram complete linkage — taxonomic use case

Covary provides alignment-free, translation-aware comparisons that help distinguish species, infer relatedness, and support phylogenetic analyses even across highly divergent sequences.

🦠
Use Case

Pathogen detection

Covary t-SNE heatmap — pathogen detection use case

Covary can rapidly screen genetic sequences from clinical or environmental samples to identify viral, bacterial, or fungal pathogens without requiring multiple sequence alignment.

🌍
Use Case

Metagenomic exploration

Covary UMAP embedding scatter — metagenomic use case

Covary enables fast, large-scale profiling of mixed microbial communities, helping researchers uncover taxonomic composition, detect rare organisms, and analyze functional divergence.

Explore more research topics

Beyond phylogenetic analysis

Covary's translation-aware, alignment-free framework extends into new domains — from oncogenomics to forensic genetics and precision medicine.

🧫
Oncogenomics

Tumor clonal evolution

Map subclonal architecture and mutational trajectories in cancer genomes, tracing how tumor lineages diverge over time and under selective pressure.

Learn more
Epidemiology

Outbreak & variant prediction

Model the evolutionary trajectories of viral genomes in outbreak settings — forecasting emergent variants and resistance evolution for proactive public health response.

Learn more
🔍
Forensics

Forensic & environmental genetics

Apply Covary's identification engine to forensic DNA profiling, environmental surveillance, and species-of-origin determination from complex biological samples.

Learn more
View all beyond-phylogenetics projects →

See Covary in action — no setup required

Run a simple interactive, step-by-step simulation of a Covary analysis pipeline directly in your browser. Based on predefined sequences and parameters from the official Google Colab notebook — no ML runtime needed.

  • Step-by-step walkthrough from FASTA input to result output
  • Simulated k-mer encoding, distance matrix, and dendrogram construction
  • Predefined parameter sets modeled after real Colab runs
  • Animated visualization of PCA, t-SNE, and UMAP projections
  • Plain-language explanations at each step
Launch Simulation →
Covary interactive simulation preview

For visual representation only. See Covary notebook for the actual interface.

Published Papers

01

Covary: A translation-aware framework for alignment-free phylogenetics using machine learning.

bioRxiv · 2025-11

doi.org/10.1101/2025.11.13.687960 →
02

Rapid Phylogenomic Analysis of Thousands Outbreak-Causing Viral Genomes Using Covary.

Preprints · 2025-12

doi.org/10.20944/preprints202512.1970.v1 →
View Published Protocols →

Get in touch

Have questions about Covary, licensing, training, or the Research Program? Reach out and we'll get back to you.

License inquiries welcome
Training & workshop requests
Procurement & institutional access