Utils#
Utility functions for models and metrics. Plotting functionality for models mapping sequences to either embedding- or property-space. Diagnostics to evaluate embedding models alignment with the feature(s) of interest.
Projections#
Projection and visualization of sequence embeddings.
Project embeddings into 2D using PCA. |
|
Project embeddings into 2D using t-SNE. |
|
Project embeddings into 2D using UMAP. |
|
Plot projections for one or more groups. |
Diagnostics#
Helpers to select an appropriate embedding model.
Compute the Spearman correlation coefficient using the pairwise distance between embedding spaces. |
|
Compute the k-NN feature alignment score of an embedding model. |
|
Plot the k-NN feature alignment score of an embedding model using variable-number of neighbors. |
Sequence Operations#
Helpers for generating and transforming sequence data.
Randomly shuffle characters within each sequence. |
|
Sample a subset of the sequences without replacement. |