Trimmer

Base Trimmer

class pytrimal.BaseTrimmer

A sequence alignment trimmer.

All subclasses provide the same trim method, and are configured through their constructor.

__init__()

Create a new base trimmer.

trim(alignment)

Trim the provided alignment.

Parameters
  • alignment (Alignment) – A multiple sequence alignment to trim.

  • matrix (SimilarityMatrix, optional) – An alternative similarity matrix to use for computing the similarity statistic. If None, a default matrix will be used based on the type of the alignment.

Returns

TrimmedAlignment – The trimmed alignment.

Hint

This method is re-entrant, and can be called safely accross different threads. Most of the computations will be done after releasing the GIL.

Changed in version 0.1.2: Added the matrix optional argument.

Automatic Trimmer

class pytrimal.AutomaticTrimmer(BaseTrimmer)

A sequence alignment trimmer with automatic parameter detection.

trimAl provides several heuristic methods for automated trimming of multiple sequence algorithms:

  • strict: A statistical method that combines gaps and similarity statistics to clean the alignment.

  • strictplus: A statistical method that combines gaps and similarity statistics, optimized for Neighbour-Joining tree reconstruction.

  • gappyout: A statistical method that only uses gaps statistic to clean the alignment.

  • automated1: A meta-method that chooses between strict and gappyout, optimized for Maximum Likelihood phylogenetic tree reconstruction.

  • nogaps: A naive method that removes every column containing at least one gap.

  • noallgaps: A naive method that removes every column containing only gaps.

__init__(method='strict')

Create a new automatic alignment trimmer using the given method.

Parameters

method (str) – The automatic aligment trimming method. See the documentation for AutomatedTrimmer for a list of supported values.

Raises

ValueError – When method is not one of the automatic alignment trimming methods supported by trimAl.

trim(alignment)

Trim the provided alignment.

Parameters
  • alignment (Alignment) – A multiple sequence alignment to trim.

  • matrix (SimilarityMatrix, optional) – An alternative similarity matrix to use for computing the similarity statistic. If None, a default matrix will be used based on the type of the alignment.

Returns

TrimmedAlignment – The trimmed alignment.

Hint

This method is re-entrant, and can be called safely accross different threads. Most of the computations will be done after releasing the GIL.

Changed in version 0.1.2: Added the matrix optional argument.

Manual Trimmer

class pytrimal.ManualTrimmer(BaseTrimmer)

A sequence alignment trimmer with manually defined thresholds.

Manual trimming allows the user to specify independent thresholds for four different statistics:

  • Consistency threshold: Remove columns with a consistency ratio lower than the provided threshold.

  • Gap threshold: Remove columns where the gap ratio (or the absolute gap count) is higher than the provided threshold.

  • Similarity threshold: Remove columns with a similarity ratio lower than the provided threshold.

In addition, the trimming can be restricted so that at least a configurable fraction of the original alignment is retained, in order to avoid stripping an alignment of distance sequences by aggressive trimming.

__init__(*, gap_threshold=None, gap_absolute_threshold=None, similarity_threshold=None, consistency_threshold=None, conservation_percentage=None)

Create a new manual alignment trimmer with the given parameters.

Keyword Arguments
  • gap_threshold (float) – The minimum fraction of non-gap characters that must be present in a column to keep the column.

  • gap_absolute_threshold (int, optional) – The absolute number of gaps allowed on a column to keep it in the alignment. Incompatible with gap_threshold.

  • similarity_threshold (float, optional) – The minimum average similarity required.

  • consistency_threshold (float, optional) – The minimum consistency value required.

  • conservation_percentage (float, optional) – The minimum percentage of positions in the original alignment to conserve.

trim(alignment)

Trim the provided alignment.

Parameters
  • alignment (Alignment) – A multiple sequence alignment to trim.

  • matrix (SimilarityMatrix, optional) – An alternative similarity matrix to use for computing the similarity statistic. If None, a default matrix will be used based on the type of the alignment.

Returns

TrimmedAlignment – The trimmed alignment.

Hint

This method is re-entrant, and can be called safely accross different threads. Most of the computations will be done after releasing the GIL.

Changed in version 0.1.2: Added the matrix optional argument.