Analyze Binding Sites Dialog Box

Summary

Calculate sequence identity in multiple alignment columns that are within a certain spatial distance from ligand residues, and filter the alignment sequences based on the sequence identity.

Opening the Analyze Binding Sites Dialog Box

To open the Analyze Binding Sites dialog box, you can:

Using the Analyze Binding Sites Dialog Box

To analyze a binding site, you must have a sequence and a structure for the query, and the structure must have at least one ligand (it can have more). Next, you must have a set of sequences that are aligned to the query, which you can obtain from a BLAST search followed by an alignment. If you want to do the analysis for sequences other than the query, you must have structures with ligands for those sequences.

When you open the dialog box, the query sequence is analyzed and annotated with ligand contacts (for all ligands), and the columns for the residues that are in contact with the ligand (or first ligand) are selected. Sequences that have 100% identity with the query residues that are in contact with the ligand (or the first ligand) are also selected by default.

The dialog box opens at the Conservation Analysis tab. If you want to select and remove sequences that are similar to the query or different from the query in the active site, you can use the tools in this tab.

The Distance Analysis tab is useful for examining the average conservation within a set of sequences. The analysis could be repeated for different queries to obtain a comparison of the average conservation for the queries.

You can change the sequence and select the ligand with the Sequence and Ligand option menus. When you change the sequence, the new sequence is analyzed and annotated and the column selection updated for the new ligand. Changing the ligand updates the column selection.

Analyze Binding Sites Dialog Box Features

Sequence option menu

Choose the sequence from this option menu. The analysis is done on the basis of the chosen sequence. This means that the sequence you choose must have an associated structure with a ligand.

Ligand option menu

Choose the ligand for the sequence from this option menu, if the structure for the sequence has more than one ligand. The column selection changes to represent contact with the chosen ligand.

Conservation Analysis tab

In this tab you can select sequences based on the degree of conservation of residues in the active site (i.e. residues that are in contact with the ligand).

Sequence identity threshold in selected columns slider

Use this slider to adjust the percentage sequence identity threshold for selecting sequences in the Multiple Sequence Viewer panel. As you change the value, the selection is updated to reflect the new threshold. The default is 100%, so only sequences that are completely identical for the active site residues are selected by default.

Remove Similar Sequences button

For the current threshold, remove sequences from the Multiple Sequence Viewer panel whose sequence identity in the active site is greater than the threshold. This is useful if you are looking for mutations in the active site and want to eliminate sequences that are the same or very similar in the active site.

Remove Divergent Sequences buttons

For the current threshold, remove sequences from the Multiple Sequence Viewer panel whose sequence identity in the active site is smaller than the threshold. This is useful if you are looking for sequences that are the same or very similar in the active site, but may diverge elsewhere.

Distance Analysis tab

In this tab you can analyze the average identity, similarity, and homology of the residues in the alignment to the query residues that are within a range of distances of the ligand.

Distance table

This table reports the results of the analysis. Each row represents a neighborhood of the selected residues. The first column is the distance cutoff for that neighborhood: any residue that has a heavy atom within that distance of a heavy atom in the selection is considered to be in that neighborhood. The remaining columns contain the identity, similarity and homology values calculated for the columns within the alignment that include the residues within the distance threshold.

Scan from N to M angstroms boxes

Specify the range of distances over which the analysis is performed.

Export results to CSV file button

Export the table data to a CSV file. Opens a file selector, in which you can navigate to a location and name the file.

Analyze button

Analyze the residues in the alignment for their identity, similarity, and homology to the query residues that are within the specified distance threshold of the ligand.

Related Topics


Legal Notice

File: tools_menu/msv_analyze_binding_site.html
Last updated: 29 Apr 2014