11Protein Symmetry using BioJava
22================================================================
33
4- BioJava can be used to detect, analyze, and visualize ** symmetry** and
5- ** pseudo-symmetry** in the ** quaternary** (biological assembly) and tertiary
4+ BioJava can be used to detect, analyze, and visualize ** symmetry** and
5+ ** pseudo-symmetry** in the ** quaternary** (biological assembly) and tertiary
66(** internal** ) structural levels.
77
88## Quaternary Symmetry
99
10- The ** quaternary symmetry** of a structure defines the relations between
11- its individual chains or groups of chains. For a more extensive explanation
10+ The ** quaternary symmetry** of a structure defines the relations between
11+ its individual chains or groups of chains. For a more extensive explanation
1212about symmetery visit the [ PDB help page] ( http://www.rcsb.org/pdb/staticHelp.do?p=help/viewers/jmol_symmetry_view.html ) .
1313
1414In the ** quaternary symmetry** detection problem, we are given a set of chains
@@ -19,7 +19,7 @@ relates them. The solution is divided into the following steps:
1919in the pseudo-symmetry case). For that, we perform a pairwise alignment of all
2020chains and determine ** clusters of identical chains** .
21212 . Next, we reduce the each chains to a single point, its ** centroid** (center of mass).
22- 3 . After that, we try different ** symmetry relations** to superimpose the chain centroids
22+ 3 . After that, we try different ** symmetry relations** to superimpose the chain centroids
2323and obtain their RMSD.
24244 . At last, based on the parameters (cutoffs), we determine the ** overall symmetry** of the
2525structure, with the symmetry relations obtained in the previous step.
@@ -36,16 +36,20 @@ Structure s;
3636
3737// Set some parameters if needed different than DEFAULT - see descriptions
3838QuatSymmetryParameters parameters = new QuatSymmetryParameters ();
39- parameters . setVerbose( true ); // print information
39+ SubunitClustererParameters clusterParams = new SubunitClustererParameters ();
4040
4141// Instantiate the detector
42- QuatSymmetryDetector detector = QuatSymmetryDetector(structure , parameters);
42+ QuatSymmetryDetector detector = QuatSymmetryDetector(s , parameters, clusterParams );
4343
44- // The getters calculate the quaternary symmetry automatically
45- List< QuatSymmetryResults > globalResults = detector . getGlobalSymmetry();
46- List<List< QuatSymmetryResults >> localResults = detector . getLocalSymmetries();
44+ // Static methods in QuatSymmetryDetector perform the calculation
45+ QuatSymmetryResults globalResults = QuatSymmetryDetector . getGlobalSymmetry(s, parameters, clusterParams );
46+ List<QuatSymmetryResults > localResults = QuatSymmetryDetector . getLocalSymmetries(s, parameters, clusterParams );
4747
4848```
49+ See also the demo in the BioJava repo:
50+
51+ https://github.com/biojava/biojava/blob/885600670be75b7f6bc5216bff52a93f43fff09e/biojava-structure/src/main/java/demo/DemoSymmetry.java#L37-L59
52+
4953The return type are ` List ` because there can be multiple valid options for the
5054quaternary symmetry. The local results ` List ` is empty if there exist no local
5155symmetry in the structure, and the global results ` List ` has always size bigger
@@ -83,35 +87,35 @@ only applies to a subset of chains.
8387### Pseudo-Symmetry
8488
8589In ** pseudo-symmetry** the chains related by the symmetry are not completely
86- identical, but they share a sequence similarity above the pseudo-symmetry
90+ identical, but they share a sequence similarity above the pseudo-symmetry
8791similarity threshold.
8892
89- If we consider hemoglobin, at a 95% sequence identity threshold the alpha and
90- beta subunits are considered different, which correspond to an A2B2 stoichiometry
91- and a C2 point group. At the structural similarity level, all four chains are
92- considered homologous (~ 45% sequence identity) with an A4 pseudostoichiometry and
93- D2 pseudosymmetry.
93+ If we consider hemoglobin, at a 95% sequence identity threshold the alpha and
94+ beta subunits are considered different, which correspond to an A2B2 stoichiometry
95+ and a C2 point group. At the structural similarity level, all four chains are
96+ considered homologous (~ 45% sequence identity) with an A4 pseudostoichiometry and
97+ D2 pseudosymmetry.
9498
9599![ PDB ID 4HHB] ( img/symm_pseudo.png )
96100
97101## Internal Symmetry
98102
99- ** Internal symmetry** refers to the symmetry present in a single chain, that is,
100- the tertiary structure. The algorithm implemented in biojava to detect internal
103+ ** Internal symmetry** refers to the symmetry present in a single chain, that is,
104+ the tertiary structure. The algorithm implemented in biojava to detect internal
101105symmetry is called ** CE-Symm** .
102106
103107### CE-Symm
104108
105- The ** CE-Symm** algorithm was originally developed by [ Myers-Turnbull D., Bliven SE.,
109+ The ** CE-Symm** algorithm was originally developed by [ Myers-Turnbull D., Bliven SE.,
106110Rose PW., Aziz ZK., Youkharibache P., Bourne PE. & Prlić A. in 2014]
107111(http://www.sciencedirect.com/science/article/pii/S0022283614001557 ) [ ![ pubmed] ( http://img.shields.io/badge/in-pubmed-blue.svg?style=flat )] ( http://www.ncbi.nlm.nih.gov/pubmed/24681267 ) .
108112As the name of the algorithm explicitly states, ** CE-Symm** uses the Combinatorial
109- Extension (** CE** ) algorithm to generate an alignment of the structure chain to itself,
110- disabling the identity alignment (the diagonal of the ** DotPlot** representation of a
111- structure alignment). This allows the identification of alternative self-alignments,
113+ Extension (** CE** ) algorithm to generate an alignment of the structure chain to itself,
114+ disabling the identity alignment (the diagonal of the ** DotPlot** representation of a
115+ structure alignment). This allows the identification of alternative self-alignments,
112116which are related to symmetry and/or structural repeats inside the chain.
113117
114- By a procedure called ** refinement** , the subunits of the chain that are part of the symmetry
118+ By a procedure called ** refinement** , the subunits of the chain that are part of the symmetry
115119are defined and a ** multiple alignment** is created. This process can be thought as to
116120divide the chain into other subchains, and then superimposing each subchain to each other to
117121create a multiple alignment of the subunits, respecting the symmetry axes.
@@ -156,7 +160,7 @@ System.out.println(pg.getSymmetry());
156160
157161To enable some extra features in the display, a ` SymmetryDisplay `
158162class has been created, although the ` MultipleAlignmentDisplay ` method
159- can also be used for that purpose (it will not show symmetry axes or
163+ can also be used for that purpose (it will not show symmetry axes or
160164symmetry menus).
161165
162166Lastly, the ` SymmetryGUI ` class in the ** structure-gui** package
@@ -166,7 +170,7 @@ to the GUI to trigger structure alignments.
166170### Symmetry Display
167171
168172The symmetry display is similar to the ** quaternary symmetry** , because
169- part of the code is shared. See for example this beta-propeller (1U6D),
173+ part of the code is shared. See for example this beta-propeller (1U6D),
170174where the repeated beta-sheets are connected by a linker forming a C6
171175point group internal symmetry:
172176
@@ -175,10 +179,10 @@ point group internal symmetry:
175179#### Hierarchical Symmetry
176180
177181One additional feature of the ** internal symmetry** display is the representation
178- of hierarchical symmetries and repeats. Contrary to point groups, some structures
179- have different ** levels** of symmetry. That is, the whole strucutre has, e.g. C2
180- symmetry and, at the same time, each of the two parts has C2 symmetry, but the axes
181- of both levels are not related by a point group (i.e. they do not cross to a single
182+ of hierarchical symmetries and repeats. Contrary to point groups, some structures
183+ have different ** levels** of symmetry. That is, the whole strucutre has, e.g. C2
184+ symmetry and, at the same time, each of the two parts has C2 symmetry, but the axes
185+ of both levels are not related by a point group (i.e. they do not cross to a single
182186point).
183187
184188A very clear example are the beta-gamma-crystallins, like 4GCR:
@@ -187,24 +191,24 @@ A very clear example are the beta-gamma-crystallins, like 4GCR:
187191
188192#### Subunit Multiple Alignment
189193
190- Another feature of the display is the option to show the ** multiple alignment** of
194+ Another feature of the display is the option to show the ** multiple alignment** of
191195the symmetry related subunits created during the ** refinement** process. Search for
192- the option * Subunit Superposition* in the * symmetry* menu of the Jmol window. For
196+ the option * Subunit Superposition* in the * symmetry* menu of the Jmol window. For
193197the previous example the display looks like that:
194198
195199![ PDB ID 4GCR] ( img/symm_subunits.png )
196200
197- The subunit display highlights the differences and similarities between the symmetry
201+ The subunit display highlights the differences and similarities between the symmetry
198202related subunits of the chain, and helps the user to identify conseved and divergent
199203regions, with the help of the * Sequence Alignment Panel* .
200204
201205## Combined Global Symmetry
202206
203207Finally, the internal and quaternary symmetries can be combined to obtain the global
204208overall combined symmetry. As we have seen before, the protein 1VYM is a DNA-clamp that
205- has three chains relates by C3 symmetry. Each chain is internally C2 symmetric, and each
206- part of the C2 internal symmetry is C2 symmetric, so a case of ** hierarchical symmetry**
207- (C2 + C2). Once we have divided the whole structure into its asymmetric parts, we can
209+ has three chains relates by C3 symmetry. Each chain is internally C2 symmetric, and each
210+ part of the C2 internal symmetry is C2 symmetric, so a case of ** hierarchical symmetry**
211+ (C2 + C2). Once we have divided the whole structure into its asymmetric parts, we can
208212analyze the global symmetry that related each one of them. The interesting result is that
209213in some cases, the internal symmetry ** multiplies** the point group of the quaternary symmetry.
210214What seemed a C3 + C2 + C2 is combined into a D6 overall symmetry, as we can see in the figure
0 commit comments