devldevelopment
diff --git a/‎README.md‎
Lines changed: 3 additions & 4 deletions b/‎README.md‎
Lines changed: 3 additions & 4 deletions
diff --git a/‎structure/README.md‎
Lines changed: 11 additions & 11 deletions b/‎structure/README.md‎
Lines changed: 11 additions & 11 deletions
diff --git a/‎structure/alignment-data-model.md‎
Lines changed: 22 additions & 2 deletions b/‎structure/alignment-data-model.md‎
Lines changed: 22 additions & 2 deletions
diff --git a/‎structure/alignment.md‎
Lines changed: 136 additions & 31 deletions b/‎structure/alignment.md‎
Lines changed: 136 additions & 31 deletions
@@ -6,7 +6,7 @@ A brief introduction into [BioJava](https://github.com/biojava/biojava).
 
 The goal of this tutorial is to provide an educational introduction into some of the features that are provided by BioJava. 
 
-At the moment this tutorial is still under development. Please check  the [BioJava Cookbook](http://biojava.org/wiki/BioJava:CookBook3.0) for a more comprehensive collection of many examples of what is possible with BioJava and how to do things.
+At the moment this tutorial is still under development. Please check  the [BioJava Cookbook](http://biojava.org/wiki/BioJava:CookBook3.0) for a more comprehensive collection of examples about what is possible with BioJava and how to do things.
 
 ## Index
 
@@ -16,10 +16,9 @@ Book 1: [The Core module](core/README.md), basic working with sequences.
 
 Book 2: [The Alignment module](alignment/README.md), pairwise and multiple alignments of protein sequences.
 
-Book 3: [The Protein Structure modules](structure/README.md), everything related to working with 3D structures.
-
-Book 4: [The Genomics Module](genomics/README.md), working with genomic data
+Book 3: [The Structure modules](structure/README.md), everything related to working with 3D structures.
 
+Book 4: [The Genomics Module](genomics/README.md), working with genomic data.
 
 ## License
 
 
@@ -1,7 +1,7 @@
-The Protein Structure Modules of BioJava
+The Structure Modules of BioJava
 =====================================================
 
-A tutorial for the protein structure modules of [BioJava](http://www.biojava.org)
+A tutorial for the structure modules of [BioJava](http://www.biojava.org)
 
 ## About
 <table>
@@ -32,35 +32,35 @@ Chapter 1 - Quick [Installation](installation.md)
 
 Chapter 2 - [First Steps](firststeps.md)
 
-Chapter 3 - The [data model](structure-data-model.md) for the representation of macromolecular structures.
+Chapter 3 - The [Structure Data Model](structure-data-model.md), for the representation of macromolecular structures
 
-Chapter 4 - [Local installations](caching.md) of PDB
+Chapter 4 - [Local Installations](caching.md) of PDB
 
 Chapter 5 - The [Chemical Component Dictionary](chemcomp.md)
 
-Chapter 6 - How to [work with mmCIF/PDBx files](mmcif.md)
+Chapter 6 - How to [Work with mmCIF/PDBx Files](mmcif.md)
 
-Chapter 7 - [SEQRES and ATOM records](seqres.md), mapping to Uniprot (SIFTs)
+Chapter 7 - [SEQRES and ATOM Records](seqres.md), mapping to Uniprot (SIFTs)
 
-Chapter 8 - Protein [Structure Alignments](alignment.md)
+Chapter 8 - [Structure Alignments](alignment.md)
 
 Chapter 9 - [Biological Assemblies](bioassembly.md)
 
 Chapter 10 - [External Databases](externaldb.md) like SCOP &amp; CATH
 
 Chapter 11 - [Accessible Surface Areas](asa.md)
 
-Chapter 12 - [Contacts within a chain and between chains](contact-map.md)
+Chapter 12 - [Contacts Within a Chain and between Chains](contact-map.md)
 
-Chapter 13 - Finding all interfaces in crystal: [crystal contacts](crystal-contacts.md)
+Chapter 13 - Finding all Interfaces in Crystal: [Crystal Contacts](crystal-contacts.md)
 
 Chapter 14 - Protein Symmetry
 
 Chapter 15 - Bonds
 
 Chapter 16 - [Special Cases](special.md)
 
-Chapter 17 - [Lists](lists.md) of PDB IDs and PDB [status information](lists.md).
+Chapter 17 - [Lists](lists.md) of PDB IDs and PDB [Status Information](lists.md)
 
 
 ### Author: 
@@ -88,7 +88,7 @@ The content of this tutorial is available under the [CC-BY](http://creativecommo
 
 Navigation:
 [Home](../README.md)
-| Book 3: The Protein Structure modules
+| Book 3: The Structure modules
 
 Prev: [Book 2: The Alignment module](../alignment/README.md)
 
 
@@ -15,6 +15,8 @@ Some of the important stored variables are:
 * BlockRotationMatrix: rotation component of the superposition transformation.
 * BlockShiftVector: translation component of the superposition transformation.
 
+BioJava class: [org.biojava.bio.structure.align.model.AFPChain](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/model/AFPChain.html)
+
 ### The Optimal Alignment
 
 The residue equivalencies of the alignment (EQRs) are described in the optimal 
@@ -80,6 +82,8 @@ In addtition, the data structure is not limited in the number and types of score
 it can store, because the scores are stored in a key:value fashion, as it will be
 described later.
 
+BioJava class: [org.biojava.bio.structure.align.multiple.MultipleAlignment](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/MultipleAlignment.html)
+
 ### Object Hierarchy
 
 The biggest difference with `AFPChain` is that the `MultipleAlignment` data 
@@ -167,8 +171,20 @@ on a `MultipleAlignment`:
   double bsRMSD = alignment.getScore('bsRMSD');
 ```
 
-Methods and names for some frequent scores are located in a util class called
-`MultipleAlignmentScorer`.
+### Manipulating Multiple Alignments
+
+Some classes are designed to contain utility methods for manipulating a `MultipleAlignment` object.
+The most important ones are ennumerated and briefly described below:
+
+* [MultipleAlignmentScorer](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/util/MultipleAlignmentScorer.html): contains frequent names for scores and methods to calculate them.
+
+* [MultipleAlignmentTools](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/util/MultipleAlignmentTools.html): contains helper methods, such as sequence alignment calculation, transform atom arrays of the structures or calculate aligned residue distances between all structures.
+
+* [MultipleAlignmentWriter](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/util/MultipleAlignmentWriter.html): contains methods to generate different types of String outputs of the alignment, e.g. FASTA, XML, FatCat.
+
+* [MultipleSuperimposer](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/util/MultipleSuperimposer.html): interface for implementations that calculate the structure superpositions of the alignment. Some examples of implementations are the ReferenceSuperimposer (superimposes all the structures to a reference) and the CoreSuperimposer (only uses EQRs present in all structures, without gaps, to superimpose them).
+
+* [MultipleAlignmentXMLParser](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/xml/MultipleAlignmentXMLParser.html): contains a method to create a `MultipleAlignment` object from an XML file representation.
 
 ### Overview
 
@@ -207,3 +223,7 @@ the first representation supports any number of structures, while the second is
 only supporting pairwise alignments. However, the conversion can be done with some
 lines of code if needed (instantiate a new `AFPChain` and copy one by one the 
 properties that can be represented from the `MultipleAlignment`.
+
+===
+
+Go back to [Chapter 8 : Structure Alignments](alignment.md).
@@ -1,46 +1,62 @@
-Structure Alignment
+Structure Alignments
 ===========================
 
 ## What is a Structure Alignment?
 
-A **structural alignment** attempts to establish equivalences between two or more polymer structures based on their shape and three-dimensional conformation. In contrast to simple structural superposition (see below), where at least some equivalent residues of the two structures are known, structural alignment requires no a priori knowledge of equivalent positions.
-
-**Structural alignment** is a valuable tool for the comparison of proteins with low sequence similarity, where evolutionary relationships between proteins cannot be easily detected by standard sequence alignment techniques. **Structural alignment** can therefore be used to imply evolutionary relationships between proteins that share very little common sequence. However, caution should be exercised when using the results as evidence for shared evolutionary ancestry, because of the possible confounding effects of convergent evolution by which multiple unrelated amino acid sequences converge on a common tertiary structure.
-
-**Structural alignment** of other biological structures can also be made in BioJava. For example, nucleic acids can
-be structurally aligned to find common structural motifs, independent of sequence simililarity. This is specially
-important for RNAs, because their 3D structure arrangement is important for their function.
+A **structural alignment** attempts to establish equivalences between two or 
+more polymer structures based on their shape and three-dimensional conformation. 
+In contrast to simple structural superposition (see below), where at least some 
+equivalent residues of the two structures are known, structural alignment requires 
+no a priori knowledge of equivalent positions.
+
+A **structural alignment** is a valuable tool for the comparison of proteins with 
+low sequence similarity, where evolutionary relationships between proteins cannot 
+be easily detected by standard sequence alignment techniques. Therefore, a 
+**structural alignment** can be used to imply evolutionary relationships between 
+proteins that share very little common sequence. However, caution should be exercised 
+when using the results as evidence for shared evolutionary ancestry, because of the 
+possible confounding effects of convergent evolution by which multiple unrelated amino 
+acid sequences converge on a common tertiary structure.
+
+A **structural alignment** of other biological polymers can also be made in BioJava.
+For example, nucleic acids can be structurally aligned to find common structural motifs, 
+independent of sequence simililarity. This is specially important for RNAs, because their
+3D structure arrangement is important for their function.
 
 For more info see the Wikipedia article on [structure alignment](http://en.wikipedia.org/wiki/Structural_alignment).
 
 ## Alignment Algorithms supported by BioJava
 
 BioJava comes with a number of algorithms for aligning structures. The following
 five options are displayed by default in the graphical user interface (GUI),
-although others can be accessed programmatically using the methods in
-[StructureAlignmentFactory]
-(http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/StructureAlignmentFactory.html).
+although others can be accessed programmatically using the methods in 
+[StructureAlignmentFactory](http://www.biojava.org/docs/api/org/biojava/nbio/structure/align/StructureAlignmentFactory.html).
 
 1. Combinatorial Extension (CE)
 2. Combinatorial Extension with Circular Permutation (CE-CP)
 3. FATCAT - rigid
 4. FATCAT - flexible.
 5. Smith-Waterman superposition
 
-CE and FATCAT both use structural similarity to align the structures, while
-Smith-Waterman performs a local sequence alignment and then displays the result
+**CE** and **FATCAT** both use structural similarity to align the structures, while
+**Smith-Waterman** performs a local sequence alignment and then displays the result
 in 3D. See below for descriptions of the algorithms.
 
-Since BioJava version 4.1.0, multiple structure alignments can be generated and visualized. 
+Since BioJava version 4.1.0, multiple structures can be compared at the same time in 
+a **multiple structure alignment**, that can later be visualized in Jmol. 
 The algorithm is described in detail below. As an overview, it uses any pairwise alignment 
-algorithm and a reference structure to align all of the structures. Then, it runs a Monte 
-Carlo optimization method to determine the residue equivalencies between all the strucutures,
-identifying conserved structural motifs.
+algorithm and a **reference** structure to per perform an alignment of all the structures. 
+Then, it runs a **Monte Carlo** optimization to determine the residue equivalencies among
+all the strucutures, identifying conserved **structural motifs**.
 
 ## Alignment User Interface
 
 Before going the details how to use the algorithms programmatically, let's take
-a look at the user interface that cames with the *biojava-structure-gui* module.
+a look at the user interface that comes with the *biojava-structure-gui* module.
+
+### Pairwise Alignment GUI
+
+Generating an instance of the GUI is just one line of code:
 
 ```java
 AlignmentGui.getInstance();
@@ -60,9 +76,45 @@ and also a 2D display, that interacts with the 3D display
 
 ![2D Alignment of PDB IDs 2hyn and 1zll](img/alignmentpanel.png)
 
-The functionality to perform and visualize these alignments can of course be
-used also from your own code. Let's first have a look at the alignment
-algorithms.
+### Multiple Alignment GUI
+
+Because of the inherent difference between multiple and pairwise alignments,
+a separate GUI is used to trigger multiple structural alignments. Generating 
+an instance of the GUI is analogous to the pairwise alignment GUI:
+
+```java
+MultipleAlignmentGUI.getInstance();
+```
+
+This code shows the following user interface:
+
+![Multiple Alignment GUI](img/multiple_gui.png)
+
+The input format is a free text field, where the structure identifiers are 
+indidcated, space separated. A **structure identifier** is a String that 
+uniquely identifies a structure. It is basically composed of the pdbID, the
+chain letters and the ranges of residues of each chain. For the formal description
+visit [StructureIdentifier](http://www.biojava.org/docs/api/org/biojava/nbio/structure/StructureIdentifier.html).
+
+As an example, a multiple structure alignment of 6 globins is shown here. 
+Their structure identifiers are shown in the previous figure of the GUI.
+The results are shown in a graphical way, as for the pairwise alignments:
+
+![3D Globin Multiple Alignment](img/multiple_jmol_globins.png)
+
+The only difference with the Pairwise Alignment View is the possibility to show
+a subset of structures to be visualized, by checking the boxes under the 3D
+window and pressing the Show Only button afterwards.
+
+A **sequence alignment panel** that interacts with the 3D display can also be shown.
+
+![3D Globin Multiple Panel](img/multiple_panel_globins.png)
+
+Explore the coloring options in the *Edit* menu, and through the *View* menu for 
+alternative representations of the alignment.
+
+The functionality to perform and visualize these alignments can also be
+used from your own code. Let's first have a look at the alignment algorithms.
 
 ## Pairwise Alignment Algorithms
 
@@ -175,9 +227,33 @@ interface.
 
 ## Multiple Structure Alignment
 
-Since BioJava 4.1.0, multiple structure alignments can be generated.
+This Java implementation for multiple structure alignments, named MultipleMC, is based on the original CE-MC implementation by [Guda C, Scheeff ED, Bourne PE &amp; Shindyalov IN in 2001](http://psb.stanford.edu/psb-online/proceedings/psb01/abstracts/p275.html)
+[![pubmed](http://img.shields.io/badge/in-pubmed-blue.svg?style=flat)](http://www.ncbi.nlm.nih.gov/pubmed/11262947).
+
+The idea remains unchanged: perform **all-to-all pairwise alignments** of the structures, choose the 
+**reference** as the most similar structure to all others and run a **Monte Carlo optimization** of
+the multiple residue equivalencies (EQRs) to minimize a score function that depends on the inter-residue
+distances.
+
+Although the main idea is the same as in the original algorithm, some details of the implementation have 
+been changed in the BioJava version. They are described in the main class, but as a summary:
+
+1. It accepts **any pairwise alignment** algorithm (instead of being attached to CE), so any
+of the algorithms described before is suitable for generating a seed for optimization. Note that
+this property allows *non-topological* and *flexible* multiple structure alignments, always restricted
+by the pairwise alignment algorithm limitations.
+2. The **moves** in the Monte Carlo optimization have been simplified to 3, instead of 4.
+3. A **new move** to insert and delete individual gaps has been added.
+4. The scoring function has been modified to a **continuous** function, maintaining the properties that the authors described.
+5. The **probability function** is normalized in synchronization with the optimization progression, to improce the convergence into a score maximum after some random exploration of the multidimensiona space. 
+
+The algorithm performs similarly to other multiple structure alignment algorithms for most protein families. 
+The parameters both for the pairwise aligner and the MC optimization can have an impact on the final result. There is not a unique set of parameters, because they usually depend on the specific case. Thus, trying some parameter combinations, keeping in mind the effect they produce in the score function, is a good practice when doing structure alignments.
 
-## PDB-wide database searches
+BioJava class: [org.biojava.nbio.structure.align.multiple.mc.MultipleMcMain]
+(www.biojava.org/docs/api/org/biojava/nbio/structure/align/multiple/mc/MultipleMcMain.html)
+
+## PDB-wide Database Searches
 
 The Alignment GUI also provides functionality for PDB-wide structural searches.
 This systematically compares a structure against a non-redundant set of all
@@ -213,10 +289,10 @@ the `PDB_DIR` environmental variable. This operation sped up the search from
 about 30 hours to less than 4 hours.
 
 
-## Creating alignments programmatically
+## Creating Alignments Programmatically
 
-The various structure alignment algorithms in BioJava implement the
-`StructureAlignment` interface, and are normally accessed through
+The **pairwise structure alignment** algorithms in BioJava implement the
+`StructureAlignment` interface, and are usually accessed through
 `StructureAlignmentFactory`. Here's an example of how to create a CE-CP
 alignment and print some information about it.
 
@@ -242,13 +318,43 @@ To display the alignment using Jmol, use:
 
 ```java
 GuiWrapper.display(afpChain, ca1, ca2);
-// Or StructureAlignmentDisplay.display(afpChain, ca1, ca2);
+// Or using the biojava-structure-gui module
+StructureAlignmentDisplay.display(afpChain, ca1, ca2);
 ```
 
 Note that these require that you include the structure-gui package and the Jmol
 binary in the classpath at runtime.
 
-## Command-line tools
+For creating **multiple structure alignments**, the code is a little bit different, because the
+returned data structure and the number of input structures are different. Here is an 
+example of how to create and display a multiple alignment:
+
+```java
+//Specify the structures to align: some ASP-proteinases
+List<String> names = Arrays.asList("3app", "4ape", "5pep", "1psn", "4cms", "1bbs.A", "1smr.A");
+
+//Load the CA atoms of the structures
+AtomCache cache = new AtomCache();
+List<Atom[]> atomArrays = new ArrayList<Atom[]>();
+for (String name:names)	{
+  atomArrays.add(cache.getAtoms(name));
+}
+
+//Generate the multiple alignment algorithm with the chosen pairwise algorithm
+StructureAlignment pairwise  = StructureAlignmentFactory.getAlgorithm(CeMain.algorithmName);
+MultipleMcMain multiple = new MultipleMcMain(pairwise);
+
+//Perform the alignment
+MultipleAlignment result = algorithm.align(atomArrays);
+
+//Output the FASTA sequence alignment
+System.out.println(MultipleAlignmentWriter.toFASTA(result));
+
+//Display the results in a 3D view
+MultipleAlignmentDisplay.display(result);
+```
+
+## Command-Line Tools
 
 Many of the alignment algorithms are available in the form of command line
 tools. These can be accessed through the main methods of the StructureAlignment
@@ -265,8 +371,7 @@ alignments in batch mode, or full database searches. Some additional parameters
 are available which are not exposed in the GUI, such as outputting results to a
 file in various formats.
 
-
-## See Also
+## Alignment Data Model
 
 For details about the structure alignment data models in biojava, see [Structure Alignment Data Model](alignment-data-model.md)
 
@@ -280,7 +385,7 @@ Thanks to P. Bourne, Yuzhen Ye and A. Godzik for granting permission to freely u
 
 Navigation:
 [Home](../README.md)
-| [Book 3: The Protein Structure modules](README.md)
+| [Book 3: The Structure modules](README.md)
 | Chapter 8 : Structure Alignments
 
 Prev: [Chapter 7 : SEQRES and ATOM records](seqres.md)