Skip to content

Commit f4411a8

Browse files
committed
now with sections for alignment and core modules
1 parent a61b846 commit f4411a8

10 files changed

Lines changed: 227 additions & 9 deletions

File tree

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,4 @@
11
.DS_Store
22
.profile
33
.settings
4+
.idea

README.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,11 +13,13 @@ At the moment this tutorial is still under development. Please check the [BioJa
1313

1414
Quick [Installation](installation.md)
1515

16-
Book 1: [The Protein Structure modules](structure/README.md)
16+
Book 1: [The Core module](core/README.md), basic working with sequences.
1717

18-
Book 2: [The Genomics Module](genomics/README.md)
18+
Book 2: [The Alignment module](alignment/README.md), pairwise and multiple alignments of protein sequences.
1919

20-
Book 3: Alignments
20+
Book 3: [The Protein Structure modules](structure/README.md), everything related to working with 3D structures.
21+
22+
Book 4: [The Genomics Module](genomics/README.md), working with genomic data
2123

2224

2325
## License

alignment/README.md

Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,53 @@
1+
The BioJava - Alignment Module
2+
=====================================================
3+
4+
A tutorial for the alignment module of [BioJava](http://www.biojava.org).
5+
6+
## About
7+
<table>
8+
<tr>
9+
<td>
10+
<img src="img/alignment.png"/>
11+
</td>
12+
<td>
13+
The <i>alignment</i> module of BioJava provides an API that contains
14+
<ul>
15+
<li>Implementations of dynamic programming algorithms for sequence alignment</li>
16+
<li>Reading and Writing of popular alignment file formats</li>
17+
<li>A single-, or multi- threaded multiple sequence alignment algorithm.</li>
18+
</ul>
19+
20+
</td>
21+
</tr>
22+
</table>
23+
24+
## Index
25+
26+
This tutorial is split into several chapters.
27+
28+
Chapter 1 - Quick [Installation](installation.md)
29+
30+
Chapter 2 - Global alignment - Needleman and Wunsch algorithm
31+
32+
Chapter 3 - Local alignment - Smith-Waterman algorithm
33+
34+
Chapter 4 - Multiple Sequence alignment
35+
36+
Chapter 5 - Reading and writing of multiple alignments
37+
38+
Chapter 6 - BLAST - why you don't need BioJava for parsing BLAST
39+
40+
## Please cite
41+
42+
**BioJava: an open-source framework for bioinformatics in 2012**<br/>
43+
*Andreas Prlic; Andrew Yates; Spencer E. Bliven; Peter W. Rose; Julius Jacobsen; Peter V. Troshin; Mark Chapman; Jianjiong Gao; Chuan Hock Koh; Sylvain Foisy; Richard Holland; Gediminas Rimsa; Michael L. Heuer; H. Brandstatter-Muller; Philip E. Bourne; Scooter Willis* <br/>
44+
[Bioinformatics (2012) 28 (20): 2693-2695.](http://bioinformatics.oxfordjournals.org/content/28/20/2693.abstract) <br/>
45+
doi: 10.1093/bioinformatics/bts494
46+
47+
## License
48+
49+
The content of this tutorial is available under the [CC-BY](http://creativecommons.org/licenses/by/3.0/) license.
50+
51+
[view license](../license.md)
52+
53+

alignment/img/alignment.png

38.9 KB
Loading

alignment/installation.md

Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
## Quick Installation
2+
3+
In the beginning, just one quick paragraph of how to get access to BioJava.
4+
5+
BioJava is open source and you can get the code from [Github](https://github.com/biojava/biojava), however it might be easier this way:
6+
7+
BioJava uses [Maven](http://maven.apache.org/) as a build and distribution system. If you are new to Maven, take a look at the [Getting Started with Maven](http://maven.apache.org/guides/getting-started/index.html) guide.
8+
9+
Currently, we are providing a BioJava specific Maven repository at (http://biojava.org/download/maven/) .
10+
11+
You can add the BioJava repository by adding the following XML to your project pom.xml file:
12+
13+
```xml
14+
<repositories>
15+
...
16+
<repository>
17+
<id>biojava-maven-repo</id>
18+
<name>BioJava repository</name>
19+
<url>http://www.biojava.org/download/maven/</url>
20+
</repository>
21+
</repositories>
22+
```
23+
24+
We are currently in the process of changing our distribution to Maven Central, which would not even require this configuration step.
25+
26+
```xml
27+
<dependencies>
28+
...
29+
30+
<!-- This imports the latest version of BioJava core module -->
31+
<dependency>
32+
33+
<groupId>org.biojava</groupId>
34+
<artifactId>biojava3-core</artifactId>
35+
<version>3.0.8</version>
36+
</dependency>
37+
38+
39+
<!-- other biojava jars as needed -->
40+
41+
</dependencies>
42+
```
43+
44+
If you run
45+
46+
<pre>
47+
mvn package
48+
</pre>
49+
50+
on your project, the BioJava dependencies will be automatically downloaded and installed for you.
51+

core/README.md

Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
The BioJava - Core Module
2+
=====================================================
3+
4+
A tutorial for the core module of [BioJava](http://www.biojava.org).
5+
6+
## About
7+
<table>
8+
<tr>
9+
<td>
10+
<img src="img/core.png"/>
11+
</td>
12+
<td>
13+
The <i>core</i> module of BioJava provides an API that provides
14+
<ul>
15+
<li>Basic operations with biological sequences</li>
16+
<li>Reading and Writing of popular sequence file formats</li>
17+
<li>Translate DNA sequences into protein sequences</li>
18+
</ul>
19+
20+
</td>
21+
</tr>
22+
</table>
23+
24+
## Index
25+
26+
This tutorial is split into several chapters.
27+
28+
Chapter 1 - Quick [Installation](installation.md)
29+
30+
Chapter 2 - Reading and Writing of FASTA sequences
31+
32+
Chapter 3 - Translating DNA and protein sequences.
33+
34+
## Please cite
35+
36+
**BioJava: an open-source framework for bioinformatics in 2012**<br/>
37+
*Andreas Prlic; Andrew Yates; Spencer E. Bliven; Peter W. Rose; Julius Jacobsen; Peter V. Troshin; Mark Chapman; Jianjiong Gao; Chuan Hock Koh; Sylvain Foisy; Richard Holland; Gediminas Rimsa; Michael L. Heuer; H. Brandstatter-Muller; Philip E. Bourne; Scooter Willis* <br/>
38+
[Bioinformatics (2012) 28 (20): 2693-2695.](http://bioinformatics.oxfordjournals.org/content/28/20/2693.abstract) <br/>
39+
doi: 10.1093/bioinformatics/bts494
40+
41+
## License
42+
43+
The content of this tutorial is available under the [CC-BY](http://creativecommons.org/licenses/by/3.0/) license.
44+
45+
[view license](../license.md)
46+
47+

core/img/core.png

17.6 KB
Loading

core/installation.md

Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
## Quick Installation
2+
3+
In the beginning, just one quick paragraph of how to get access to BioJava.
4+
5+
BioJava is open source and you can get the code from [Github](https://github.com/biojava/biojava), however it might be easier this way:
6+
7+
BioJava uses [Maven](http://maven.apache.org/) as a build and distribution system. If you are new to Maven, take a look at the [Getting Started with Maven](http://maven.apache.org/guides/getting-started/index.html) guide.
8+
9+
Currently, we are providing a BioJava specific Maven repository at (http://biojava.org/download/maven/) .
10+
11+
You can add the BioJava repository by adding the following XML to your project pom.xml file:
12+
13+
```xml
14+
<repositories>
15+
...
16+
<repository>
17+
<id>biojava-maven-repo</id>
18+
<name>BioJava repository</name>
19+
<url>http://www.biojava.org/download/maven/</url>
20+
</repository>
21+
</repositories>
22+
```
23+
24+
We are currently in the process of changing our distribution to Maven Central, which would not even require this configuration step.
25+
26+
```xml
27+
<dependencies>
28+
...
29+
30+
<!-- This imports the latest version of BioJava core module -->
31+
<dependency>
32+
33+
<groupId>org.biojava</groupId>
34+
<artifactId>biojava3-core</artifactId>
35+
<version>3.0.8</version>
36+
</dependency>
37+
38+
39+
<!-- other biojava jars as needed -->
40+
41+
</dependencies>
42+
```
43+
44+
If you run
45+
46+
<pre>
47+
mvn package
48+
</pre>
49+
50+
on your project, the BioJava dependencies will be automatically downloaded and installed for you.
51+

genomics/README.md

Lines changed: 1 addition & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -37,14 +37,9 @@ Chapter 5 - Reading and writing a [Genebank](genebank.md) file
3737

3838
Chapter 5 - Reading [karyotype (cytoband)](karyotype.md) files
3939

40-
Chapter 6 - Reading UCSC's .2bit files
40+
Chapter 6 - Reading genomic DNA sequences using UCSC's [.2bit file format](twobit.md)
4141

4242

43-
44-
### Author:
45-
46-
[Andreas Prli&#263;](https://github.com/andreasprlic)
47-
4843
## Please cite
4944

5045
**BioJava: an open-source framework for bioinformatics in 2012**<br/>

genomics/twobit.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
Reading a .2bit file
2+
====================
3+
4+
UCSC's .2bit files provide a compact representation of the DNA sequences for a genome. The TwoBitParser class provides
5+
the access to the content of this file.
6+
7+
```java
8+
File f = new File("/path/to/file.2bit");
9+
TwoBitParser p = new TwoBitParser(File f);
10+
11+
String[] names = p.getSequenceNames();
12+
for(int i=0;i<names.length;i++) {
13+
p.setCurrentSequence(names[i]);
14+
p.printFastaSequence();
15+
p.close();
16+
}
17+
18+
```

0 commit comments

Comments
 (0)