Add protein alias and gene alias to uniprot extract#568
Conversation
Catch back up with biojava/master
Catch back up with biojava/master
# Conflicts: # biojava-structure/src/main/java/org/biojava/nbio/structure/io/PDBFileParser.java
|
Just to add: for a complete representation of UniProt in a Java datamodel, see here: |
|
Doesn't that only work if one is persisting it in a database in that format? On Aug 25, 2016 5:17 PM, "Andreas Prlic" notifications@github.com wrote:
|
|
You can just read from UniProt XML, without a DB. |
|
Just added some more docu there to provide an example how to read a UniProt XML into a Java object. For example this can be done: URL u = UniProtTools.getURLforXML(accession);
InputStream inStream = u.openStream();
Uniprot up = UniProtTools.readUniProtFromInputStream(inStream); |
|
Any examples on how to do that? That may be easier then using the proxy I could see an advantage of having it all in our db, though. However, we On Thu, Aug 25, 2016 at 5:22 PM, Andreas Prlic notifications@github.com
|
|
We can reload all of UniProt (and a small subset of Trembl) over night, so we never bothered with writing an incremental update strategy. |
|
so you just tend to dump it all and reload? With some sort of script which On Thu, Aug 25, 2016 at 5:31 PM, Andreas Prlic notifications@github.com
|
|
Also, does everything from the uniprot record currently go in the database, On Thu, Aug 25, 2016 at 5:32 PM, Erik McKee emckee2006@gmail.com wrote:
|
We should move this thread over to the other project :-) |
|
How do we do that? On Thu, Aug 25, 2016 at 5:46 PM, Andreas Prlic notifications@github.com
|
|
UniProt schema changes quite regularly. The project has a built in schema version that is used as a default (see file uniprot.xsd in the resources folder). I'll set up a documentation page for how to run a database load. |
|
So it's not feasible to have the db schema before hand, and always just On Thu, Aug 25, 2016 at 5:54 PM, Andreas Prlic notifications@github.com
|
|
No, the db-schema gets generated at compile time. (The datamodel is auto-generated by the XML schema). As such this is available also before hand. That's why the database in principle is not needed and the code works also when only UniProt XML files are available. Truncating an existing db and just loading is possible. |
|
here some documation for how to load a DB |
No description provided.