OrthoInspector v2.21: New option in the Uniprot utility (11/08/2015)
An option was added to ignore the correction of the gene redundancy (only the longest transcript per gene was kept). Filtered sequences are now mentionned in a log file. See tutorial for details.
OrthoInspector v2.20: New tool to build OrthoInspector inputs from Uniprot reference proteomes (06/08/2015)
+ Addition of a new utility: OrthoInspector_uniprot_tools.jar
This tool facilitates the initial selection of a panel of organisms to create the orthology database. Currently, it does some checkups for the Uniprot fasta proteomes and generates the corresponding organism.xml and fasta files which are used as inputs in the first step of OrthoInspector installation. A new website tutorial details its use.
* Graphical interface:
Some corrections in the phylogenetic profile tool, related to errors when duplicated names occur in some NCBI Taxonomy phylums (mostly when sub-genus have the same name as the genus).
Minor bug fixes.
What is OrthoInspector ?
OrthoInspector is a package dedicated to orthology inference and analysis. The package can be installed for local use on a desktop computer or configured for a server implementation. Its purpose is to facilitate the installation of a database of newly inferred orthology relations and facilitate the exploitation of the resulting orthology database by both expert and non-expert users.
What are the requirements ?
OrthoInspector is compatible with all systems supporting Java 1.6.x JVM and superior. Its components are based on a SQL database / Java client architecture. Consequently, you need to setup an access to an SQL engine (locally or remotely). Any database engine can be used as soon as a Java JDBC driver is available for it. (see complete Requirements for details). Exploiting the inferred orthologs will also require a local installation of the latest NCBI blast+ package.
Why do I need a SQL database ?
While the package can be installed on a single machine, we choose to code OrthoInspector in Java and through a SQL database / Java client architecture to facilitates its exploitation in a network environment. Orthlogy relations are first predicted from a set of proteomes and a blast all-against-all and are transferred to an SQL database. Then, these relations can be exploited by several clients, allowing several users to query the database via a graphical interface or allowing several pipelines to exploit these data simultaneously.
Be aware that OrthoInspector can be installed either on a network or on a single desktop computer !
How is organized the package and what can I do with it ?
The package is separated in three Java executables (and a few configuration files) :
Both the Installation command-line and the Graphical interface can be used to infer orthology relations. The query command-line and most other functions from the graphical interface are designed to exploit these predictions.
This component is recommended to infer large orthology databases from large datasets (> 100 species). A similar procedure is available in the graphical interface but the Installation command line provides several useful options to handle larger datasets. For instance, it can read compressed blast inputs, create compressed outputs, share predictions to several CPUs and facilitate crash recovery (useful when your calculations require a weeks of CPU...).
This component is designed for basic data retrieval from the orthology database. You can query the databases with textual searches, fasta queries or export the entire set of orthologs related to a particular organism. It can easily be integrated into other bioinformatic pipelines.
Contains most functionalities and is a good option for non-specialists which want to avoid command-line manipulations (but you still need to install and configure the SQL database). You can install a small database (<100 organisms) with an automated procedure. This procedure is fully automated if you use MySQL or Postgresql engines. The graphical interface contains advanced querying tools, such as sequence extractions based on phylogenetic profiles. All the graphical tools dedicated to orthology analysis and visualization (Euler diagrams, automated phylogenetic profile construction...) are also in the Graphical interface.
To know more about these tools, consult the dedicated tutorials which are available on this website.
What about the online databases ?
We have pre-computed 3 orthology databases, exploring most known clades:
Distribution and modifications.
OrthoInspector is licensed under the GNU GPL version 3. You can download it, modify it and share it at your convenience but don't forget to cite the authors ! ;-)
OrthoInspector v2.15 Bugfixes (23/06/2015)
* All tools:
Embedded property files have been removed, forcing the user to correctly configure local property files. New error messages help to verify that OrthoInspector is attempting to load the correct property files ('/path/to/' errors, due to unconfigured oi_client.properties files, are now gone). Several other minor bug fixes.
OrthoInspector v2.14 Bugfixes (1/09/2014)
* Graphical interface:
Bug correction in the VennEuler diagram interface.
* MySQL database dump:
Spelling correction in the MySQL database dumps. This error only occurred in a MySQL-based installation. The Best-hit density graph was crashing, the 'Hit' table had a column called 'pk_sequence_hit' instead of the expected 'pk_sequence' column).
OrthoInspector v 2.13 - Bugfixes (18/07/2014)
* Command-line: Correction of some bugs introduced with multi-threading options. The last step (orthology calculation) was crashing in some circumstances.
* Several minor bugs corrections
OrthoInspector v 2.12 - Bugfixes (7/05/2014)
OrthoInspector v 2.11 - Bugfixes and (virtual) multi-threading (7/04/2014)
OrthoInspector v 2.10 - Introduction of GZIP support and bugfixes (15/07/2013)
OrthoInspector v 2.00 - Major update (5/02/2013)
A new major version of OrthoInspector is now released. It is focused on interface improvements for a more user-friendly navigation. No changes were done on the orthology prediction algorithm. Consequently, the command-line binaries are quite similar to previous versions.
Major software improvements:
+ Interactive tree to select organisms and whole phylums directly.
+ Release of the second version of the online database (it contains now 259 eukaryotic species).
We would be grateful for your replies if you find new bugs in this major release.
New website section - "Reference proteomes" (09/07/2012)
Following the "Quest for Orthologs" consortium recommandations, several curated reference proteomes are now freely available in EBI ftp. Orthology calculations based on these datasets can be downloaded from "Reference proteomes" menu.