This web front-end to the OrthoInspector database will allow you to search for orthology and inparalogy relations over various eukaryotic species, searching either by using keywords (textual search button) or by using protein BLAST (BLAST query button). Furthermore, you can view some stats about the OrthoInspector database (statistics button).
If you would like to learn more about OrthoInspector itself, feel free to visit our website (home button).
We wish you an enjoyable stay in the orthology/inparalogy world!
You can download the proteomes used to produce the OrthoInspector predictions.
Each organism is represented by a single file.
You can download all orthology predictions.
Each file contains predictions for one organism compared to 59 other organisms.
⇒ ACCESS ONE-TO-MANY RELATIONS
⇒ ACCESS MANY-TO-MANY RELATIONS
⇒ WHOLE SET OF RELATIONS (563 Mo)
| Organization of a single line | relation_id;protein_1;protein_2 |
| Data related to a protein | identifier Gender species description(n_words) cross_references(ensembl_only) |
| Cross references (only for sequences coming from ensembl) | xref::sp=(swissprot_id):tr=(trembl_id) :np=(refseqp_np_id):xp=(refseqp_xp_id) |
One-to-many and many-to-many relations are described with multiple lines. The relation id must be used as a reference to differentiate the relations.
Example:
1245;protein_1;protein_2
1245;protein_1;protein_3
→ describes the 1-to-many relation protein_1 ↔ (protein_2,protein_3)
13465;protein_1;protein_3
13465;protein_1;protein_4
13465;protein_2;protein_3
13465;protein_2;protein_4
→ describes the many-to-many relation (protein_1,protein_2) ↔ (protein_3,protein_4)
The blast all against all used to produce the OrthoInspector results used a 1.10-9 blast Evalue threshold.
For the 22 higher eukaryotes, protein sequence datasets from Ensembl 56 were used. To avoid multiple transcript issues, the longest protein sequence was selected for each Ensembl-predicted gene annotated as 'protein-coding'.
For data coming from Uniprot or RefseqP, each sequence was compared to all others from the same organism using Blast (1.10-9 Evalue threshold). For sequences sharing more than 99% identity, manually-annotated entries from Swissprot were preferred over TrEMBL and RefseqP entries, otherwise the longest sequence was retained.