Web Server
Query Sequence ID: None
Query Protein: None
BLAST Job ID: None
BLAST Status: n/a

Welcome to OrthoInspector Web Front-End

This web front-end to the OrthoInspector database will allow you to search for orthology and inparalogy relations over various eukaryotic species, searching either by using keywords (textual search button) or by using protein BLAST (BLAST query button). Furthermore, you can view some stats about the OrthoInspector database (statistics button).

If you would like to learn more about OrthoInspector itself, feel free to visit our website (home button).

We wish you an enjoyable stay in the orthology/inparalogy world!

Data sources

You can download the proteomes used to produce the OrthoInspector predictions.
Each organism is represented by a single file.

⇒ ACCESS PROTEOME SOURCES

Orthology predictions

You can download all orthology predictions.
Each file contains predictions for one organism compared to 59 other organisms.

⇒ ACCESS ONE-TO-ONE RELATIONS

⇒ ACCESS ONE-TO-MANY RELATIONS

⇒ ACCESS MANY-TO-MANY RELATIONS

⇒ WHOLE SET OF RELATIONS (563 Mo)

Fields description :

Organization of a single linerelation_id;protein_1;protein_2
Data related to a proteinidentifier Gender species description(n_words)
cross_references(ensembl_only)
Cross references
(only for sequences coming from ensembl)
xref::sp=(swissprot_id):tr=(trembl_id)
:np=(refseqp_np_id):xp=(refseqp_xp_id)

One-to-many and many-to-many relations are described with multiple lines. The relation id must be used as a reference to differentiate the relations.
Example:

1245;protein_1;protein_2
1245;protein_1;protein_3
→ describes the 1-to-many relation protein_1 ↔ (protein_2,protein_3)

13465;protein_1;protein_3
13465;protein_1;protein_4
13465;protein_2;protein_3
13465;protein_2;protein_4
→ describes the many-to-many relation (protein_1,protein_2) ↔ (protein_3,protein_4)

OrthoInspector Database Statistics

The blast all against all used to produce the OrthoInspector results used a 1.10-9 blast Evalue threshold.

For the 22 higher eukaryotes, protein sequence datasets from Ensembl 56 were used. To avoid multiple transcript issues, the longest protein sequence was selected for each Ensembl-predicted gene annotated as 'protein-coding'.

For data coming from Uniprot or RefseqP, each sequence was compared to all others from the same organism using Blast (1.10-9 Evalue threshold). For sequences sharing more than 99% identity, manually-annotated entries from Swissprot were preferred over TrEMBL and RefseqP entries, otherwise the longest sequence was retained.

Textual search

Enter your keyword(s) for a text search query: You can search using accession numbers (UniProt, RefSeq, Ensembl) as well as gene or protein names.
For example, you could use the following IDs that are all related to the same protein: MTMR1_HUMAN , NP_003819 , Q13613 or ENSG00000063601 .

Protein BLAST for sequence similarity

Enter your query protein sequence below in FASTA format or plain text:
(If multiple sequences are uploaded, only the first one is taken into account)
Or upload file containing sequence: Expect value cutoff:
 EXPAND 


Filter your relations by selecting organisms in this tree.
You can select them manually or use the grey search box with genus/species/taxonomic_ids.
Tip: To search several organism/phylum at the same time, separate each word with a comma (,).

No organisms selected.
   At least 3 characters are required.

   At least 10 characters are required.