Click here for a list of ortholog datasets based on the
QfO reference proteomes of 2020.
Please note that for this service the most recent dataset we use is the QfO dataset from 2020_04.
Provide us with your own orthology inference data. After uploading your predictions, you will be able to compare their quality against other orthology inference efforts using several tests.
Orthology inference is most often based on protein sequences. To compare different orthology prediction methods, a common set of sequences must be established. Therefore, only identical proteins are mapped to each other.
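As an illustration of what mapping only identical proteins can mean in practice, the following is a minimal sketch that matches your own protein identifiers to reference identifiers by exact sequence identity. It is not the service's actual mapping procedure. The function names `parse_fasta` and `map_identical`, the choice to ignore letter case, and the choice to keep the first reference identifier when several reference entries share a sequence are all assumptions made for this example.

```python
def parse_fasta(text):
    """Yield (identifier, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            # Use the first whitespace-delimited token as the identifier.
            parts = line[1:].split()
            header, chunks = (parts[0] if parts else ""), []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def map_identical(own_fasta, reference_fasta):
    """Map each of your IDs to a reference ID with an identical sequence.

    Proteins without an exact match in the reference are left unmapped.
    """
    by_sequence = {}
    for ref_id, seq in parse_fasta(reference_fasta):
        # Assumption: on duplicate sequences, keep the first reference ID.
        by_sequence.setdefault(seq.upper(), ref_id)

    mapping = {}
    for own_id, seq in parse_fasta(own_fasta):
        ref_id = by_sequence.get(seq.upper())
        if ref_id is not None:
            mapping[own_id] = ref_id
    return mapping


if __name__ == "__main__":
    # Hypothetical toy data; real inputs would be proteome FASTA files.
    reference = ">P1\nMKT\nAAA\n>P2\nMKV\n"
    own = ">a some description\nmktaaa\n>b\nMKX\n"
    print(map_identical(own, reference))
```

In this sketch, a protein whose sequence differs from every reference sequence by even one residue is not mapped, which mirrors the strictness described above.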
To make comparisons of methods easier, the orthology research community agreed in 2009 to establish a common QfO reference proteome dataset. Currently we are using the reference proteomes from 2020_04 and 2022_02 for benchmarking. The 2011 dataset can no longer be used for benchmarking, but the results of the public projects remain available for reference.
All releases of the QfO Reference Proteomes are available from UniProtKB's archive FTP server. The currently recommended datasets for benchmarking are:
See the documentation section for additional instructions on how to format your orthology predictions.