ProGMap: an integrated annotation resource for protein orthology

A. Kuzniar, K. Lin, Y. He, H. Nijveen, S. Pongor, J.A.M. Leunissen

Research output: Contribution to journalArticleAcademicpeer-review

11 Citations (Scopus)


Current protein sequence databases employ different classification schemes that often provide conflicting annotations, especially for poorly characterized proteins. ProGMap (Protein Group Mappings, is a web-tool designed to help researchers and database annotators to assess the coherence of protein groups defined in various databases and thereby facilitate the annotation of newly sequenced proteins. ProGMap is based on a non-redundant dataset of over 6.6 million protein sequences which is mapped to 240 000 protein group descriptions collected from UniProt, RefSeq, Ensembl, COG, KOG, OrthoMCL-DB, HomoloGene, TRIBES and PIRSF. ProGMap combines the underlying classification schemes via a network of links constructed by a fast and fully automated mapping approach originally developed for document classification. The web interface enables queries to be made using sequence identifiers, gene symbols, protein functions or amino acid and nucleotide sequences. For the latter query type BLAST similarity search and QuickMatch identity search services have been incorporated, for finding sequences similar (or identical) to a query sequence. ProGMap is meant to help users of high throughput methodologies who deal with partially annotated genomic data
Original languageEnglish
Pages (from-to)W428-W434
JournalNucleic acids research
Publication statusPublished - 2009


  • database
  • gene
  • information
  • families
  • genomes
  • mbl2
  • tool


Dive into the research topics of 'ProGMap: an integrated annotation resource for protein orthology'. Together they form a unique fingerprint.

Cite this