A Perl Based Data Mining Workflow For Animal Breeding - From Phenotype To SNP

H.J.W.C. Megens, R.P.M.A. Crooijmans, B. Harlizius, S. Kollers, E. Mullaart, B.J. Jungerius, H.H.D. Kerstens, M.A.M. Groenen

Research output: Contribution to conferenceAbstract


With vast and rapidly increasing resources of genomic data available, there is a challenge for breeders to utilize this information in their breeding practices. While tools to combine multiple databases to aid in selecting genes and SNPs for medical studies have become publicly available, none exist that can comprehensively and species specifically do this for livestock animals. A consortium of Wageningen University and three major Dutch animal breeding companies aims to develop tools for mining genomic data specifically for breeding purposes. This is done by a Perl-based pipeline integrating genomic data, both from livestock and model species, from multiple databases, such as EnsEMBL and NCBI. The pipeline has a modular design which facilitates its use as modular tool development platform for our genomics research. A CGI-based interface provides user-friendly access to the tool (log-in is currently required). Based on PubMed, GO, gene-network information and QTL data, multiple candidate genes are automatically selected. These candidate genes are subsequently searched for putative functional information. To this end, several databases (e.g. MiRbase) and programs (e.g. clover, footprinter, miRanda) are implemented to find putative functional sites in or near the gene such as miRNA and transcription factor binding sites. Finally, public SNP databases are searched for variation at these functional sites. The main objective of the tool is the development of large SNP sets consisting of putative functional SNPs, thus facilitating a `high throughput¿ implementation of the candidate gene approach.
Original languageEnglish
Publication statusPublished - 2008
EventPlant and Animal Genome XVI Conference -
Duration: 12 Jan 200816 Jan 2008


ConferencePlant and Animal Genome XVI Conference


Dive into the research topics of 'A Perl Based Data Mining Workflow For Animal Breeding - From Phenotype To SNP'. Together they form a unique fingerprint.

Cite this