TY - JOUR
T1 - A new web-based data mining tool for the identification of candidate genes for human genetic disorders
AU - van Driel, Marc A.
AU - Cuelenaere, Koen
AU - Kemmeren, Patrick P.C.W.
AU - Leunissen, Jack A.M.
AU - Brunner, Han G.
N1 - Funding Information:
Partly supported by grants from N.W.O./Unilever (grant number 326756 to JAM Leunissen) and from the Irene kinderziekenhuis Foundation (to HG Brunner).
PY - 2003/1/1
Y1 - 2003/1/1
N2 - To identify the gene underlying a human genetic disorder can be difficult and time-consuming. Typically, positional data delimit a chromosomal region that contains between 20 and 200 genes. The choice then lies between sequencing large numbers of genes, or setting priorities by combining positional data with available expression and phenotype data, contained in different internet databases. This process of examining positional candidates for possible functional clues may be performed in many different ways, depending on the investigator's knowledge and experience. Here, we report on a new tool called the GeneSeeker, which gathers and combines positional data and expression/phenotypic data in an automated way from nine different web-based databases. This results in a quick overview of interesting candidate genes in the region of interest. The GeneSeeker system is built in a modular fashion allowing for easy addition or removal of databases if required. Databases are searched directly through the web, which obviates the need for data warehousing. In order to evaluate the GeneSeeker tool, we analysed syndromes with known genesis. For each of 10 syndromes the GeneSeeker programme generated a shortlist that contained a significantly reduced number of candidate genes from the critical region, yet still contained the causative gene. On average, a list of 163 genes based on position alone was reduced to a more manageable list of 22 genes based on position and expression or phenotype information. We are currently expanding the tool by adding other databases. The GeneSeeker is available via the web-interface (http://www.cmbi.kun.nl/GeneSeeker/).
AB - To identify the gene underlying a human genetic disorder can be difficult and time-consuming. Typically, positional data delimit a chromosomal region that contains between 20 and 200 genes. The choice then lies between sequencing large numbers of genes, or setting priorities by combining positional data with available expression and phenotype data, contained in different internet databases. This process of examining positional candidates for possible functional clues may be performed in many different ways, depending on the investigator's knowledge and experience. Here, we report on a new tool called the GeneSeeker, which gathers and combines positional data and expression/phenotypic data in an automated way from nine different web-based databases. This results in a quick overview of interesting candidate genes in the region of interest. The GeneSeeker system is built in a modular fashion allowing for easy addition or removal of databases if required. Databases are searched directly through the web, which obviates the need for data warehousing. In order to evaluate the GeneSeeker tool, we analysed syndromes with known genesis. For each of 10 syndromes the GeneSeeker programme generated a shortlist that contained a significantly reduced number of candidate genes from the critical region, yet still contained the causative gene. On average, a list of 163 genes based on position alone was reduced to a more manageable list of 22 genes based on position and expression or phenotype information. We are currently expanding the tool by adding other databases. The GeneSeeker is available via the web-interface (http://www.cmbi.kun.nl/GeneSeeker/).
KW - Bioinformatics
KW - Candidate gene prediction
KW - Data mining
UR - http://www.scopus.com/inward/record.url?scp=0037265647&partnerID=8YFLogxK
U2 - 10.1038/sj.ejhg.5200918
DO - 10.1038/sj.ejhg.5200918
M3 - Article
C2 - 12529706
AN - SCOPUS:0037265647
SN - 1018-4813
VL - 11
SP - 57
EP - 63
JO - European Journal of Human Genetics
JF - European Journal of Human Genetics
IS - 1
ER -