link

March 2, Sunday
12:00 – 14:00

Using Information Retrieval for Large Scale Gene Analysis
Computer Science seminar
Lecturer : Hagit Shatkay
Affiliation : Informatics Research group, Celera/ABI
Location : -101/58
Host : Mayer Goldberg
Current genomic research has generated an immense volume of data and a tremendous increase in the number of gene-related publications. This wealth of information presents a major data analysis challenge. The ultimate goal is to understand the complex biological interrelationships among all discovered genes and proteins. Meeting this goal requires scanning the abundant literature about each gene and plenty of human expertise. As several research groups have recently noted, automated systems for extracting relevant information from the literature can complement existing techniques, speed up analysis, and greatly enhance our understanding of genetic processes.

We present a new approach, based on probabilistic information retrieval, which uses the literature to establish functional relationships among genes on a genome-wide scale. Experiments applied to documents discussing yeast genes, and a comparison of the results to well-established gene function, demonstrate the effectiveness of our approach.