Invited Talk:
Designing Filtering Strategies for Faster Protein and RNA
|
|
| Speaker: |
Yanni Sun
|
| When: |
Thursday, Feb 7th, 2008 |
| Time: |
2:00pm |
| Where: |
ECS 243 |
|
Abstract:
With the availability of sequenced genomes for multiple species, an urgent task today is to decipher the biological functions of these sequences. Annotating genomic sequence function helps us understand the genetic background of complex diseases and thus aids drug design. The state-of-the-art method for function annotation is to compare a query sequence against database of sequences with known functions. However, the high computational cost of comparison algorithms and the sheer amount of genomic data pose a great challenge for genome function analysis. For example, it takes several CPU months to compare a bacterial genome with a database of noncoding RNA sequence families.
In this talk, I will present systematic filter design methods for accelerating protein and noncoding RNA function annotation. A filter excludes a large portion of the database that is unlikely to be related to the query and hence comparisons are only conducted on regions with functional similarity. The computational challenge lies in designing filters with optimal tradeoff between sensitivity and specificity from a large design space. I will first present our filters based on regular expression patterns and weight matrices for protein annotation. Then, I will focus on designing secondary structure profiles to accelerate noncoding RNA annotation. I will demonstrate that, by using our designed filters, a protein sequence annotation program based on profile hidden Markov model can obtain 20 to 35 times speedup and a noncoding RNA annotation program based on stochastic context-free grammar can obtain 200 times speedup on average. I will conclude with an overview of my research interests and plan of future works.
Biography:
Yanni Sun is a Ph.D. candidate in the Department of Computer Science and Engineering at Washington University in St. Louis. She received her BS and MS degrees in Computer Science from Xi'an Jiao Tong University, China, in 1998 and 2001 respectively. Her research interests include Bioinformatics/computational biology, pattern design, and similarity search in large-scale databases. She has published her work in Bioinformatics, Journal of Computational Biology, BMC Bioinformatics, as well as at major Bioinformatics conferences such as Annual International Conference on Computational Molecular Biology (RECOMB) and European Conference on Computational Biology (ECCB).
|