HOME    TEACHING    RESEARCH    PUBLICATIONS    SEMINARS    PROSPECTIVE STUDENTS

Waterloo Logo
 

 


Research Topic: Software Frameworks for Parallel Bioinformatics on Computational Grids

'Computational grids' are spatially distributed heterogeneous collections of computing resources. For example, Ontario's SHARCNET is a powerful networked system of compute clusters spread out over Ontario's Universities.

If the computing power available in such grids can be unleashed effectively, large scientific problems can be solved that would be unsolvable using stand-alone machines.

In particular, many bioinformatics problems pose a dire computational challenge, both because the amount of data to be processed can be huge, and because the processing algorithms can be extremely complex.

In this project you will develop techniques that allow to apply the power of computational grids to complex parallel bioinformatics problems.

You will build upon the success of the TaskSpaces software framework for scientific computing on computational grids, and initial application to an RNA folding statistics problem, see also 'A Software Framework for Parallel Bioinformatics on Computational Grids' [pdf].

Several directions are possible for this project, and may include further development of a generally usable grid computing software framework for parallel bioinformatics, for instance allowing the use of problem specification in the Python language, which is becoming the lingua franca of bioinformatics. Scalability, fault-tolerance, and security are other important aspects for a grid computing framework that may be further investigated. Alternatively, you may prefer to work closer to bioinformatics applications, and may as such contribute to solving bioinformatics problems that require large computational resources. Some applications involve extensive use of combinatorics, search algorithms, and other applied mathematics techniques, which may be other interesting topics to pursue.

Computing resources provided by SHARCNET will be studied and employed extensively in this project.

Software development and bioinformatics applications will involve collaboration with Prof. Rob Knight of the MCDB Department of the University of Colorado at Boulder, USA. For the software environment development aspects of the project, a good knowledge of Java and/or Python and a strong interest in (parallel and grid) computing are required. The emphasis of your project can either be on the software framework development, or on parallel bioinformatics applications and the mathematics required to solve them, or both.

This topic is suitable for a PhD or Master's thesis. Please contact Hans De Sterck if you are interested.


Created by Hans De Sterck.
Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada.
Phone: 1-519-888-4567 ext 7550, Fax: 1-519-746-4319, E-mail: hdesterck@math.uwaterloo.ca.
Office: MC 5016. campus map