Research

I am involved in research in bioinformatics, classification and data mining. I am the leader of the classification and data mining research group in the Statistics section. My full publication info is here

Presently, I am particularly interested in a number of things:

Statistical analysis for cell biology

Modern microscopy enables cell biologist to obtain high resolution 3D images of the cell nucleus. We are especially interested in the spatial configuration of sub-nuclear bodies, and the relationship of configuration to function. This raises new challenges for imaging and analysis. Some recent work

This is on-going collaboration with Paul Freemont and Dave Stephens.
We have recently been awarded a grant from the BBSRC to continue our work on the study of nuclear architecture. The project will start in October. More details to follow.

Streaming data methodology

Advances in data acquisition technology have enabled the continuous collection of data in a variety of contexts, notably sensor networks. Such contexts require data analysis algorithms that operate with a single view of each data item, and the have the capacity to adapt to changes in the underlying stochastic mechanism. Some recent work

This work is partially supported by the ALADDIN project.

Classification and data mining

Classification problems are ubiquitous, and are a good example of the effectiveness of Statistics in the real world. I am especially interested in complicated contexts, like credit card fraud detection. Outputs from the EPSRC ThinkCrime project include

Data mining involves various types of analysis on large data sets. We have been involved in defining the sub-area of "pattern detection and discovery", concerned with finding small local structures in large data sets. Some work in this area includes

And a recent review-type article on data mining: