Ph.D
Group : Bioinformatics
Algorithmic Aspects of Genome Rearrangements: Duplications and Partial Orders
Starts on 01/10/2006
Advisor : DENISE, Alain
[VIALETTE, M. Stéphane]
Funding : Attaché temporaire d'enseignement et de recherche
Affiliation : Université Paris-Saclay
Laboratory : LRI
Defended on 06/11/2009, committee :
Philippe DAGUE
Alain DENISE
Sylvie HAMEL
Bernard MORET
Marie-France SAGOT
Stéphane VIALETTE
Research activities :
- Algorithms
- Bioinformatics
- Combinatory
- Complexity
Abstract :
Comparative genomics aims to better understand differences between species. Several methods for genome comparison exist; in this PhD, we have focused on the computation of three measures of (dis)similarities, namely the number of adjacencies, the number of breakpoints, and the number of commons intervals. In presence of duplicated genes or when the order of genes is only partially known, computing these measures is a NP-hard problem. Our contribution is twofold.
First, we want to compute the number of adjacencies and the number of breakpoints for three models (exemplar, maximum and the intermediate model introduced in this work) between two genomes with duplications. In order to obtain exact results, we have used a pseudo-boolean programming approach. After a test on 12 genomes of ?-proteobacteria, we got enough results to compare different combinations of measure/model. Additionally, we have proposed and evaluated (thanks to the above-mentioned results) a family of heuristics based on a search of a longest common subsequence, which gave very good results on these data.
In parallel, we proved that there exists no approximation algorithm (unless P=NP) to compute the number of adjacencies (on the exemplar model) and the number of breakpoints (on the exemplar and intermediate models).
Second, we set up a pseudo-boolean approach to compute the number of adjacencies and the number of common intervals between two partially ordered genomes. Using nearly 800 simulated genomes, we have studied the influence of parameters associated to partial orders and compared both measures.
More information: http://www.lri.fr/~thevenin/